METHODS OF DIAGNOSIS OF BREAST CANCER, COMPOSITIONS
AND METHODS OF SCREENING FOR MODULATORS OF BREAST
CANCER

CROSS-REFERENCES TO RELATED APPLICATIONS

This application claims priority to USSN 60/263,965, filed January 24, 2001;
USSN 60/265,928, filed February 2, 2001; USSN 09/829,472, filed April 9, 2001; USSN
60/282,698, filed April 9, 2001; USSN 60/288,590, filed May 4, 2001; and USSN [...]
entirety.

FIELD OF THE INVENTION 
The invention relates to the identification of nucleic acid and protein [...]
and nucleic acids, products, and antibodies thereto [...]
diagnosis and therapy of breast cancer. The invention further relates to methods for
identifying and using agents and/or targets that inhibit breast cancer.

BACKGROUND OF THE INVENTION
Breast cancer is one of the most frequently diagnosed cancers and the second
leading cause of female cancer death in North America and northern Europe, with lung
[...] in eight with a 1-in-29 lifetime risk of dying from breast cancer. Early detection of
breast cancer, using mammography, clinical breast examination, and self breast examination,
has [...] hormone therapy, [...] induce serious side effects which diminish quality of life,
[...] treated, the current therapies can all induce serious side [...]

[...] and markers (Fitzgibbons et al., 2000, Arch. Pathol. Lab. Med. 124:966-978;
Hamilton and Piccart, 2000, Ann. Oncol. 11:647-663), including genetic predisposition
markers BRCA-1 and BRCA-2 (Robson, 2000, J. Clin. Oncol. [...]

[...] imaging of breast cancer for diagnosis has been problematic and limited. In
addition, dissemination of tumor cells (metastases) to locoregional lymph nodes is an
[...] lymph node metastases, to 40 to 50 percent in those patients who do have lymph node
metastases. A recent report showed that micrometastases can be detected from lymph nodes
using reverse transcriptase-PCR methods based on the presence of mRNA for
carcinoembryonic antigen, which has previously been shown to be present in the vast
majority of breast cancers but not in normal tissues. Liefers et al., New England J. of Med.
339(4):223 (1998).

The identification of novel therapeutic targets and diagnostic markers [...]
molecular medicine have increased the interest in tumor-specific cell surface antigens that
could serve as targets for various immunotherapeutic or small molecule strategies. Antigens
[...] ideally not expressed in normal adult tissues. Expression in tissues that are [...]
may be tolerated. Examples of such antigens include Her2/neu and the B-cell
antigen CD20. Humanized monoclonal antibodies directed to Her2/neu [...]
(Ross and Fletcher, 1998, Stem Cells 16:413-428). Similarly, [...]
are used to effectively treat non-Hodgkin's [...]
(Maloney et al., 1997, Blood 90:2188-2195; Leget and Czuczman, 1998, Curr. Opin. Oncol.
10:548-551). [...] targets have been identified for breast
cancer. One such target is polymorphic epithelial mucin (MUC1). MUC1 is a transmembrane
[...] antigenically distinct [...]
et al., 2000, Clin. Cancer Res. [...]
[...] Clin. Chem. 43:585-593). However, many patients
[...] tumor marker, CA 15-3 [...] or MUC-1; therefore, it is clear that other targets
[...] tumors that express neither HER2 nor MUC1 [...]
[...] 1987, Lancet 1(8547):1398-1402), c-erbB3 ([...], 2000, [...]
Anticancer Res. 20:2255-2264), MTA1 (Nawa et al., 2000, [...]
[...] 2000, Breast Cancer Res. Treat. [...]
genes have been reported (for review, see Green et al., 2000, Cancer [...]
[...] cancer are found in international patent applications WO-99/33869, WO-97/25426,
WO-97/02280 and WO-00/55173, WO-98/45328 and WO-00/22130. Similarly, genes and ESTs
[...] 5,693,522. The utility of such genes is described in each of these [...]

[...] elucidation of a role for novel proteins and compounds in [...]
intervention in breast and other cancers [...] methods that can
be used in diagnosis and prognosis of breast cancer. Further provided [...]
be used to screen candidate bioactive agents for the ability to modulate breast cancer.

SUMMARY OF THE INVENTION
The present invention therefore provides nucleotide sequences of genes that
are up- and down-regulated [...]
purposes and also as targets for screening for [...] compounds that modulate breast
cancer, such as hormones or antibodies. Other aspects of the invention will become apparent
to the skilled artisan by the following description of the invention.

[...] a breast cancer-associated transcript in a cell from a patient, the method comprising
contacting a biological sample from the patient with a polynucleotide that selectively
hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1-25.

In one embodiment, the present invention provides a method of determining
the level of a breast cancer-associated transcript in a cell from a patient.

In one embodiment, the present invention provides a method of detecting a
breast cancer-associated transcript in a cell from a patient, the method comprising contacting
a biological sample from the patient with a polynucleotide that selectively hybridizes to a
sequence at least 80% identical to a sequence as shown in Tables 1-25.

In one embodiment, the polynucleotide selectively hybridizes to a sequence at 
least 95% identical to a sequence as shown in Tables 1-25. 

In one embodiment, the biological sample is a tissue sample. In another 
embodiment, the biological sample comprises isolated nucleic acids, e.g., mRNA. 

In one embodiment, the polynucleotide is labeled, e.g., with a fluorescent
label.

In one embodiment, the polynucleotide is immobilized on a solid surface. 

In one embodiment, the patient is undergoing a therapeutic regimen to treat 
breast cancer. In another embodiment, the patient is suspected of having metastatic breast 
cancer. 

In one embodiment, the patient is a human. 

In one embodiment, the breast cancer-associated transcript is mRNA.

[...] sequence at least 80% identical to a sequence as shown in Tables 1-25. [...]

In one embodiment, the method further comprises the step of: [...] comparing
[...] treatment.

Additionally, provided herein is a method of evaluating the effect of a
candidate breast cancer drug [...]
cell sample from the patient. The expression profile of the cell is then determined. This
method may further comprise comparing the expression profile to an expression profile of a
healthy individual. In a preferred embodiment, said expression profile includes a gene of
Tables 1-25.

[...] consisting of a polynucleotide sequence as shown in Tables 1-25.

In one embodiment, a [...] acid.

In one aspect, the present invention provides an isolated polypeptide which is
encoded by a nucleic acid molecule having polynucleotide sequence as shown in Tables 1-25.

In another aspect, the present invention provides an antibody that specifically
binds to an isolated polypeptide which is encoded by a nucleic acid molecule having
polynucleotide sequence as shown in Tables 1-25.

In one embodiment, the antibody is conjugated to an effector component, e.g.,
a fluorescent label, a radioisotope or a cytotoxic chemical.

In one embodiment, the antibody is an antibody fragment. In another
embodiment, the antibody is humanized.

In one aspect, the present invention provides a method of detecting a breast
cancer cell in a biological sample from a patient, the method comprising contacting the
biological sample with an antibody as described herein.

In another aspect, the present invention provides a method of detecting
antibodies specific to breast cancer in a patient, the method comprising contacting a
biological sample from the patient with a polypeptide encoded by a nucleic acid comprising a
sequence from Tables 1-25.

In another aspect, the present invention provides a method for identifying a
compound that modulates a breast cancer-associated polypeptide, the method comprising the
steps of: (i) contacting the compound with a breast cancer-associated polypeptide, the
polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at least
80% identical to a sequence as shown in Tables 1-25; and (ii) determining the functional
effect of the compound upon the polypeptide.

In one embodiment, the functional effect is a physical effect, an enzymatic
effect, or a chemical effect.

In one embodiment, the polypeptide is expressed in a eukaryotic host cell or
cell membrane. In another embodiment, the polypeptide is recombinant.

In one embodiment, the functional effect is determined by measuring ligand 
binding to the polypeptide. 

In another aspect, the present invention provides a method of inhibiting
proliferation of a breast cancer-associated cell to treat breast cancer in a patient, the method
comprising the step of administering to the subject a therapeutically effective amount of a
compound identified as described herein.

In one embodiment, the compound is an antibody.

In another aspect, the present invention provides a drug screening assay
comprising the steps of: (i) administering a test compound to a mammal having breast cancer
or to a cell sample isolated therefrom; (ii) comparing the level of expression of [...]
as shown in Tables 1-25 in a treated cell or mammal with the level of gene [...]

In one embodiment, the control is a mammal with breast cancer or a cell
sample therefrom that has not been treated with the test compound. In another embodiment,
the control is a normal cell or mammal.

In one embodiment, the test compound is administered in varying amounts or 
concentrations. In another embodiment, the test compound is administered for varying time 
periods. In another embodiment, the comparison can occur after addition or removal of the 
drug candidate. 

In one embodiment, the levels of a plurality of polynucleotides that selectively 
hybridize to a sequence at least 80% identical to a sequence as shown in Tables 1-25 are 
individually compared to their respective levels in a control cell sample or mammal. In a 
preferred embodiment the plurality of polynucleotides is from three to ten. 
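
The comparison of transcript levels against a control, as described in the preceding embodiments, can be illustrated with a minimal sketch. The gene names, the pseudocount, and the two-fold threshold below are hypothetical choices made for illustration only; the specification does not prescribe a particular statistic.

```python
import math

def log2_fold_changes(sample, control, pseudocount=1.0):
    """Return log2(sample/control) for every gene present in both profiles.

    The pseudocount (an assumption of this sketch) avoids division by zero
    for transcripts that are not detected in one of the samples.
    """
    return {
        gene: math.log2((sample[gene] + pseudocount) / (control[gene] + pseudocount))
        for gene in sample
        if gene in control
    }

def classify(fold_changes, threshold=1.0):
    """Label each gene 'up', 'down', or 'unchanged' relative to the control.

    A threshold of 1.0 on the log2 scale corresponds to a two-fold change.
    """
    labels = {}
    for gene, lfc in fold_changes.items():
        if lfc >= threshold:
            labels[gene] = "up"
        elif lfc <= -threshold:
            labels[gene] = "down"
        else:
            labels[gene] = "unchanged"
    return labels

# Hypothetical expression levels for three genes in a treated and a control sample.
treated = {"geneA": 79.0, "geneB": 9.0, "geneC": 31.0}
control = {"geneA": 19.0, "geneB": 39.0, "geneC": 31.0}
print(classify(log2_fold_changes(treated, control)))
```

Each gene is compared individually to its own control level, which mirrors the "individually compared to their respective levels" language above; a real analysis would also need replicate samples and a significance test.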

In another aspect, the present invention provides a method for treating a
mammal having breast cancer comprising administering a compound identified by the assay
described herein.

In another aspect, the present invention provides a pharmaceutical 
composition for treating a mammal having breast cancer, the composition comprising a 
compound identified by the assay described herein and a physiologically acceptable 
excipient. 

In one aspect, the present invention provides a method of screening drug 
candidates by providing a cell expressing a gene that is up- and down-regulated as in a breast 
cancer. In one embodiment, a gene is selected from Tables 1-25. The method further 
includes adding a drug candidate to the cell and determining the effect of the drug candidate 
on the expression of the expression profile gene. 

PCT/US02/02242
WO 02/059377

In one embodiment, the method of screening drug candidates includes
comparing the level of expression in the absence of the drug candidate to the level of
expression in the presence of the drug candidate, wherein the concentration of the drug
candidate can vary when present, and wherein the comparison can occur after addition or
removal of the drug candidate. [...]
expression profile genes. The profile genes may show an increase or decrease.
[...] of a candidate breast cancer [...]
the breast cancer modulatory protein, or an animal lacking the breast cancer modulatory
protein, for example as a result of a gene knockout.

Moreover, provided herein is a biochip comprising one or more nucleic acid
segments of Tables 1-25, wherein the biochip comprises fewer than 1000 nucleic acid probes.
Preferably, at least two nucleic acid segments are included. More preferably, at least three
nucleic acid segments are included.

Furthermore, a method of diagnosing a disorder associated with breast cancer
is provided. The method comprises determining the expression of a gene of Tables 1-25,
preferably a gene of Table 25, in a first tissue type of a first individual, and comparing the
distribution to the expression of the gene from a second normal tissue type from the first
individual or a second unaffected individual. A difference in the expression indicates that the
first individual has a disorder associated with breast cancer.

In a further embodiment, the biochip also includes a polynucleotide sequence
of a gene that is not up- and down-regulated in breast cancer.

In one embodiment, a method is provided for screening for a bioactive agent capable of
interfering with the binding of a breast cancer modulating protein (breast cancer modulatory
protein) or a fragment thereof and an antibody which binds to said breast cancer modulatory
protein or fragment thereof. In a preferred embodiment, the method comprises combining a
breast cancer modulatory protein or fragment thereof, a candidate bioactive agent and an
antibody which binds to said breast cancer modulatory protein or fragment thereof. The
method further includes determining the binding of said breast cancer modulatory protein or
fragment thereof and said antibody. Wherein there is a change in binding, an agent is
identified as an interfering agent. The interfering agent can be an agonist or an antagonist.
Preferably, the agent inhibits breast cancer.

Also provided herein are methods of eliciting an immune response in an
individual. In one embodiment a method provided herein comprises administering to an 
individual a composition comprising a breast cancer modulating protein, or a fragment 
thereof. In another embodiment, the protein is encoded by a nucleic acid selected from those 
of Tables 1-25. 

Further provided herein are compositions capable of eliciting an immune 
response in an individual. In one embodiment, a composition provided herein comprises a 
breast cancer modulating protein, preferably encoded by a nucleic acid of Tables 1-25, more 
preferably of Table 25, or a fragment thereof, and a pharmaceutically acceptable carrier. In
another embodiment, said composition comprises a nucleic acid comprising a sequence 
encoding a breast cancer modulating protein, preferably selected from the nucleic acids of 
Tables 1-25, and a pharmaceutically acceptable carrier. 

Also provided are methods of neutralizing the effect of a breast cancer protein, 
or a fragment thereof, comprising contacting an agent specific for said protein with said 
protein in an amount sufficient to effect neutralization. In another embodiment, the protein is 
encoded by a nucleic acid selected from those of Tables 1-25. 

In another aspect of the invention, a method of treating an individual for breast 
cancer is provided. In one embodiment, the method comprises administering to said 
individual an inhibitor of a breast cancer modulating protein. In another embodiment, the 
method comprises administering to a patient having breast cancer an antibody to a breast 
cancer modulating protein conjugated to a therapeutic moiety. Such a therapeutic moiety can 
be a cytotoxic agent or a radioisotope. 

DETAILED DESCRIPTION OF THE INVENTION 
In accordance with the objects outlined above, the present invention provides 
novel methods for diagnosis and prognosis evaluation for breast cancer (BC), including
metastatic breast cancer, as well as methods for screening for compositions which modulate 
breast cancer. Also provided are methods for treating breast cancer. 

Tables 1-24B provide unigene cluster identification numbers for the
nucleotide sequence of genes that exhibit increased or decreased expression in breast cancer
samples. Tables 4, 5, 6, 7, 8, [...] 14 list those genes that are highly upregulated in
[...] 19, 20, 21, and 22 list those genes that are [...]
[...] also provide [...]
that is part of the unigene cluster.

Definitions

The term "breast cancer protein" or "breast cancer polynucleotide" or "breast
cancer-associated transcript" refers to nucleic acid and polypeptide polymorphic variants,
alleles, mutants, and interspecies homologues that: (1) have a nucleotide sequence that has
greater than about 60% nucleotide sequence identity, 65%, 70%, 75%, 80%, 85%, 90%,
preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% or greater nucleotide
sequence identity, preferably over a region of at least about 25, 50, 100, 200,
500, 1000, or more nucleotides, to a nucleotide sequence of or associated with a gene of
Tables 1-25; (2) bind to antibodies, e.g., polyclonal antibodies, raised against an immunogen
comprising an amino acid sequence encoded by a nucleotide sequence of or associated with a
gene of Tables 1-25, and conservatively modified variants thereof; (3) specifically hybridize
under stringent hybridization conditions to a nucleic acid sequence, or the complement
thereof, of Tables 1-25 and conservatively modified variants thereof; or (4) have an amino acid
sequence that has greater than about 60% amino acid sequence identity, 65%, 70%, 75%,
80%, 85%, 90%, preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% or greater
amino acid sequence identity, preferably over a region of at least about 25, 50,
100, 200, 500, 1000, or more amino acids, to an amino acid sequence encoded by a nucleotide
sequence of or associated with a gene of Tables 1-25. A polynucleotide or polypeptide
sequence is typically from a mammal including, but not limited to, primate, e.g., human;
rodent, e.g., rat, mouse, hamster; cow, pig, horse, sheep, or other mammal. A "breast cancer
polypeptide" and a "breast cancer polynucleotide" include both naturally occurring or
recombinant forms.

[...] contains all of the elements [...]
polynucleotide or polypeptide sequences. The "full length" may [...]

[...] tissues such as biopsy and autopsy samples, frozen sections taken for [...]
biological sample is typically obtained from a eukaryotic organism, most preferably a
mammal such as a primate, e.g., chimpanzee or human; cow; dog; cat; a rodent, e.g., guinea
pig, rat, mouse; rabbit; or a bird; reptile; or fish.

"Providing a biological sample" means to obtain a biological sample for use in
methods described in this invention. Most often, this will be done by removing a sample of
cells from an animal, but can also be accomplished by using previously isolated cells (e.g.,
isolated by another person, at another time, and/or for another purpose), or by performing the
methods of the invention in vivo. Archival tissues, having treatment or outcome history, will
be particularly useful.

The terms "identical" or percent "identity," in the context of two or more
nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that
are the same or have a specified percentage of ([...]
95%, 96%, 97%, 98%, 99%, or higher identity over a specified region, when compared and
aligned for maximum correspondence over a comparison window or designated region) as
measured using a BLAST or BLAST 2.0 sequence comparison algorithm with default
parameters described below, or by manual alignment and visual inspection (see, e.g., NCBI
web site http://w[...]). Such sequences are then said to
be "substantially identical." This definition also refers to, or may be applied to, the
complement of a test sequence. The definition also includes sequences that have deletions
and/or additions, as well as those that have substitutions, as well as naturally occurring, e.g.,
polymorphic or allelic variants, and man-made variants. As described below, the preferred
algorithms can account for gaps and the like. Preferably, identity exists over a region that is
at least about 25 amino acids or nucleotides in length, or more preferably over a region that is
50-100 amino acids or nucleotides in length.
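
As an illustration of the percent identity measure defined above, the following minimal sketch computes identity over two sequences that have already been aligned. It assumes gaps are written as "-" and that the denominator is the number of aligned columns (excluding columns where both sequences have a gap); other conventions for the denominator exist, and the function names are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of aligned columns in which both sequences carry the same residue.

    Both inputs must be the same length, i.e. already aligned. Columns that
    are gaps in both sequences are ignored; a gap opposite a residue counts
    as a compared, non-identical column.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == gap and y == gap:
            continue
        compared += 1
        if x == y:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0

def meets_identity_threshold(aligned_a, aligned_b, threshold=80.0):
    """True if the aligned pair is at least `threshold` percent identical."""
    return percent_identity(aligned_a, aligned_b) >= threshold

# One mismatch in ten aligned positions.
print(percent_identity("ACGTACGTAC", "ACGTTCGTAC"))
print(meets_identity_threshold("ACGTACGTAC", "ACGTTCGTAC", threshold=95.0))
```

The same function applies to nucleotide or amino acid alignments, since it only compares characters column by column.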

For sequence comparison, typically one sequence acts as a reference sequence,
to which test sequences are compared. When using a sequence comparison algorithm, test
and reference sequences are entered into a computer, subsequence coordinates are designated,
if necessary, and sequence algorithm program parameters are designated. Preferably, default
program parameters can be used, or alternative parameters can be designated. The sequence
comparison algorithm then calculates the percent sequence identities for the test sequences
relative to the reference sequence, based on the program parameters.
A "comparison window", as used herein, includes reference to a segment of
one of the number of contiguous positions selected from the group consisting typically of
from 20 to 600, usually about 50 to about 200, more usually about 100 to about 150 in which
a sequence may be compared to a reference sequence of the same number of contiguous
positions after the two sequences are optimally aligned. Methods of alignment of sequences
for comparison are well-known in the art. Optimal alignment of sequences for comparison
can be conducted, e.g., by the local homology algorithm of Smith & Waterman, Adv. Appl.
Math. 2:482 (1981), by the homology alignment algorithm of Needleman & Wunsch, J. Mol.
Biol. 48:443 (1970), by the search for similarity method of Pearson & Lipman, Proc. Nat'l.
Acad. Sci. USA 85:2444 (1988), by computerized implementations of these algorithms
(GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package,
Genetics Computer Group, 575 Science Dr., Madison, WI), or by manual alignment and
visual inspection (see, e.g., Current Protocols in Molecular Biology (Ausubel et al., eds.
1995 supplement)).
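
For orientation, the local homology approach of Smith & Waterman cited above can be sketched as a short dynamic-programming routine. This is a simplified illustration that returns only the best local alignment score and uses a linear gap penalty; the scoring values (match 2, mismatch -1, gap -2) are assumptions of this sketch and are not taken from the specification or from the cited software packages.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Best local alignment score between sequences a and b.

    Uses the Smith-Waterman recurrence with a linear gap penalty. Only two
    rows of the scoring matrix are kept, since no traceback is performed.
    """
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diagonal = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A cell never goes below zero: a local alignment may restart anywhere.
            curr[j] = max(0, diagonal, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best

print(smith_waterman_score("TTACGTTT", "GGACGTGG"))
```

The global method of Needleman & Wunsch differs mainly in that cells are not clamped at zero and the score is read from the final cell rather than from the matrix maximum.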

Preferred examples of algorithms that are suitable for determining percent
sequence identity and sequence similarity include the BLAST and BLAST 2.0 algorithms,
which are described in Altschul et al., Nuc. Acids Res. 25:3389-3402 (1997) and Altschul et
al., J. Mol. Biol. 215:403-410 (1990). BLAST and BLAST 2.0 are used, with the parameters
[...] the invention. Software for performing BLAST analyses is publicly available through the
National Center for Biotechnology Information [...]. This
algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short
words of length W in the query sequence, which either match or satisfy some positive-valued
threshold score T when aligned with a word of the same length in a database sequence. T is
referred to as the neighborhood word score threshold (Altschul et al., supra). These initial
neighborhood word hits act as seeds for initiating searches to find longer HSPs containing
them. The word hits are extended in both directions along each sequence for as far as the
cumulative alignment score can be increased. Cumulative scores are calculated using, e.g.,
for nucleotide sequences, the parameters M (reward score for a pair of matching residues;
always > 0) and N (penalty score for mismatching residues; always < 0). For amino acid
sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word
hits in each direction is halted when: the cumulative alignment score falls off by the
quantity X from its maximum achieved value; the cumulative score goes to zero or below,
due to the accumulation of one or more negative-scoring residue alignments; or the end of
either sequence is reached. The BLAST algorithm parameters W, T, and X determine the
sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences)
uses as defaults a wordlength (W) of 11, an expectation (E) of 10, M=5, N=-4 and a
comparison of both strands. For amino acid sequences, the BLASTP program uses as
defaults a wordlength of 3, and expectation (E) of 10, and the BLOSUM62 scoring matrix
(see Henikoff & Henikoff, Proc. Natl. Acad. Sci. USA 89:10915 (1992)) alignments (B) of
50, expectation (E) of 10, M=5, N=-4, and a comparison of both strands.
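
The halting rules for word-hit extension described above can be made concrete with a small sketch. It extends a seed in one direction only and without gaps, using the M=5 and N=-4 values stated in the text; the function name, the X value, and the toy sequences are hypothetical, and this is not the BLAST implementation itself.

```python
def extend_hit_right(query, subject, q_pos, s_pos, seed_score,
                     reward=5, penalty=-4, x_drop=20):
    """Extend an ungapped word hit to the right.

    q_pos and s_pos index the first residue after the seed word in each
    sequence. Returns (best_score, residues_added_at_best_score).
    """
    score = best = seed_score
    best_len = 0
    offset = 0
    # Stop when the end of either sequence is reached.
    while q_pos + offset < len(query) and s_pos + offset < len(subject):
        same = query[q_pos + offset] == subject[s_pos + offset]
        score += reward if same else penalty
        offset += 1
        if score > best:
            best, best_len = score, offset
        # Halt when the score has dropped by X from its maximum,
        # or when the cumulative score reaches zero or below.
        if best - score >= x_drop or score <= 0:
            break
    return best, best_len

query = "ACGTACGTTTTT"
subject = "ACGTACGTGGGG"
word = 4  # hypothetical seed word length for this toy example
seed_score = word * 5
print(extend_hit_right(query, subject, word, word, seed_score, x_drop=10))
```

A full extension would run the same loop leftward from the start of the seed and combine the two results; a larger X explores further past mismatches at the cost of speed, which is the sensitivity and speed trade-off noted above.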

The BLAST algorithm also performs a statistical analysis of the similarity
between two sequences (see, e.g., Karlin & Altschul, Proc. Nat'l. Acad. Sci. USA
90:5873-5877 (1993)). One measure of similarity provided by the BLAST algorithm is the smallest
sum probability (P(N)), which provides an indication of the probability by which a match
between two nucleotide or amino acid sequences would occur by chance. For example, a
[...] comparison of the test nucleic acid to the reference nucleic acid is less than about 0.2, more
[...] be large negative numbers, e.g., 5, 10, 20, 30, 40, 40, 70, 90, 110, 150, 170, etc.

An indication that two nucleic acid sequences or polypeptides are substantially
identical is that the polypeptide encoded by the first nucleic acid is immunologically cross
reactive [...] acid, as described below. Thus, a polypeptide is typically substantially identical
to a second polypeptide, e.g., where the two peptides differ only by conservative substitutions.
Another indication that two nucleic acid sequences are substantially identical is that the two
molecules or their complements hybridize to each other under stringent conditions, as described
below. Yet another indication that two nucleic acid sequences are substantially identical is that
the same primers can be used to amplify the sequences.

A "host cell" is a naturally occurring cell or a transformed cell that contains an
expression vector and supports the replication or expression of the expression vector. Host
cells may be cultured cells, explants, cells in vivo, and the like. Host cells may be
prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, insect, amphibian, or
mammalian cells such as CHO, HeLa, and the like (see, e.g., the American Type Culture
Collection catalog or web site, www.atcc.org).

The terms "isolated," "purified," or "biologically pure" refer to material that is
substantially or essentially free from components that normally accompany it as found in its
native state. Purity and homogeneity are typically determined using analytical chemistry
techniques such as polyacrylamide gel electrophoresis or high performance liquid
chromatography. A protein or nucleic acid that is the predominant species present in a
preparation is substantially purified. In particular, an isolated nucleic acid is separated from
some open reading frames that naturally flank the gene and encode proteins other than protein
encoded by the gene. The term "purified" in some embodiments denotes that a nucleic acid
or protein gives rise to essentially one band in an electrophoretic gel. Preferably, it means
that the nucleic acid or protein is at least 85% pure, more preferably at least 95% pure, and
most preferably at least 99% pure. "Purify" or "purification" in other embodiments means
removing at least one contaminant from the composition to be purified. In this sense,
purification does not require that the purified compound be homogenous, e.g., 100% pure.

The terms "polypeptide," "peptide" and "protein" are used interchangeably
herein to refer to a polymer of amino acid residues. The terms apply to amino acid polymers
in which one or more amino acid residue is an artificial chemical mimetic of a corresponding
naturally occurring amino acid, as well as to naturally occurring amino acid polymers, those
containing modified residues, and non-naturally occurring amino acid polymers.

The term "amino acid" refers to naturally occurring and synthetic amino acids,
as well as amino acid analogs and amino acid mimetics that function similarly to the naturally
occurring amino acids. Naturally occurring amino acids are those encoded by the genetic
code, as well as those amino acids that are later modified, e.g., hydroxyproline,
γ-carboxyglutamate, and O-phosphoserine. Amino acid analogs refers to compounds that have
the same basic chemical structure as a naturally occurring amino acid, e.g., an α carbon that is
bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine,
norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs may have
modified R groups (e.g., norleucine) or modified peptide backbones, but retain the same basic
chemical structure as a naturally occurring amino acid. Amino acid mimetics refers to
chemical compounds that have a structure that is different from the general chemical
structure of an amino acid, but that functions similarly to a naturally occurring amino acid.

Amino acids may be referred to herein by either their commonly known three
letter symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical
Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly
accepted single-letter codes.

"Conservatively modified variants" applies to both amino acid and nucleic
acid sequences. With respect to particular nucleic acid sequences, conservatively modified
variants refers to those nucleic acids which encode identical or essentially identical amino
acid sequences, or where the nucleic acid does not encode an amino acid sequence, to
essentially identical or associated, e.g., naturally contiguous, sequences. Because of the
degeneracy of the genetic code, a large number of functionally identical nucleic acids encode
most proteins. For instance, the codons GCA, GCC, GCG and GCU all encode the amino
acid alanine. [...]
polypeptide also describes silent variations of the nucleic acid. One of skill will recognize that
[...] each codon in a nucleic acid (except AUG, which is ordinarily the
only codon for methionine, and TGG, which is ordinarily the only codon for tryptophan) can
[...] a nucleic acid which encodes a polypeptide is implicit in a described sequence with respect to
the expression product, but not with respect to actual probe sequences.
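
The degeneracy of the genetic code referred to above can be checked mechanically. The sketch below builds the standard genetic code as an RNA codon table and tests whether two coding sequences differ only by synonymous codons; the function names are hypothetical, and the table assumed is the standard nuclear code.

```python
from itertools import product

# Standard genetic code, with codons enumerated in U, C, A, G order at each
# position (third position varying fastest). '*' marks a stop codon.
BASES = "UCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(rna):
    """Translate an RNA coding sequence codon by codon (trailing bases ignored)."""
    usable = len(rna) - len(rna) % 3
    return "".join(CODON_TABLE[rna[i:i + 3]] for i in range(0, usable, 3))

def synonymous_codons(codon):
    """All codons that encode the same amino acid as `codon`, sorted."""
    aa = CODON_TABLE[codon]
    return sorted(c for c, a in CODON_TABLE.items() if a == aa)

def is_silent_variation(rna_a, rna_b):
    """True if the sequences differ but encode the same polypeptide."""
    return rna_a != rna_b and translate(rna_a) == translate(rna_b)

print(synonymous_codons("GCA"))   # the four alanine codons
print(synonymous_codons("AUG"))   # methionine has a single codon
print(is_silent_variation("AUGGCAUGG", "AUGGCCUGG"))
```

Sequences written with T instead of U (as in TGG above) would need to be converted to the RNA alphabet before lookup in this table.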

As to amino acid sequences, one of skill will recognize that individual
substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein
sequence which alters, adds or deletes a single amino acid or a small percentage of amino
acids in the encoded sequence is a "conservatively modified variant" where the alteration
results in the substitution of an amino acid with a chemically similar amino acid.
Conservative substitution tables providing functionally similar amino acids are well known in
the art. Such conservatively modified variants are in addition to and do not exclude
polymorphic variants, interspecies homologs, and alleles of the invention. The following
groups are typically conservative substitutions for one another: 1) Alanine (A), Glycine (G);
2) Aspartic acid (D), Glutamic acid (E); 3) Asparagine (N), Glutamine (Q); 4) Arginine (R),
Lysine (K); 5) Isoleucine (I), Leucine (L), Methionine (M), Valine (V); 6) Phenylalanine (F),
Tyrosine (Y), Tryptophan (W); 7) Serine (S), Threonine (T); and 8) Cysteine (C),
Methionine (M) (see, e.g., Creighton, Proteins (1984)).
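
The substitution groups listed above lend themselves to a simple lookup. The following sketch classifies each difference between two equal-length protein sequences as conservative or non-conservative; groups 3 and 4 are taken here as asparagine/glutamine and arginine/lysine, which is an assumption based on the commonly cited Creighton grouping, and the function names and example peptides are hypothetical.

```python
# Eight groups of one-letter amino acid codes that are treated as
# conservative substitutions for one another. Methionine appears in two
# groups, as in the list above.
CONSERVATIVE_GROUPS = [
    set("AG"), set("DE"), set("NQ"), set("RK"),
    set("ILMV"), set("FYW"), set("ST"), set("CM"),
]

def is_conservative(ref, alt):
    """True if ref and alt differ and share at least one group."""
    return ref != alt and any(ref in g and alt in g for g in CONSERVATIVE_GROUPS)

def classify_substitutions(reference, variant):
    """List (position, ref, alt, kind) for every differing position (1-based)."""
    if len(reference) != len(variant):
        raise ValueError("sequences must be the same length")
    changes = []
    for pos, (r, v) in enumerate(zip(reference, variant), start=1):
        if r != v:
            kind = "conservative" if is_conservative(r, v) else "non-conservative"
            changes.append((pos, r, v, kind))
    return changes

print(classify_substitutions("MKTAIL", "MRTGIW"))
```

Insertions and deletions are outside the scope of this sketch; handling them would require aligning the sequences first.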

Macromolecular structures such as polypeptide structures can be described in
terms of various levels of organization. For a general discussion of this organization, see,
e.g., Alberts et al., Molecular Biology of the Cell, and Cantor and Schimmel,
Biophysical Chemistry Part I: The Conformation of Biological Macromolecules (1980).
"Primary structure" refers to the amino acid sequence of a particular peptide. "Secondary
structure" refers to locally ordered, three dimensional structures within a polypeptide. These
structures are commonly known as domains. Domains are portions of a polypeptide that
often form a compact unit of the polypeptide and are typically 25 to approximately 500
amino acids long. Typical domains are made up of sections of lesser organization such as
stretches of β-sheet and α-helices. "Tertiary structure" refers to the complete three
dimensional structure of a polypeptide monomer. "Quaternary structure" refers to the three
dimensional structure formed, usually by the noncovalent association of independent tertiary
units. Anisotropic terms are also known as energy terms.

"Nucleic acid" or "oligonucleotide" or "polynucleotide" or grammatical
equivalents used herein means at least two nucleotides covalently linked together.
Oligonucleotides are typically from about 5, 6, 7, 8, 9, 10, 12, 15, 25, 30, 40, 50 or more
nucleotides in length, up to about 100 nucleotides in length. Nucleic acids and
polynucleotides are polymers of any length, including longer lengths, e.g., 200, 300, 500,
1000, 2000, 3000, 5000, 7000, 10,000, etc. A nucleic acid of the present invention will
generally contain phosphodiester bonds, although in some cases, nucleic acid analogs are
included that may have alternate backbones, comprising, e.g., phosphoramidate,
phosphorothioate, phosphorodithioate, or O-methylphosphoroamidite linkages (see Eckstein,
Oligonucleotides and Analogues: A Practical Approach, Oxford University Press); and
peptide nucleic acid backbones and linkages. Other analog nucleic acids include those with
positive backbones; non-ionic backbones, and non-ribose backbones, including those
described in U.S. Patent Nos. 5,235,033 and 5,034,506, and Chapters 6 and 7, ACS
Symposium Series 580, Carbohydrate Modifications in Antisense Research, Sanghvi &
Cook, eds. Nucleic acids containing one or more carbocyclic sugars are also included within
one definition of nucleic acids. Modifications of the ribose-phosphate backbone may be done
for a variety of reasons, e.g., to increase the stability and half-life of such molecules in
physiological environments or as probes on a biochip. Mixtures of naturally occurring
nucleic acids and analogs can be made; alternatively, mixtures of different nucleic acid
analogs, and mixtures of naturally occurring nucleic acids and analogs may be made.

A variety of references disclose such nucleic acid analogs, including, for
example, phosphoramidate (Beaucage et al., Tetrahedron 49(10):1925 (1993) and references
therein; Letsinger, J. Org. Chem. 35:3800 (1970); Sprinzl et al., Eur. J. Biochem. 81:579
(1977); Letsinger et al., Nucl. Acids Res. 14:3487 (1986); Sawai et al., Chem. Lett. 805
(1984); Letsinger et al., J. Am. Chem. Soc. 110:4470 (1988); and Pauwels et al., ...
O-methylphosphoroamidite linkages (see Eckstein, Oligonucleotides and Analogues:
A Practical Approach, Oxford University Press), and peptide nucleic acid backbones and
linkages (see Egholm, J. Am. Chem. Soc. 114:1895 (1992); Meier et al., Chem. Int. Ed. Engl.
31:1008 (1992); Nielsen, Nature 365:566 (1993); Carlsson et al., Nature 380:207 (1996), all
of which are incorporated by reference). Other analog nucleic acids include those with
positive backbones; non-ionic
backbones (U.S. Patent Nos. 5,386,023, 5,637,684, 5,602,240, 5,216,141 and ...;
Kiedrowski et al., Angew. Chem. Intl. Ed. English 30:423 (1991); Letsinger et al., J. Am.
Chem. Soc. 110:4470 (1988); Letsinger et al., Nucleoside & Nucleotide 13:1597 (1994);
Chapters 2 and 3, ACS Symposium Series 580, "Carbohydrate Modifications in Antisense
Research", Ed. Y.S. Sanghvi and P. Dan Cook; Mesmaeker et al., Bioorganic & Medicinal
Chem. Lett. 4:395 (1994); Jeffs et al., J. Biomolecular NMR 34:17 (1994); Tetrahedron Lett.
37:743 (1996)) and non-ribose backbones, including those described in U.S. Patent Nos.
5,235,033 and 5,034,506, and Chapters 6 and 7, ACS Symposium Series 580, "Carbohydrate
Modifications in Antisense Research", Ed. Y.S. Sanghvi and P. Dan Cook. Nucleic acids
containing one or more carbocyclic sugars are also included within one definition of nucleic
acids (see Jenkins et al., Chem. Soc. Rev. (1995) pp. 169-176). Several nucleic acid analogs
are described in Rawls, C & E News, June 2, 1997, page 35. All of these references are hereby
expressly incorporated by reference.

Particularly preferred are peptide nucleic acids (PNA), which include peptide
nucleic acid analogs. These backbones are substantially non-ionic under neutral conditions, in
contrast to the highly charged phosphodiester backbone of naturally occurring nucleic acids.
This results in two advantages. First, the PNA backbone exhibits improved hybridization
kinetics. PNAs have larger changes in the melting temperature (Tm) for mismatched versus
perfectly matched basepairs. DNA and RNA typically exhibit a 2-4°C drop in Tm for an
internal mismatch. With the non-ionic PNA backbone, the drop is closer to 7-9°C. Similarly,
due to their non-ionic nature, hybridization of the bases attached to these backbones is
relatively insensitive to salt concentration. In addition, PNAs are not degraded by cellular
enzymes, and thus can be more stable.

The nucleic acids may be single stranded or double stranded, as specified, or
contain portions of both double stranded or single stranded sequence. As will be appreciated
by those in the art, the depiction of a single strand also defines the sequence of the
complementary strand; thus the sequences described herein also provide the complement of
the sequence. The nucleic acid may be DNA, both genomic and cDNA, RNA or a hybrid,
where the nucleic acid may contain combinations of deoxyribo- and ribonucleotides, and
combinations of bases, including uracil, adenine, thymine, cytosine, guanine, inosine,
xanthine, hypoxanthine, isocytosine, isoguanine, etc. "Transcript" typically refers to a
naturally occurring RNA, e.g., a pre-mRNA, hnRNA, or mRNA. As used herein, the term
"nucleoside" includes nucleotides and nucleoside and nucleotide analogs, and modified
nucleosides such as amino modified nucleosides. In addition, "nucleoside" includes
non-naturally occurring analog structures. Thus, e.g., the individual units of a peptide nucleic
acid, each containing a base, are referred to herein as a nucleoside.
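
The statement that the depiction of a single strand also defines the complementary strand can be illustrated with a short sketch. It assumes standard Watson-Crick pairing and the four unmodified bases only; the function name reverse_complement and the rna flag are hypothetical and introduced purely for illustration.

```python
# Hypothetical sketch: derive the complementary strand (written 5' to 3')
# from a depicted single strand, assuming standard Watson-Crick pairing.
DNA_COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}
RNA_COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(seq: str, rna: bool = False) -> str:
    """Return the reverse complement of seq.

    Only the four standard bases are handled; modified bases such as
    inosine are outside the scope of this sketch and raise ValueError.
    """
    table = RNA_COMPLEMENT if rna else DNA_COMPLEMENT
    try:
        return "".join(table[base] for base in reversed(seq.upper()))
    except KeyError as exc:
        raise ValueError(f"unsupported base: {exc.args[0]}") from None
```

For example, reverse_complement("ATGC") gives "GCAT", and reverse_complement("AUGC", rna=True) gives "GCAU".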

A "label" or a "detectable moiety" is a composition detectable by 
spectroscopic, photochemical, biochemical, immunochemical, chemical, or other physical 
means. For example, useful labels include 32P, fluorescent dyes, electron-dense reagents,
enzymes (e.g., as commonly used in an ELISA), biotin, digoxigenin, or haptens and proteins
or other entities which can be made detectable, e.g., by incorporating a radiolabel into the
peptide or used to detect antibodies specifically reactive with the peptide. The labels may be
incorporated into the breast cancer nucleic acids, proteins and antibodies at any position.
Any method known in the art for conjugating the antibody to the label may be employed,
including those methods described by Hunter et al., Nature, 144:945 (1962); David et al.,
Biochemistry, 13:1014 (1974); Pain et al., J. Immunol. Meth., 40:219 (1981); and Nygren, J.
Histochem. and Cytochem., 30:407 (1982).

An "effector" or "effector moiety" or "effector component" is a molecule that 
is bound (or linked, or conjugated), either covalently, through a linker or a chemical bond, or 
noncovalently, through ionic, van der Waals, electrostatic, or hydrogen bonds, to an antibody. 
The "effector" can be a variety of molecules including, e.g., detection moieties including 
radioactive compounds, fluorescent compounds, an enzyme or substrate, tags such as epitope
tags, a toxin; activatable moieties, a chemotherapeutic agent; a lipase; an antibiotic; or a
radioisotope emitting "hard", e.g., beta, radiation.

A "labeled nucleic acid probe or oligonucleotide" is one that is bound, either 
covalently, through a linker or a chemical bond, or noncovalently, through ionic, van der
Waals, electrostatic, or hydrogen bonds to a label such that the presence of the probe may be
detected by detecting the presence of the label bound to the probe. Alternatively, methods
using high affinity interactions may achieve the same results where one of a pair of binding
partners binds to the other, e.g., biotin, streptavidin.

As used herein a "nucleic acid probe or oligonucleotide" is defined as a
nucleic acid capable of binding to a target nucleic acid of complementary sequence through
one or more types of chemical bonds, usually through complementary base pairing, usually
through hydrogen bond formation. As used herein, a probe may include natural (i.e., A, G, C,
or T) or modified bases (7-deazaguanosine, inosine, etc.). In addition, the bases in a probe
may be joined by a linkage other than a phosphodiester bond, so long as it does not
functionally interfere with hybridization. Thus, e.g., probes may be peptide nucleic acids in
which the constituent bases are joined by peptide bonds rather than phosphodiester linkages.
It will be understood by one of skill in the art that probes may bind target sequences lacking
complete complementarity with the probe sequence depending upon the stringency of the
hybridization conditions. The probes are preferably directly labeled as with isotopes,
chromophores, lumiphores, chromogens, or indirectly labeled such as with biotin to which a
streptavidin complex may later bind. By assaying for the presence or absence of the probe,
one can detect the presence or absence of the select sequence or subsequence. Diagnosis or
prognosis may be based at the genomic level, or at the level of RNA or protein expression.
The term "recombinant" when used with reference, e.g., to a cell, or nucleic
acid, protein, or vector, indicates that the cell, nucleic acid, protein or vector, has been
modified by the introduction of a heterologous nucleic acid or protein or the alteration of a
native nucleic acid or protein, or that the cell is derived from a cell so modified. Thus, e.g.,
recombinant cells express genes that are not found within the native (non-recombinant) form
of the cell or express native genes that are otherwise abnormally expressed, under expressed

PCT/US02/02242
WO 02/059377

or not expressed at all. By the term ... manipulation of nucleic acid, e.g., using ...
acid as depicted above.

The term "heterologous" when used with reference to portions of a nucleic
acid indicates that the nucleic acid comprises two or more subsequences that are not normally
found in the same relationship to each other in nature. For instance, the nucleic acid is
typically recombinantly produced, having two or more sequences, e.g., from unrelated genes
arranged to make a new functional nucleic acid, e.g., a promoter from one source and a
coding region from another source. Similarly, a heterologous protein will often refer to two
or more subsequences that are not found in the same relationship to each other in nature (e.g.,
a fusion protein).

A "promoter" is defined as an array of nucleic acid control sequences that 
direct transcription of a nucleic acid. As used herein, a promoter includes necessary nucleic
acid sequences near the start site of transcription, such as, in the case of a polymerase II type
promoter, a TATA element. A promoter also optionally includes distal enhancer or repressor
elements, which can be located as much as several thousand base pairs from the start site of
transcription. A "constitutive" promoter is a promoter that is active under most
environmental and developmental conditions. An "inducible" promoter is a promoter that is
active under environmental or developmental regulation. The term "operably linked" refers
to a functional linkage between a nucleic acid expression control sequence (such as a
promoter, or array of transcription factor binding sites) and a second nucleic acid sequence,
wherein the expression control sequence directs transcription of the nucleic acid
corresponding to the second sequence.

... total cellular or library DNA or RNA).

The phrase "stringent hybridization conditions" refers to conditions under
which a probe will hybridize to its target subsequence, typically in a complex mixture of
nucleic acids, but to no other sequences. Stringent conditions are sequence-dependent and
will be different in different circumstances. Longer sequences hybridize specifically at
higher temperatures. An extensive guide to the hybridization of nucleic acids is found in
Tijssen, Techniques in Biochemistry and Molecular Biology: Hybridization with Nucleic
Probes, "Overview of principles of hybridization and the strategy of nucleic acid assays"
(1993). Generally, stringent conditions are selected to be about 5-10°C lower than the
thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is
the temperature (under defined ionic strength, pH, and nucleic concentration) at which 50%
of the probes complementary to the target hybridize to the target sequence at equilibrium (as
the target sequences are present in excess, at Tm, 50% of the probes are occupied at
equilibrium). Stringent conditions will be those in which the salt concentration is less than
about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium ion concentration (or other
salts) at pH 7.0 to 8.3 and the temperature is at least about 30°C for short probes (e.g., 10 to
50 nucleotides) and at least about 60°C for long probes (e.g., greater than 50 nucleotides).
Stringent conditions may also be achieved with the addition of destabilizing agents such as
formamide. For selective or specific hybridization, a positive signal is at least two times
background, preferably 10 times background hybridization. Exemplary stringent
hybridization conditions ... incubating at 42°C, or, 5x SSC, 1% SDS, ...
0.1% SDS at 65°C. For PCR, a temperature of about 36°C is typical for low stringency
amplification, although annealing temperatures may vary between about 32°C and ...
is typical, although high stringency annealing temperatures can range from ...
both high and low stringency amplifications include a denaturation phase of ...-95°C for
30 sec.-2 min., an annealing phase lasting 30 sec.-2 min., and an extension phase of about
72°C for 1-2 min. Protocols and guidelines for low and high stringency amplification ...
Applications, Academic Press, Inc. N.Y.).
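
The two numerical rules of thumb in this definition (a hybridization temperature about 5-10°C below the Tm, and a positive signal of at least two times background) can be sketched as follows. This is a minimal illustration only: the Tm is taken as a given input rather than computed, and the function names stringent_temperature_range and is_positive_signal are hypothetical.

```python
# Hypothetical sketch of two rules of thumb stated in the definition above.
def stringent_temperature_range(tm_celsius: float) -> tuple:
    """Return (low, high) in degrees C for a window 10 to 5 degrees below Tm.

    The Tm itself must be supplied by the caller; this sketch does not
    estimate it from sequence, salt concentration, or pH.
    """
    return (tm_celsius - 10.0, tm_celsius - 5.0)


def is_positive_signal(signal: float, background: float,
                       min_ratio: float = 2.0) -> bool:
    """Return True if signal is at least min_ratio times background.

    min_ratio defaults to 2.0 (at least two times background); a caller
    may pass a stricter ratio.
    """
    if background <= 0:
        raise ValueError("background must be positive")
    return signal / background >= min_ratio
```

With an assumed Tm of 65°C the window is 55-60°C; a signal of 250 over a background of 100 counts as positive at the default ratio, while 150 over 100 does not.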

Nucleic acids that do not hybridize to each other under stringent conditions are 
still substantially identical if the polypeptides which they encode are substantially identical. 
This occurs, e.g., when a copy of a nucleic acid is created using the maximum codon 
degeneracy permitted by the genetic code. In such cases, the nucleic acids typically hybridize 
under moderately stringent hybridization conditions. Exemplary "moderately stringent 
hybridization conditions" include a hybridization in a buffer of 40% formamide, 1 M NaCl,
1% SDS at 37°C, and a wash in 1X SSC at 45°C. A positive hybridization is at least twice
background. Those of ordinary skill will readily recognize that alternative hybridization and
wash conditions can be utilized to provide conditions of similar stringency. Additional
guidelines for determining hybridization parameters are provided in numerous references,
e.g., Current Protocols in Molecular Biology, ed. Ausubel, et al.

The phrase "functional effects" in the context of assays for testing compounds
that modulate activity of a breast cancer protein includes the determination of a parameter
that is indirectly or directly under the influence of the breast cancer protein or nucleic acid,
e.g., a functional, physical, or chemical effect, such as the ability to decrease breast cancer. It
includes ligand binding activity; cell growth on soft agar; anchorage dependence; contact
inhibition and density limitation of growth; cellular proliferation; cellular transformation;
growth factor or serum dependence; tumor specific marker levels; invasiveness into Matrigel;
tumor growth and metastasis in vivo; mRNA and protein expression in cells undergoing
metastasis, and other characteristics of breast cancer cells. "Functional effects" include in
vitro, in vivo, and ex vivo activities.

By "determining the functional effect" is meant assaying for a compound that
increases or decreases a parameter that is indirectly or directly under the influence of a breast
cancer protein sequence, e.g., functional, physical and chemical effects. Such
functional effects can be measured by any means known to those skilled in the art, e.g.,
changes in spectroscopic characteristics (e.g., fluorescence, absorbance, refractive index),
hydrodynamic (e.g., shape), chromatographic, or solubility properties for the protein,
measuring inducible markers or transcriptional activation of the breast cancer protein; ...
measuring cellular proliferation. Determination of the functional effect of a compound on
breast cancer can also be performed using breast cancer assays known to those of skill in the
art such as in vitro assays, e.g., cell growth on soft agar; anchorage dependence; contact
inhibition and density limitation of growth; cellular proliferation; cellular transformation;
growth factor or serum dependence; tumor specific marker levels; invasiveness into Matrigel;
tumor growth and metastasis in vivo; mRNA and protein expression in cells undergoing
metastasis, and other characteristics of breast cancer cells. The functional effects can be
evaluated by many means known to those skilled in the art, e.g., microscopy for quantitative
or qualitative measures of alterations in morphological features, measurement of changes in
RNA or protein levels for breast cancer-associated sequences, measurement of RNA stability,
identification of downstream or reporter gene expression (CAT, luciferase, β-gal, GFP and
the like), e.g., via chemiluminescence, fluorescence, colorimetric reactions, antibody binding,
inducible markers, and ligand binding assays.

"Inhibitors", "activators", and "modulators" of breast cancer polynucleotide 
and polypeptide sequences are used to refer to activating, inhibitory, or modulating molecules 
or compounds identified using in vitro and in vivo assays of breast cancer polynucleotide and 
polypeptide sequences. Inhibitors are compounds that, e.g., bind to, partially or totally block ...

Samples or assays comprising breast cancer proteins that are treated with a ...
... assigned a relative protein activity value of 100%. Inhibition ...
preferably 50%, more preferably 25-0%. Activation of a breast cancer polypeptide is ...

The phrase "changes in cell growth" refers to any ...
... of growth, loss of growth factor or serum requirements, changes in cell
morphology, gaining or losing immortalization, gaining or losing tumor specific markers,
ability to form or suppress tumors when injected into suitable animal hosts, and/or
immortalization of the cell. See, e.g., Freshney, Culture of Animal Cells a Manual of Basic
Technique pp. 231-241 (3rd ed. 1994).

"Tumor cell" refers to precancerous, cancerous, and normal cells in a tumor.

"Cancer cells," "transformed" cells or "transformation" in tissue culture, refers
to spontaneous or induced phenotypic changes that do not necessarily involve the uptake of
new genetic material. Although transformation can arise from infection with a transforming
virus and incorporation of new genomic DNA or uptake of exogenous DNA, it can also arise
spontaneously or following exposure to a carcinogen, thereby mutating an endogenous gene.
Transformation is associated with phenotypic changes, such as immortalization of cells,
aberrant growth control, nonmorphological changes, and/or malignancy (see, Freshney,
Culture of Animal Cells a Manual of Basic Technique (3rd ed. 1994)).

"Antibody" refers to a polypeptide comprising a framework region from an 
immunoglobulin gene or fragments thereof that specifically binds and recognizes an antigen. 
The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, 
epsilon, and mu constant region genes, as well as the myriad immunoglobulin variable region 
genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as 
gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, 
IgM, IgA, IgD and IgE, respectively. Typically, the antigen-binding region of an antibody or 
its functional equivalent will be most critical in specificity and affinity of binding. See Paul,
Fundamental Immunology.

An exemplary immunoglobulin (antibody) structural unit comprises a 
tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, each pair 
having one "light" (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus 
of each chain defines a variable region of about 100 to 110 or more amino acids primarily
responsible for antigen recognition. The terms variable light chain (VL) and variable heavy
chain (VH) refer to these light and heavy chains respectively.

Antibodies exist, e.g., as intact immunoglobulins or as a number of
well-characterized fragments produced by digestion with various peptidases. Thus, e.g., pepsin
digests an antibody below the disulfide linkages in the hinge region to produce F(ab)'2, a
dimer of Fab which itself is a light chain joined to VH-CH1 by a disulfide bond. The F(ab)'2
... 1993). While various antibody fragments are ...
chemically or by using recombinant DNA ... modification of whole ...
chain Fv) or those identified using phage display libraries (see, e.g., ...
348:552-554 (1990)).

For preparation of antibodies, e.g., recombinant, monoclonal, or polyclonal
antibodies, many techniques known in the art can be used (see, e.g., Kohler & Milstein,
Nature 256:495-497 (1975); Kozbor et al., ... and Goding, ...
production of single chain antibodies (U.S. Patent 4,946,...) ...

... region, or a portion thereof, is altered, replaced or exchanged so that the antigen binding site
... antibody, e.g., an ...; or (b) the variable
region, or a portion thereof, is altered, replaced or exchanged with a variable region having a
different or altered antigen specificity.



... of breast cancer-associated sequences ...
sample; while two states may have any particular gene ...
characteristic of the state of the cell. That is, normal tissue (e.g., normal breast ...)
may be distinguished from cancerous or metastatic cancerous tissue of the breast, or ...
metastatic breast cancerous tissue can be compared ...
obtained.

The identification of sequences that are differentially expressed in breast
cancer ...
act to down-regulate breast cancer, and thus tumor growth or recurrence, in a particular
patient. Similarly, diagnosis and treatment outcomes may be done or confirmed by
comparing patient samples with the known expression profiles. Metastatic tissue can also be
analyzed to determine the stage of breast cancer in the tissue. Furthermore, these gene
expression profiles (or individual genes) allow screening of drug candidates with an eye to
mimicking or altering a particular expression profile; e.g., screening can be done for drugs
that suppress the breast cancer expression profile. This may be done by making biochips ...
These methods can also be done on the protein basis; that is, protein expression levels of the
breast cancer proteins can be evaluated for diagnostic purposes or to screen candidate agents.
In addition, the breast cancer nucleic acid sequences can be administered for gene therapy
purposes, including the administration of antisense nucleic acids, or the breast cancer proteins
(including antibodies and other modulators thereof) administered as therapeutic drugs.

... the present invention provides nucleic acid and protein sequences ...
outlined below, breast cancer sequences include those that are up-regulated (i.e., expressed at
a higher level) in breast cancer, as well as those that are down-regulated ...
however, as will be appreciated by those in the art, breast cancer sequences from other ...
other sequences are provided, from vertebrates, including mammals, including rodents ( ...
horses, etc.) and pets, e.g., (dogs, cats, etc.). Breast cancer sequences from other organisms
may be obtained using the techniques outlined below.

Breast cancer sequences can include both nucleic acid and amino acid
sequences. As will be appreciated by those in the art and is more fully outlined below, breast
cancer nucleic acid sequences are useful in a variety of applications, including diagnostic
applications, which will detect naturally occurring nucleic acids, as well as screening
applications; e.g., biochips ...
selected probes to the breast cancer sequences can be generated.

A breast cancer sequence can be initially identified by substantial nucleic acid
and/or amino acid sequence homology to the breast cancer sequences outlined herein. Such
homology can be based upon the overall nucleic acid or amino acid sequence, and is
generally determined as outlined below, using either homology programs or hybridization
conditions.

For identifying breast cancer-associated sequences, the breast cancer screen
typically includes comparing genes ...
tissue. Other suitable tissue comparisons include comparing breast cancer samples with
metastatic cancer samples from other cancers, such as lung, breast, gastrointestinal cancers,
ovarian, etc. Samples of different stages of breast cancer, e.g., survivor tissue, drug resistant
states, and tissue undergoing metastasis, are applied to biochips comprising nucleic acid
probes. The samples are first microdissected, if applicable, and treated as is known in the art
for the preparation of mRNA. Suitable biochips are commercially available, e.g., from
Affymetrix. Gene expression profiles as described ... generated and the data ...

In one embodiment, the genes s... normal and disease states are compared
to genes expressed in other normal tissues, preferably
normal breast, but also including, and not limited to lung, heart, ... breast, ...
muscle, colon, small intestine, large intestine, spleen, bone and placenta. ...
embodiment, ... preferable
that the target be disease specific, to minimize possible side effects.

In a preferred embodiment, breast cancer sequences are those that are
up-regulated in breast cancer; that is, the expression of these genes is higher in the breast cancer
tissue as compared to non-cancerous tissue. "Up-regulation" as used herein often means at
least about a two-fold change, preferably at least about a three fold change, with at least about
five-fold or higher being preferred. All unigene cluster identification numbers and accession
numbers herein are for the GenBank sequence database and the sequences of the accession
numbers are hereby expressly incorporated by reference. GenBank is known in the art, see,
e.g., Benson, DA et al., Nucleic Acids Research 26:1-7 (1998) and
http://www.ncbi.nlm.nih.gov/. Sequences are also available in other databases, e.g.,
European Molecular Biology Laboratory (EMBL) and DNA Database of Japan (DDBJ).

U.S. Patent Application No. 09/687,...
further discloses related sequences, compositions, and methods of diagnosis and treatment of
breast cancer is hereby expressly incorporated by reference.

In another preferred embodiment, breast cancer sequences are those that are
down-regulated in the breast cancer; that is, the expression of these genes is lower in breast
cancer tissue as compared to non-cancerous tissue (see, e.g., Tables 1, 2, 3, 15, 16, etc.).
"Down-regulation" as used herein often means at least about a two-fold change, preferably at
least about a three fold change, with at least about five-fold or higher being preferred.
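
The fold-change definitions of up-regulation and down-regulation lend themselves to a short worked sketch. It assumes two positive expression measurements on a common linear scale, one from breast cancer tissue and one from non-cancerous tissue. The function name classify_regulation and its threshold parameter are hypothetical; the default of 2.0 mirrors the "at least about a two-fold change" wording, and 3.0 or 5.0 can be passed for the preferred, stricter cut-offs.

```python
# Hypothetical sketch: classify a gene as up- or down-regulated in tumor
# tissue relative to non-cancerous tissue using a fold-change threshold.
def classify_regulation(tumor_level: float, normal_level: float,
                        threshold: float = 2.0) -> str:
    """Return "up", "down", or "unchanged".

    Both levels must be positive values on the same linear scale
    (e.g., normalized array intensities); threshold must exceed 1.
    """
    if tumor_level <= 0 or normal_level <= 0:
        raise ValueError("expression levels must be positive")
    if threshold <= 1:
        raise ValueError("threshold must be greater than 1")
    ratio = tumor_level / normal_level
    if ratio >= threshold:
        return "up"
    if ratio <= 1.0 / threshold:
        return "down"
    return "unchanged"
```

For instance, a tumor level of 600 against a normal level of 100 is classified as "up" at the two-fold threshold, while 400 against 100 is "unchanged" if a five-fold threshold is required.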
Informatics

... over or under expressed in breast cancer ...
subcellular toxicological information can be generated to better direct ...
Similar advantages accrue from
datasets relevant to other biomolecules and bioactive agents (e.g., nucleic acids, saccharides,
...

... includes at least one set of assay data. The data contained in the database is acquired, e.g.,
... any form in which data can be maintained and transmitted, but is preferably an electronic
database. The electronic database of the invention can be maintained on any electronic
device allowing for the storage of and access to the database, such as a personal computer,
but is preferably distributed on a wide area network, such as the World Wide Web.

The focus of the present section on databases that include peptide sequence
data is for clarity of illustration only. It will be apparent to those of skill in the art that similar
databases can be assembled for any assay data acquired using an assay of the invention.

The compositions and methods for identifying and/or quantitating the relative
and/or absolute abundance of a variety of molecular and macromolecular species from a
biological sample undergoing breast cancer, i.e., the identification of breast cancer-associated
sequences described herein, provide an abundance of information, which can be correlated
... for manual review and analysis, in a preferred embodiment ...
speed computers is utilized.

An array of methods for indexing and retrieving biomolecular information is
... sequences to be catalogued and searched according to one or more protein function
... for obtaining full-length sequences from the collection of partial length sequences. U.S.
... in a gene database based on the degree of similarity
between a key sequence and a target sequence. U.S. Patent 5,538,897 discloses a method
using mass spectroscopy fragmentation patterns of peptides to identify amino acid sequences
in computer databases by comparison of predicted mass spectra with experimentally-derived
mass spectra using a closeness-of-fit measure. U.S. Patent 5,926,818 discloses a
multi-dimensional database comprising a functionality for multi-dimensional data analysis
described as on-line analytical processing (OLAP), which entails the consolidation of
projected and actual data according to more than one consolidation path or dimension. U.S.
Patent 5,295,261 reports a hybrid database structure in which the fields of each database
record are divided into two classes, navigational and informational data, with navigational
fields stored in a hierarchical topological map which can be viewed as a tree structure or as
the merger of two or more such tree structures.

See also Mount et al, Bioinformatics (2001); Biological Sequence Analysis: 
Probabilistic Models of Proteins and Nucleic Acids (Durbin et al., eds., 1999);
Bioinformatics: A Practical Guide to the Analysis of Genes and Proteins (Baxevanis &
Ouellette eds., 1998)); Rashidi & Buehler, Bioinformatics: Basic Applications in Biological
Science and Medicine (1999); Introduction to Computational Molecular Biology (Setubal et 






The present invention provides a computer database ...
... assay data records ...
with data specifying the source of the target-containing sample from which each sequence
specificity record was obtained.

... sample is from a control tissue sample known to be free of pathological disorders. In a
variation, at least one of the sources is a known pathological tissue specimen, e.g., a ...
variation, the assay records cross-tabulate one or more of the following parameters for each
target species in a sample: (1) a unique identification code, which can include, e.g., a target
molecular structure and/or characteristic separation coordinate (e.g., electrophoretic
coordinates); (2) sample source; and (3) absolute and/or relative quantity of the target species
present in the sample.
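
By way of illustration only, an assay record carrying the three parameters listed above, and a cross-tabulation of such records by sample source, might be sketched as follows. The class AssayRecord, its field names, and the function cross_tabulate are hypothetical; the disclosure does not prescribe any particular data layout.

```python
# Hypothetical sketch of an assay record and a cross-tabulation by source.
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class AssayRecord:
    target_id: str      # (1) unique identification code for the target species
    sample_source: str  # (2) sample source, e.g., control or pathological tissue
    quantity: float     # (3) absolute or relative quantity in the sample


def cross_tabulate(records):
    """Return {target_id: {sample_source: quantity}} for the given records.

    If the same target and source occur more than once, the last
    record seen wins; a real system would need a policy for replicates.
    """
    table = defaultdict(dict)
    for record in records:
        table[record.target_id][record.sample_source] = record.quantity
    return dict(table)
```

A record set with one target measured in both a control sample and a tumor sample then yields a single row with two source columns, which is the shape needed for comparing quantities across sources.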

The invention also provides for the storage and retrieval of a collection of
target data in a computer data storage apparatus, which can include magnetic disks, optical
disks, magneto-optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM,
magnetic bubble memory devices, and other data storage devices, including CPU registers
and on-CPU data storage arrays. Typically, the target data records are stored as a bit pattern
in an array of magnetic domains on a magnetizable medium or as an array of charge states or
transistor gate states, such as an array of cells in a DRAM device (e.g., each cell comprised of
a transistor and a charge storage area, which may be on the transistor). In one embodiment,
the invention provides such storage devices, and computer systems built therewith,
comprising a bit pattern encoding a protein expression fingerprint record comprising unique
identifiers for at least 10 target data records cross-tabulated with target source.
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... , ^ „»Ude or nucleic acid sequences, compns-ng performing a 

compansonc TP AST A, GAP, BESTFTT) and/or the companson may 

embodiment thereof (e.g., FAST A, TP AST A, OAr, I „, „ f fences 

be of me relafive amount of a peptide or nucleic acid sequence m a pool of sequences 
d.termmedftomapolypepUdcornucleieacid S ampl«ofaspec,m«>. 

The invention also preferably provides a magnetic disk, such as an IBM-compatible
(DOS, Windows, Windows 95/98/2000, Windows NT, OS/2) or other format
(e.g., Linux, SunOS, Solaris, AIX, SCO Unix, VMS, MV, Macintosh, etc.) floppy diskette or
hard (fixed, Winchester) disk drive, comprising a bit pattern encoding data from an assay of
the invention in a file format suitable for retrieval and processing in a computerized sequence
analysis, comparison, or relative quantitation method.
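
As a rough illustration of "a bit pattern encoding data from an assay" in a fixed file format, the sketch below packs one record into 32 bytes and reads it back. The record layout (field widths, byte order) and the function names are assumptions made purely for the example; the specification does not prescribe any particular format.

```python
import struct

# Hypothetical fixed-width layout: 8-byte id, 16-byte source, 64-bit float quantity,
# little-endian with no padding. Not a format defined by the specification.
RECORD_FORMAT = "<8s16sd"


def encode_record(target_id, sample_source, quantity):
    """Pack one assay record into a fixed-size byte string."""
    return struct.pack(
        RECORD_FORMAT,
        target_id.encode("ascii"),
        sample_source.encode("ascii"),
        quantity,
    )


def decode_record(blob):
    """Unpack a byte string produced by encode_record."""
    raw_id, raw_source, quantity = struct.unpack(RECORD_FORMAT, blob)
    return (
        raw_id.rstrip(b"\x00").decode("ascii"),
        raw_source.rstrip(b"\x00").decode("ascii"),
        quantity,
    )


blob = encode_record("T001", "control", 1.5)
decoded = decode_record(blob)
```

Fixed-width records allow a file of many records to be read by offset; strings longer than the field width would be truncated by this layout, which a real format would have to handle.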

The invention also provides a network, comprising a plurality of computing 
devices linked via a data link, such as an Ethernet cable (coax or 10BaseT), telephone line,
ISDN line, wireless network, optical fiber, or other suitable signal transmission medium, 
whereby at least one network device (e.g., computer, disk array, etc.) comprises a pattern of 
magnetic domains (e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM 
cells) composing a bit pattern encoding data acquired from an assay of the invention. 

The invention also provides a method for transmitting assay data that includes
generating an electronic signal on an electronic communications device, such as a modem, 
ISDN terminal adapter, DSL, cable modem, ATM switch, or the like, wherein the signal 
includes (in native or encrypted format) a bit pattern encoding data from an assay or a 
database comprising a plurality of assay results obtained by the method of the invention. 

In a preferred embodiment, the invention provides a computer system for 
comparing a query target to a database containing an array of data structures, such as an assay 
result obtained by the method of the invention, and ranking database targets based on the 
degree of identity and gap weight to the target data. A central processor is preferably
initialized to load and execute the computer program for alignment and/or comparison of the
assay results. Execution of the computer program results in the central processor retrieving
the assay data, which can be transferred to random access memory (e.g., DRAM, SRAM,
SGRAM, or SDRAM). Targets are ranked according to the degree of correspondence
between a selected assay characteristic (e.g., binding) and the same characteristic of the
query target. For example, a central processor can be a conventional computer; the data can
reside on a data server or a memory device; and an I/O device can be an ISDN terminal
adapter, an Ethernet port, a punched card reader, or the like. The computer system runs
a program for alignment and comparison, typically with rank-ordering of comparison results
on the basis of computed similarity values.
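
Rank-ordering database targets by a computed similarity value can be sketched as follows. This is a minimal stand-in, assuming a simple global alignment score with a linear gap penalty; it is not FASTA, GAP, or BESTFIT, and the scoring weights, sequence names, and sequences are hypothetical.

```python
def alignment_score(a, b, match=1, mismatch=-1, gap=-2):
    """Global alignment score (Needleman-Wunsch style) with a linear gap penalty."""
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            curr.append(max(diag, prev[j] + gap, curr[j - 1] + gap))
        prev = curr
    return prev[-1]


def rank_targets(query, database):
    """Return (name, score) pairs, highest similarity first."""
    scored = [(name, alignment_score(query, seq)) for name, seq in database.items()]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Hypothetical database of three short sequences.
database = {
    "seq_a": "ACGTACGTAC",
    "seq_b": "ACGTTCGTAC",
    "seq_c": "TTTTTTTTTT",
}
ranking = rank_targets("ACGTACGTAC", database)
```

Here identity contributes through the match and mismatch weights and the gap weight through the gap penalty, so changing either reorders the ranking in the way the paragraph describes.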

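The breast cancer proteins of the present invention may be classified as secreted proteins,
transmembrane proteins or intracellular proteins. In one embodiment, the breast cancer
protein is an intracellular protein. Intracellular proteins are involved in all aspects of cellular
function and replication (including, e.g., signaling pathways); aberrant expression of such
proteins often results in unregulated or disregulated cellular processes (see, e.g., Molecular
Biology of the Cell (Alberts, ed., 3rd ed., 1994)). For example, many intracellular proteins
have enzymatic activity such as protein kinase activity, protein phosphatase activity, protease
activity, and the like. Intracellular proteins also serve as docking proteins that are involved in
targeting proteins to various subcellular localizations, and are involved in maintaining the
structural integrity of organelles.

In addition to the highly conserved sequences found in the enzymatic domain of proteins,
highly conserved sequences have been identified in proteins that are involved in
protein-protein interaction. For example, Src-homology-2 (SH2) domains bind
tyrosine-phosphorylated targets in a sequence dependent manner. PTB domains, which are
distinct from SH2 domains, also bind tyrosine phosphorylated targets. SH3 domains bind to
proline-rich targets. In addition, PH domains, tetratricopeptide repeats and WD domains, to
name only a few, have been shown to mediate protein-protein interactions. Some of these may
also be involved in binding to phospholipids or other second messengers. As will be
appreciated by those in the art, these motifs can be identified on the basis of primary sequence.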

One useful database is Pfam (protein families), which is a large collection of multiple
sequence alignments and hidden Markov models covering many common protein domains
(1998).

In another embodiment, the breast cancer sequences are transmembrane proteins.
Transmembrane proteins are molecules that span a phospholipid bilayer of a cell.
They may have an intracellular domain, an extracellular domain, or both.
The intracellular domains of such proteins may have a number of functions including those
already described for intracellular proteins. For example, the intracellular domain may have
enzymatic activity and/or may serve as a binding site for additional proteins.

Frequently the intracellular domain of transmembrane proteins serves both roles. In addition,
autophosphorylation of tyrosines on the receptor molecule itself creates binding sites for
additional SH2 domain containing proteins.

Transmembrane proteins may contain from one to many transmembrane domains. Many
important cell surface receptors, such as G protein coupled receptors (GPCRs), contain
several transmembrane domains. Characteristics of transmembrane domains include a run of
consecutive hydrophobic amino acids that may be followed by charged amino acids. Therefore,
the localization and number of transmembrane domains within the protein may be predicted.
Important transmembrane protein receptors include, but are not limited to, growth factor
receptors, human growth hormone receptor, insulin-like growth factor receptor, and low
density lipoprotein receptor.

The extracellular domains of transmembrane proteins are diverse; however, conserved motifs
are found repeatedly among various extracellular domains. Many extracellular domains are
involved in binding to other molecules. For example, growth factors
bind to their cognate receptors to initiate a variety of cellular responses.
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' . r . ^theHke Extracellular domains also 
typically permeabilized to provide access to intracellular proteins.

It will also be appreciated by those in the art that a transmembrane protein can
be made soluble by removing transmembrane sequences, e.g., through recombinant methods.
Furthermore, transmembrane proteins that have been made soluble can be made to be
secreted through recombinant means by adding an appropriate signal sequence.

In another embodiment, the breast cancer proteins are secreted proteins; the
secretion of which can be either constitutive or regulated. These proteins have a signal
peptide or signal sequence that targets the molecule to the secretory pathway. Secreted
proteins may function in an autocrine manner (acting on the cell that secreted the factor), a
paracrine manner (acting on cells in close proximity to the cell that secreted the factor) or an
endocrine manner (acting on cells at a distance). Thus secreted molecules find use in
modulating or altering numerous aspects of physiology. Breast cancer proteins that are
secreted proteins are particularly preferred in the present invention as they serve as good
targets for diagnostic markers, e.g., for blood, plasma, serum, or stool tests.

Use of breast cancer nucleic acids 

As described above, breast cancer sequence is initially identified by substantial
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sequencesoutlinedherem. ^*~™^ u ^ Ukm , using either homology 
programs or hybridization conditions. Typically, unx 

^^^tt^car.cerouo.e.aoiti^of.tic— , e.,, .he sequences 


gene, e.g., systems such as UniGene (see, http://www.ncbi.nlm.nih.gov/UniGene/).

Once the breast cancer nucleic acid is identified, it can be cloned and, if
necessary, its constituent parts recombined to form the entire breast cancer nucleic acid.
Once isolated, e.g., as a linear nucleic acid segment, the recombinant breast cancer nucleic
acid can be further used as a probe to identify and isolate other breast cancer nucleic acids,
e.g., extended coding regions. It can also be used as a "precursor" nucleic acid to make
modified or variant breast cancer nucleic acids and proteins.

The breast cancer nucleic acids of the present invention are used in several
ways. In a first embodiment, nucleic acid probes to the breast cancer nucleic acids are made
and attached to biochips to be used in screening and diagnostic methods, as outlined below,
or for administration, e.g., for gene therapy, vaccine, and/or antisense applications.
Alternatively, the breast cancer nucleic acids that include coding regions of breast cancer
proteins can be put into expression vectors for the expression of breast cancer proteins, again
for screening purposes or for administration to a patient.

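In a preferred embodiment, nucleic acid probes to breast cancer nucleic acids
(both the nucleic acid sequences outlined in the figures and/or the complements thereof) are
made. The nucleic acid probes attached to the biochip are designed to be substantially
complementary to the breast cancer nucleic acids, such that hybridization of the target
sequence and the probes occurs. This complementarity need not be perfect; there may be any
number of base pair mismatches which will interfere with hybridization between the target
sequence and the single stranded nucleic acids of the present invention. However, if the
number of mutations is so great that no hybridization can occur under even the least stringent
of hybridization conditions, the sequence is not a complementary target sequence. Thus, by
"substantially complementary" herein is meant that the probes are sufficiently complementary
to the target sequences to hybridize under normal reaction conditions, particularly high
stringency conditions, as outlined herein.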

A nucleic acid probe is generally single stranded but can be partially single
and partially double stranded. The strandedness of the probe is dictated by the structure,
composition, and properties of the target sequence. In general, the nucleic acid probes range
from about 8 to about 100 bases long, with from about 10 to about 80 bases being preferred,
and from about 30 to about 50 bases being particularly preferred. That is, generally whole
genes are not used. In some embodiments, much longer nucleic acids can be used, up to
hundreds of bases.

In a preferred embodiment, more than one probe per sequence is used, with
either overlapping probes or probes to different sections of the target being used. That is,
two, three, four or more probes, with three being preferred, are used to build in a redundancy
for a particular target. The probes can be overlapping (i.e., have some sequence in common),
or separate. In some cases, PCR primers may be used to amplify signal for higher sensitivity.
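
The choice between overlapping and separate probes for one target can be illustrated with a short sketch. It assumes a simple fixed-step tiling scheme; the function names, default values, and the example target are hypothetical, and real probe design would also weigh factors such as melting temperature and uniqueness.

```python
def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def tile_probes(target, probe_len=30, step=15, max_probes=3):
    """Pick up to max_probes windows of the target, spaced step bases apart.

    step < probe_len gives overlapping probes; step >= probe_len gives
    separate probes. Each probe is the reverse complement of its window,
    so that it is complementary to the target strand.
    """
    if not 8 <= probe_len <= 100:
        raise ValueError("probe_len outside the about 8 to about 100 base range")
    probes = []
    for start in range(0, len(target) - probe_len + 1, step):
        window = target[start:start + probe_len]
        probes.append((start, reverse_complement(window)))
        if len(probes) == max_probes:
            break
    return probes


target = "ACGT" * 20  # hypothetical 80-base target sequence
overlapping = tile_probes(target, probe_len=30, step=15)
separate = tile_probes(target, probe_len=30, step=30)
```

With these assumed settings the overlapping set yields three probes sharing 15 bases with each neighbor, while the separate set fits only two non-overlapping probes on the 80-base target.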

As will be appreciated by those in the art, nucleic acids can be attached or
immobilized to a solid support in a wide variety of ways. By "immobilized" and grammatical
equivalents herein is meant the association or binding between the nucleic acid probe and the
solid support is sufficient to be stable under the conditions of binding, washing, analysis, and
removal as outlined below. The binding can typically be covalent or non-covalent. By
"non-covalent binding" and grammatical equivalents herein is meant one or more of
electrostatic, hydrophilic, and hydrophobic interactions. Included in non-covalent binding is
the covalent attachment of a molecule, such as streptavidin, to the support and the
non-covalent binding of the biotinylated probe to the streptavidin. Immobilization may also
involve a combination of covalent and non-covalent interactions.

In general, the probes are attached to the biochip in a wide variety of ways, as
will be appreciated by those in the art. As described herein, the nucleic acids can either be
synthesized first, with subsequent attachment to the biochip, or can be directly synthesized on
the biochip.

The biochip comprises a suitable solid substrate. By "substrate" or "solid
support" or other grammatical equivalents herein is meant a material that can be modified to
contain discrete individual sites appropriate for the attachment or association of the nucleic
acid probes and is amenable to at least one detection method. As will be appreciated by those
in the art, the number of possible substrates is very large, and includes, but is not limited to,
glass and modified or functionalized glass, plastics (including acrylics, polystyrene and
copolymers of styrene and other materials, polypropylene, polyurethanes, Teflon, etc.),
polysaccharides, nylon or nitrocellulose, resins, silica or silica-based materials including
silicon and modified silicon, carbon, metals, inorganic glasses, plastics, etc. In general, the
substrates allow optical detection and do not appreciably fluoresce. A preferred substrate is
described in copending application entitled Reusable Low Fluorescent Plastic Biochip, U.S.
Application Serial No. 09/270,214, filed March 15, 1999,
herein incorporated by reference in its entirety.

Generally the substrate is planar, although as will be appreciated by those in
the art, other configurations of substrates may be used as well. For example, the probes may
be placed on the inside surface of a tube, for flow-through sample analysis to minimize
sample volume.
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,hc biocbip is derive with a chemical firncttona, group mcludmg, b 
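amino groups, carboxy groups, oxo groups and thiol groups, with amino groups being
particularly preferred. Using these functional groups, the probes can be attached using
functional groups on the probes. For example, nucleic acids containing amino groups can be
attached to surfaces comprising amino groups, e.g., using linkers as are known in the art, such
as homo- or hetero-bifunctional linkers (see Pierce Chemical Company
catalog, technical section on cross-linkers, pages 155-200).

In this embodiment, oligonucleotides are synthesized as is known in the art, and then
attached to the surface of the solid support. As will be appreciated by those skilled in the
art, either the 5' or 3' terminus may be attached to the solid support, or attachment may
be via an internal nucleoside.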

In another embodiment, the immobilization to the solid support may be very
strong, yet non-covalent. For example, biotinylated oligonucleotides can be made, which
bind to surfaces covalently coated with streptavidin, resulting in attachment.

Alternatively, the oligonucleotides may be synthesized on the surface, as is
known in the art. For example, photoactivation techniques utilizing photopolymerization
compounds and techniques are used. In a preferred embodiment, the nucleic acids can be
synthesized in situ, using well known photolithographic techniques, such as those described
in WO 95/35505; U.S. Patent Nos. 5,700,637 and 5,445,934; and references
cited within, all of which are expressly incorporated by reference; these methods of
attachment form the basis of the Affymetrix GeneChip™ technology.

Often, amplification-based assays are performed to measure the expression
level of breast cancer-associated sequences. These assays are typically performed in
conjunction with reverse transcription. In such assays, a breast cancer-associated nucleic acid
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qoantitative PCR arc provided, e g., m torus « a/., PCS JWaft * 
Applications (1990). In some embodiments, a TaqMan based assay is used to measure
expression. TaqMan based assays use a fluorogenic oligonucleotide probe that contains a 5'
fluorescent dye and a 3' quenching agent. The probe hybridizes to a PCR product, but cannot
itself be extended. When the PCR product is amplified in subsequent cycles, the 5' nuclease
activity of the polymerase results in the cleavage of the TaqMan probe. This cleavage
separates the 5' fluorescent dye and the 3' quenching agent, thereby resulting in an increase
in fluorescence as a function of amplification (see, e.g., literature provided by Perkin-Elmer,
e.g., www2.perkin-elmer.com).

Other suitable amplification methods include, but are not limited to, ligase
chain reaction (LCR) (see Wu & Wallace, Genomics 4:560 (1989), Landegren et al., Science
241:1077 (1988), and Barringer et al., Gene 89:117 (1990)), transcription amplification
(Kwoh et al., Proc. Natl. Acad. Sci. USA 86:1173 (1989)), self-sustained sequence replication
(Guatelli et al., Proc. Nat. Acad. Sci. USA 87:1874 (1990)), dot PCR, and linker adapter PCR,
etc.



Expression of breast cancer proteins from nucleic acids 

In a preferred embodiment, breast cancer nucleic acids, e.g., encoding breast
cancer proteins, are used to make a variety of expression vectors to express breast cancer
proteins which can then be used in screening assays, as described below. Expression vectors
and recombinant DNA technology are well known to those of skill in the art (see, e.g.,
Ausubel, supra, and Gene Expression Systems (Fernandez & Hoeffler, eds, 1999)) and are
used to express proteins. The expression vectors may be either self-replicating
extrachromosomal vectors or vectors which integrate into a host genome. Generally, these
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«W t o ft. mrc,.ic acrd — *^ o „ of ^ opeBbly ^ codmg sequence m a 
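The term "control sequences" refers to DNA sequences used for the expression of an operably
linked coding sequence in a particular host organism. Control sequences that are suitable for
prokaryotes, e.g., include a promoter, optionally an operator sequence, and a ribosome
binding site.

For example, DNA for a presequence or secretory leader is operably linked to DNA for a
polypeptide if it is expressed as a preprotein that participates in the secretion of the
polypeptide; a promoter or enhancer is operably linked
to a coding sequence if it affects the transcription of the sequence; or a ribosome binding
site is operably linked to a coding sequence if it is positioned so as to facilitate translation.
Generally, "operably linked" means that the DNA sequences being linked are contiguous;
however, enhancers do not have to be contiguous. Linking is typically accomplished by
ligation at convenient restriction sites. If such sites do not exist, synthetic oligonucleotide
adaptors or linkers are used in accordance with conventional practice. Transcriptional and
translational regulatory nucleic acid will generally be appropriate to the host cell used to
express the breast cancer protein. Numerous types of appropriate expression vectors, and
suitable regulatory sequences are known in the art for a variety of host cells.

In a preferred embodiment, the regulatory sequences include a promoter and transcriptional
start and stop sequences.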

Promoter sequences encode either constitutive or inducible promoters. The
promoters may be either naturally occurring promoters or hybrid promoters. Hybrid
promoters, which combine elements of more than one promoter, are also known in the art,
and are useful in the present invention.
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„ addition, an expression vector may comprise additional element For 

vectors are well known in the art (e.g., Fernandez & Hoeffler, supra).

In addition, in a preferred embodiment, the expression vector contains a
selectable marker gene to allow the selection of transformed host cells. Selection genes are
well known in the art and will vary with the host cell used.

The breast cancer proteins of the present invention are produced by culturing a
host cell transformed with an expression vector containing nucleic acid encoding a breast
cancer protein, under the appropriate conditions to induce or cause expression of the breast
cancer protein. Conditions appropriate for breast cancer protein expression will vary with the
choice of the expression vector and the host cell, and will be easily ascertained by one skilled
in the art through routine experimentation or optimization. For example, the use of
constitutive promoters in the expression vector will require optimizing the growth and
proliferation of the host cell, while the use of an inducible promoter requires the appropriate
growth conditions for induction. In addition, in some embodiments, the timing of the harvest
is important. For example, the baculoviral systems used in insect cell expression are lytic
viruses, and thus harvest time selection can be crucial for product yield.

Appropriate host cells include yeast, bacteria, archaebacteria, fungi, and insect
and animal cells, including mammalian cells. Of particular interest are Saccharomyces
cerevisiae and other yeasts, E. coli, Bacillus subtilis, Sf9 cells, C129 cells, 293 cells,
Neurospora, BHK, CHO, COS, HeLa cells, HUVEC (human umbilical vein endothelial
cells), THP1 cells (a macrophage cell line) and various other human cells and cell lines.

In a preferred embodiment, the breast cancer proteins are expressed in
mammalian cells. Mammalian expression systems are also known in the art, and include
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wm orv iro s OT promo.er,aae n ov iroS major U tt promo« e r, herpes s,mp.ex v„c,.er, 

flank the coding sequence. Examples of transcription terminator and polyadenylation signals
include those derived from SV40.

The methods of introducing exogenous nucleic acid into mammalian hosts, as
well as other hosts, are well known in the art, and will vary with the host cell used.
Techniques include dextran-mediated transfection, calcium phosphate precipitation,
polybrene mediated transfection, protoplast fusion, electroporation, viral infection,
encapsulation of the polynucleotide(s) in liposomes, and direct microinjection of the DNA
into nuclei.

In a preferred embodiment, breast cancer proteins are expressed in bacterial 
systems. Bacterial expression systems are well known in the art. Promoters from 
bacteriophage may also be used and are known in the art. In addition, synthetic promoters 
and hybrid promoters are also useful; e.g., the tac promoter is a hybrid of the trp and lac
promoter sequences. Furthermore, a bacterial promoter can include naturally occurring
promoters of non-bacterial origin that have the ability to bind bacterial RNA polymerase and
initiate transcription. In addition to a functioning promoter sequence, an efficient ribosome
binding site is desirable. The expression vector may also include a signal peptide sequence
that provides for secretion of the breast cancer protein in bacteria. The protein is either
secreted into the growth media (gram-positive bacteria) or into the periplasmic space, located
between the inner and outer membrane of the cell (gram-negative bacteria). The bacterial
expression vector may also include a selectable marker gene to allow for the selection of
bacterial strains that have been transformed. Suitable selection genes include genes which
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• • to drugs such as ampicillin, chloramphenicol, erythromycin, 

the art, such as calcium chloride treatment, electroporation, and others. 

In one embodiment, breast cancer proteins are produced in insect cells.
Expression vectors for the transformation of insect cells, and in particular, baculovirus-based
expression vectors, are well known in the art.

In a preferred embodiment, breast cancer protein is produced in yeast cells.
Yeast expression systems are well known in the art, and include expression vectors for
Saccharomyces cerevisiae, Candida albicans and C. maltosa, Hansenula polymorpha,
Kluyveromyces fragilis and K. lactis, Pichia guilliermondii and P. pastoris,
Schizosaccharomyces pombe, and Yarrowia lipolytica.

The breast cancer protein may also be made as a fusion protein, using
techniques well known in the art. For example, if the
desired epitope is small, the breast cancer protein may be fused to a carrier protein to form an
immunogen. Alternatively, the breast cancer protein may be made as a fusion protein to
increase expression, or for other reasons. For example, when the breast cancer protein is a
breast cancer peptide, the nucleic acid encoding the peptide may be linked to other nucleic
acid for expression purposes.

In a preferred embodiment, the breast cancer protein is purified or isolated
after expression. Breast cancer proteins may be isolated or purified in a variety of ways
known to those skilled in the art depending on what other components are present in the
sample. Standard purification methods include electrophoretic, molecular, immunological
and chromatographic techniques, including ion exchange, hydrophobic, affinity, and
reverse-phase HPLC chromatography, and chromatofocusing. For example, the breast cancer
protein may be purified using a standard anti-breast cancer protein antibody column.
Ultrafiltration
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• , n ntion with protein concentration, are also useful. For 
and diafiltration techniques, in conjunct wuh prot Pu M eat km (1982). 

nucleic acids are useful in a number of applications. They may be used as
reagents, as vaccine reagents, as screening agents, etc.

the derivative breast cancer peptide will often contain at least one amino acid substitution,
deletion or insertion.

Also included within one embodiment of breast cancer proteins of the present
invention are amino acid sequence variants. These variants typically fall into one or more of
three classes: substitutional, insertional or deletional variants. These variants ordinarily are
prepared by site specific mutagenesis of nucleotides in the DNA encoding the breast cancer
protein, using cassette or PCR mutagenesis or other techniques well known in the art, to
produce DNA encoding the variant, and thereafter expressing the DNA in recombinant cell
culture as outlined above. However, variant breast cancer protein fragments may also be
prepared by in vitro synthesis. Amino acid sequence variants are characterized by the
predetermined nature of the variation,
a feature that sets them apart from naturally occurring allelic or interspecies variation of the
breast cancer protein amino acid sequence. The variants typically exhibit the same
qualitative biological activity as the naturally occurring analogue, although variants can also
be selected which have modified characteristics as will be more fully outlined below.

While the site or region for introducing an amino acid sequence variation is
predetermined, the mutation per se need not be predetermined. For example, in order to
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• j *f c in nWA having a known sequence are well Known, c. s , 

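some cases deletions may be much larger.

Substitutions, deletions, insertions or any combination thereof may be used to
arrive at a final derivative. Generally these changes are done on a few amino acids to
minimize the alteration of the molecule. However, larger changes may be tolerated in certain
circumstances. When small alterations in the characteristics of the breast
cancer protein are desired, substitutions are generally made in accordance with the amino
acid substitution relationships provided in the definition section.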

The variants typically exhibit the same qualitative biological activity and will
elicit the same immune response as the naturally occurring analogue, although variants also
are selected to modify the characteristics of the breast cancer proteins as needed.
Alternatively, the variant may be designed such that the biological activity of the breast
cancer protein is altered. For example, glycosylation sites may be altered or removed.

Substantial changes in function or immunological identity are made by
selecting substitutions that are less conservative. For example, substitutions may be made
which more significantly affect the structure of the polypeptide backbone in the area of the
alteration, the charge or hydrophobicity of the molecule at the target site, or the bulk of the
side chain. The substitutions which in general are expected to produce the greatest changes
in the polypeptide's properties are those in which (a) a hydrophilic residue, e.g. seryl or
threonyl, is substituted for (or by) a hydrophobic residue, e.g. leucyl, isoleucyl,
phenylalanyl, valyl or alanyl; (b) a cysteine or proline is substituted for (or by) any other
residue; (c) a residue
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• • , u •„ * o lvsvl arginyL or histidyl, is substituted for (or by) 
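having an electropositive side chain, e.g. lysyl, arginyl, or histidyl, is substituted for (or by)
an electronegative residue, e.g. glutamyl or aspartyl; or (d) a residue having a bulky side
chain, e.g. phenylalanine, is substituted for (or by) one not having a side chain, e.g. glycine.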
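
The substitution classes can be checked mechanically. The sketch below covers only classes (a) through (c), using one-letter amino acid codes; the function name and the returned labels are hypothetical and serve only to illustrate how a substitution might be flagged as likely to change the polypeptide's properties.

```python
HYDROPHILIC = frozenset("ST")        # seryl, threonyl
HYDROPHOBIC = frozenset("LIFVA")     # leucyl, isoleucyl, phenylalanyl, valyl, alanyl
ELECTROPOSITIVE = frozenset("KRH")   # lysyl, arginyl, histidyl
ELECTRONEGATIVE = frozenset("ED")    # glutamyl, aspartyl
CYS_OR_PRO = frozenset("CP")         # cysteine, proline


def _between(a, b, group1, group2):
    """True if one residue is in group1 and the other in group2 ("for (or by)")."""
    return (a in group1 and b in group2) or (a in group2 and b in group1)


def substitution_classes(original, replacement):
    """Return the set of classes "a", "b", "c" that a substitution falls into."""
    classes = set()
    if original == replacement:
        return classes
    if _between(original, replacement, HYDROPHILIC, HYDROPHOBIC):
        classes.add("a")
    if original in CYS_OR_PRO or replacement in CYS_OR_PRO:
        classes.add("b")
    if _between(original, replacement, ELECTROPOSITIVE, ELECTRONEGATIVE):
        classes.add("c")
    return classes
```

For instance, a serine to leucine change falls in class (a), whereas a leucine to isoleucine change falls in none of the three classes and would be treated as conservative by this check.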

Covalent modifications of breast cancer polypeptides are included within the
scope of this invention. One type of covalent modification includes reacting targeted amino
acid residues of a breast cancer polypeptide with an organic derivatizing agent that is capable
of reacting with selected side chains or the N- or C-terminal residues of a breast cancer
polypeptide. Derivatization with bifunctional agents is useful, for instance, for crosslinking
breast cancer polypeptides to a water-insoluble support matrix or surface for use in the
method for purifying anti-breast cancer polypeptide antibodies or screening assays, as is more
fully described below. Commonly used crosslinking agents include, e.g.,
1,1-bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N-hydroxysuccinimide esters, e.g., esters
with 4-azidosalicylic acid, homobifunctional imidoesters, including disuccinimidyl esters
such as 3,3'-dithiobis(succinimidylpropionate), bifunctional maleimides such as
bis-N-maleimido-1,8-octane and agents such as methyl-3-((p-azidophenyl)dithio)propioimidate.

Other modifications include deamidation of glutaminyl and asparaginyl 
residues to the corresponding glutamyl and aspartyl residues, respectively, hydroxylation of 
proline and lysine, phosphorylation of hydroxyl groups of seryl, threonyl or tyrosyl residues, 
methylation of the amino groups of the lysine, arginine, and histidine side chains (Creighton,
Proteins: Structure and Molecular Properties, pp. 79-86 (1983)), acetylation of the N- 
terminal amine, and amidation of any C-terminal carboxyl group. 

Another type of covalent modification of the breast cancer polypeptide
included within the scope of this invention comprises altering the native glycosylation pattern
of the polypeptide. "Altering the native glycosylation pattern" is intended for purposes
herein to mean deleting one or more carbohydrate moieties found in native sequence breast
cancer polypeptide, and/or adding one or more glycosylation sites that are not present in the
native sequence breast cancer polypeptide. Glycosylation patterns can be altered in many
ways. For example, the use of different cell types to express breast cancer-associated
sequences can result in different glycosylation patterns.
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parhcularly y ^ ^ aclds . 

polypeptide. Such methods are described in the art, e.g., in
Wriston, CRC Crit. Rev. Biochem., pp. 259-306 (1981).

Removal of carbohydrate moieties present on the breast cancer polypeptide
may be accomplished chemically or enzymatically or by mutational substitution of codons
encoding for amino acid residues that serve as targets for glycosylation. Enzymatic cleavage
of carbohydrate moieties can be achieved by the use of endo- and exo-glycosidases as
described in Meth. Enzymol., 138:350 (1987).

Another type of covalent modification of breast cancer comprises linking the
breast cancer polypeptide to one of a variety of nonproteinaceous polymers, e.g.,
polyethylene glycol, polypropylene glycol, or polyoxyalkylenes, in the manner set forth in
U.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or 4,179,337.

Breast cancer polypeptides of the present invention may also be modified in a
way to form chimeric molecules comprising a breast cancer polypeptide fused to another,
heterologous polypeptide or amino acid sequence. In one embodiment, such a chimeric
molecule comprises a fusion of a breast cancer polypeptide with a tag polypeptide which
provides an epitope to which an anti-tag antibody can selectively bind. Such epitope-tagged
forms of a breast cancer polypeptide can be detected using
an antibody against the tag polypeptide. Also, provision of the epitope tag enables the breast
cancer polypeptide to be readily purified by affinity purification using an anti-tag antibody or
another type of affinity matrix that binds to the epitope tag. In an alternative embodiment,
the chimeric molecule may comprise a fusion of a breast cancer polypeptide with an
immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of the
chimeric molecule, such a fusion could be to the Fc region of an IgG molecule.

Various tag polypeptides and their respective antibodies are well known in the
art. Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags;
HIS6 and metal chelation tags, the flu HA tag polypeptide and its antibody 12CA5 (Field et
al., Mol. Cell. Biol. 8:2159-2165 (1988)); the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and
9E10 antibodies thereto (Evan et al., Molecular and Cellular Biology 5:3610-3616 (1985));
and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody (Paborsky et al.,
Protein Engineering 3(6):547-553 (1990)). Other tag polypeptides include the Flag-peptide
(Hopp et al., BioTechnology 6:1204-1210 (1988)); the KT3 epitope peptide (Martin et al.,
Science 255:192-194 (1992)); tubulin epitope peptide (Skinner et al., J. Biol. Chem.
266:15163-15166 (1991)); and the T7 gene 10 protein peptide tag (Lutz-Freyermuth et al.,
Proc. Natl. Acad. Sci. USA 87:6393-6397 (1990)).

Also included are other breast cancer proteins of the breast cancer family, and
breast cancer proteins from other organisms, which are cloned and expressed as outlined
below. Thus, probe or degenerate polymerase chain reaction (PCR) primer sequences may be
used to find other related breast cancer proteins from humans or other organisms. As will be
appreciated by those in the art, particularly useful probe and/or PCR primer sequences
include the unique areas of the breast cancer nucleic acid sequence. As is generally known in
the art, preferred PCR primers are from about 15 to about 35 nucleotides in length, with from
about 20 to about 30 being preferred, and may contain inosine as needed. The conditions for
the PCR reaction are well known in the art (e.g., Innis, PCR Protocols, supra).
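
The primer length ranges above translate into a simple check. This is a hedged sketch only: the function names and labels are hypothetical, the "about" in the stated ranges is treated as exact for simplicity, and inosine is assumed to be written as the letter I.

```python
def primer_length_class(primer):
    """Classify a primer by length: about 20-30 nt preferred, about 15-35 nt acceptable."""
    n = len(primer)
    if 20 <= n <= 30:
        return "preferred"
    if 15 <= n <= 35:
        return "acceptable"
    return "outside range"


def inosine_count(primer):
    """Count inosine positions, assuming inosine is written as 'I'."""
    return primer.upper().count("I")


example_primer = "ACGTIACGTACGTACGTACGTACG"  # hypothetical 24-mer with one inosine
example_class = primer_length_class(example_primer)
```

A fuller primer check would also look at melting temperature and degeneracy, which the passage leaves to the well known PCR conditions.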

Antibodies to breast cancer proteins 

In a preferred embodiment, when the breast cancer protein is to be used to
generate antibodies, e.g., for immunotherapy or immunodiagnosis, the breast cancer protein
should share at least one epitope or determinant with the full length protein. Preferably,
antibodies generated to a unique epitope show little or no cross-reactivity.

Methods of preparing polyclonal antibodies are known to the skilled artisan
(e.g., Coligan, supra; and Harlow & Lane, supra). Polyclonal antibodies can be raised in a
mammal, e.g., by one or more injections of an immunizing agent and, if desired, an adjuvant.
The immunizing agent may include a protein
encoded by a nucleic acid of the figures or fragment thereof or a fusion protein thereof. It
may be useful to conjugate the immunizing agent to a protein known to be immunogenic in
the mammal being immunized. Examples of such immunogenic proteins include but are not
limited to keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, and soybean
trypsin inhibitor. Examples of adjuvants which may be employed include Freund's complete
adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose
dicorynomycolate). The immunization protocol may be selected by one skilled in the art
without undue experimentation.
The antibodies may, alternatively, be monoclonal antibodies. Monoclonal
antibodies may be prepared using hybridoma methods, such as those described by Kohler &
Milstein, Nature 256:495 (1975). In a hybridoma method, a mouse, hamster, or other
appropriate host animal is typically immunized with an immunizing agent to elicit
lymphocytes that produce or are capable of producing antibodies that will specifically bind to
the immunizing agent. Alternatively, the lymphocytes may be immunized in vitro. The
immunizing agent will typically include a polypeptide
or fragment thereof, or a fusion protein thereof. Generally, either peripheral blood
lymphocytes ("PBLs") are used if cells of human origin are desired, or spleen cells or lymph
node cells are used if non-human mammalian sources are desired. The lymphocytes are then
fused with an immortalized cell line using a suitable fusing agent, such as polyethylene
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glyc0 , «o form a hybridoma ceU (Coding, - *— - 

pp. 59-103 098W Usually> ra , M 

particularly myeloma cells of rodent, Bovine an 

medium that preferably contains one or more substances that inhibit
the unfused, immortalized cells. For example, if the parental cells lack the enzyme
hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium
for the hybridomas typically will include hypoxanthine, aminopterin, and thymidine ("HAT
medium"), which substances prevent the growth of HGPRT-deficient cells.

In one embodiment, the antibodies are bispecific antibodies. Bispecific

specificities for at least two different antigens or that have binding specificities for two
epitopes on the same antigen. In one embodiment, one of the binding specificities is for a
protein encoded by a nucleic acid of Tables 1^ or a fragment thereof, the other one is for any
other antigen, and preferably for a cell-surface protein or receptor or receptor subunit,
preferably one that is tumor specific. Alternatively, tetramer-type technology may create
multivalent reagents.

In a preferred embodiment, the antibodies to breast cancer protein are capable
of reducing or eliminating a biological function of a breast cancer protein, as is described
below. That is, the addition of anti-breast cancer protein antibodies (either polyclonal or
preferably monoclonal) to breast cancer tissue (or cells containing breast cancer) may reduce
or eliminate the breast cancer. Generally, at least a 25% decrease in activity, growth, size or
the like is preferred, with at least about 50% being particularly preferred and about a
95-100% decrease being especially preferred.

In a preferred embodiment the antibodies to the breast cancer proteins are 
humanized antibodies (e.g., Xenerex Biosciences, Medarex, Inc., Abgenix, Inc., Protein
Design Labs, Inc.) Humanized forms of non-human (e.g., murine) antibodies are chimeric
molecules of immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv,
Fab, Fab', F(ab')2 or other antigen-binding subsequences of antibodies) which contain
minimal sequence derived from non-human immunoglobulin. Humanized antibodies include
human species (donor antibody) such as mouse, rat or

affinity and capacity. In some instances, Fv framework residues of the human
immunoglobulin are replaced by corresponding non-human residues. Humanized
antibodies may also comprise residues which are found neither in the recipient antibody nor in the

^ a " m ^r 

substantially all of at least one, and typically two, variable domains, in which all or

all or substantially all of the framework (FR) regions are those of a human

^conLna^Tnehunram^anfrbody — ^^con^ 

(Jones et al., Nature 321:522-525 (1986); Riechmann et al., Nature

332:323-329 (1988); and Presta, Curr. Op. Struct. Biol. 2:593-596 (1992)). Humanization
Nature 321:522-525 (1986); Riechmann et al., Nature 332:323-327 (1988); Verhoeyen et al.,
Science 239:1534-1536 (1988)), by substituting rodent CDRs or CDR sequences for the
corresponding sequences of a human antibody. Accordingly, such humanized antibodies are
chimeric antibodies (U.S. Patent No. 4,816,567), wherein substantially less than an intact
human variable domain has been substituted by the corresponding sequence from a non-human
species.



Human antibodies can also be produced using various techniques known in the
art, including phage display libraries (Hoogenboom & Winter, J. Mol. Biol. 227:381 (1991);
Marks et al., J. Mol. Biol. 222:581 (1991)). The techniques of Cole et al. and Boerner et al.
are also available for the preparation of human monoclonal antibodies (Cole et al.,
Monoclonal Antibodies and Cancer Therapy, p. 77 (1985) and Boerner et al., J. Immunol.
147(1):86-95 (1991)). Similarly, human antibodies can be made by introducing human
immunoglobulin loci into transgenic animals, e.g., mice in which the endogenous
immunoglobulin genes have been partially or completely inactivated. Upon challenge,
human antibody production is observed, which closely resembles that seen in humans in all
and antibody repertoire. This approach is

described, e.g.,ta U.S. Patent Nos. 5,545,807 5,545, , 10:779 . 
5 66. oie.andinfhefollovringseientifiopnbnoa.ion,: V*,**.** . «" 

By immunotherapy is meant treatment of breast cancer with an antibody
recipient (patient). Induction of an immune response is the result of providing
expressing the antigen and under conditions for expression of the antigen, leading to an

antibodies used for treatment, bind and prevent the secreted protein from binding to its
receptor, thereby inactivating the secreted breast cancer protein.

In another preferred embodiment, the breast cancer protein to which

^badiea^r^ia.^^P^^^^^,^^ 

inhibitor or competitive inhibitor of protein binding to the extracellular domain of the
The antibody is also an antagonist of the breast cancer protein.
Further, the antibody prevents activation of the membrane breast cancer protein. In one
aspect, when the antibody prevents the binding of other molecules to the breast cancer
The antibody may also be used to target or
sensitize the cell to cytotoxic agents, cisplatin,

by administering to a patient antibodies directed
cancer protein. Antibody-labeling may activate a co-toxin, localize a toxin

associated with breast cancer.

In a preferred embodiment, the therapeutic moiety can also be a cytotoxic

agent. In this method, targeting the cytotoxic agent to breast cancer tissue or cells

corresponaingii c Cytotoxic agents also include 

rrbott^m^mempenncnm^,— OT e breasted proton* no. 
ITlstomclenreic^^^ 

but also serves to reduce deleterious side effects that may be associated with
the therapeutic moiety.

In another preferred embodiment, the breast cancer protein against which the
antibodies are raised is an intracellular protein. In this case, the antibody may be conjugated
to a protein which facilitates entry into the cell. In one case, the antibody enters the cell by
endocytosis. In another embodiment, a nucleic acid encoding the antibody is administered to
the individual or cell. Moreover, wherein the breast cancer protein can be targeted within a
cell, i.e., the nucleus, an antibody thereto contains a signal for that target localization, i.e., a
nuclear localization signal.

The breast cancer antibodies of the invention specifically bind to breast cancer 
proteins. By "specifically bind" herein is meant that the antibodies bind to the protein with a
Kd of at least about 0.1 mM, more usually at least about 1 µM, preferably at least about 0.1
µM or better, and most preferably, 0.01 µM or better. Selectivity of binding is also
important. 
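
The affinity thresholds in the preceding paragraph can be read as a set of tiers, where a lower dissociation constant means tighter binding. The following is a minimal sketch of that reading only; the tier labels and the function name are hypothetical and do not appear in the specification.

```python
# Illustrative sketch. Tier labels are hypothetical shorthand for the
# wording of the paragraph above; Kd values are expressed in mol/L.
KD_TIERS_MOLAR = [
    (1e-8, "most preferred"),  # 0.01 uM or better
    (1e-7, "preferred"),       # about 0.1 uM or better
    (1e-6, "more usual"),      # about 1 uM
    (1e-4, "minimum"),         # about 0.1 mM
]


def binding_tier(kd_molar: float) -> str:
    """Return the tightest tier whose Kd ceiling the measured Kd meets."""
    if kd_molar <= 0:
        raise ValueError("Kd must be positive")
    for ceiling, label in KD_TIERS_MOLAR:
        if kd_molar <= ceiling:
            return label
    return "not specific"
```

For instance, a measured Kd of 50 nM (5e-8 M) would fall in the "preferred" tier under this reading.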

Detection of breast cancer sequence for diagnostic and therapeutic applications 

In one aspect, the RNA expression levels of genes are determined for different
cellular states in the breast cancer phenotype. Expression levels of genes in normal tissue 
(i.e., not undergoing breast cancer) and in breast cancer tissue (and in some cases, for varying 
severities of breast cancer that relate to prognosis, as outlined below) are evaluated to provide 
expression profiles. An expression profile of a particular cell state or point of development is
essentially a "fingerprint" of the state. While two states may have any particular gene
similarly expressed, the evaluation of a number of genes simultaneously allows the 
generation of a gene expression profile that is reflective of the state of the cell. By comparing 
expression profiles of cells in different states, information regarding which genes are 
important (including both up- and down-regulation of genes) in each of these states is
obtained. Then, diagnosis may be performed or confirmed to determine whether a tissue
sample has the gene expression profile of normal or cancerous tissue. This will provide for 
molecular diagnosis of related conditions. 
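
One way to picture the comparison of a sample "fingerprint" against reference profiles is a correlation over several genes at once. The sketch below is an illustration under stated assumptions, not the method of the specification: the gene names and expression values are invented, and Pearson correlation on log2-transformed levels is only one of many possible similarity measures.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


def closest_state(sample, references):
    """Return (best matching reference name, all correlation scores)."""
    genes = sorted(sample)
    s = [math.log2(sample[g]) for g in genes]
    scores = {
        name: pearson(s, [math.log2(profile[g]) for g in genes])
        for name, profile in references.items()
    }
    return max(scores, key=scores.get), scores


# Hypothetical expression levels (arbitrary units) for three made-up genes.
references = {
    "normal": {"geneA": 100, "geneB": 200, "geneC": 50},
    "cancer": {"geneA": 800, "geneB": 40, "geneC": 55},
}
sample = {"geneA": 650, "geneB": 60, "geneC": 48}
state, scores = closest_state(sample, references)
```

No single gene decides the outcome here; the call rests on the pattern across all genes, which is the point the paragraph makes about evaluating a number of genes simultaneously.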

"Differential expression," or grammatical equivalents as used herein, refers to 
qualitative or quantitative differences in the temporal and/or cellular gene expression
patterns within and among cells and tissue. Thus, a differentially expressed gene can
qualitatively have its expression altered, including an activation or inactivation, in, e.g.,
normal versus breast cancer tissue. Genes may be turned on or turned off in a particular state, 
relative to another state thus permitting comparison of two or more states. A qualitatively 
regulated gene will exhibit an expression pattern within a state or cell type which is
detectable by standard techniques. Some genes will be expressed in one state or cell type, but
not in both. Alternatively, the difference in expression may be quantitative, e.g., in that 
expression is increased or decreased; i.e., gene expression is either upregulated, resulting in 
an increased amount of transcript, or downregulated, resulting in a decreased amount of 
transcript. The degree to which expression differs need only be large enough to quantify via
standard characterization techniques as outlined below, such as by use of Affymetrix
GeneChip™ expression arrays, Lockhart, Nature Biotechnology 14:1675-1680 (1996),
hereby expressly incorporated by reference. Other techniques include, but are not limited to,
quantitative reverse transcriptase PCR, northern analysis and RNase protection. As outlined
above, preferably the change in expression (i.e., upregulation or downregulation) is at least
about 50%, more preferably at least about 100%, more preferably at least about 150%, more
preferably at least about 200%, with from 300 to at least 1000% being especially preferred. 
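
The percentage thresholds above can be illustrated with a short calculation. This is a sketch under an assumption: the text does not define how the percent change is computed, so the conventional (observed - reference) / reference formula is used here. Under that formula a decrease can never exceed 100%, so the higher thresholds are reachable only for upregulation; a fold-change measure would be needed to treat both directions symmetrically.

```python
# Thresholds quoted in the paragraph above, most stringent first.
THRESHOLDS = [300.0, 200.0, 150.0, 100.0, 50.0]


def percent_change(reference: float, observed: float) -> float:
    """Signed percent change of observed relative to reference (assumed formula)."""
    if reference <= 0:
        raise ValueError("reference level must be positive")
    return (observed - reference) / reference * 100.0


def highest_threshold_met(reference: float, observed: float):
    """Return the largest listed threshold reached by |percent change|, or None."""
    magnitude = abs(percent_change(reference, observed))
    for threshold in THRESHOLDS:
        if magnitude >= threshold:
            return threshold
    return None
```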

Evaluation may be at the gene transcript, or the protein level. The amount of 
gene expression may be monitored using nucleic acid probes to the DNA or RNA equivalent 
of the gene transcript, and the quantification of gene expression levels, or, alternatively, the
final gene product itself (protein) can be monitored, e.g., with antibodies to the breast cancer
protein and standard immunoassays (ELISAs, etc.) or other techniques, including mass 
spectroscopy assays, 2D gel electrophoresis assays, etc. Proteins corresponding to breast 
cancer genes, i.e., those identified as being important in a breast cancer phenotype, can be 
evaluated in a breast cancer diagnostic test. 

In a preferred embodiment, gene expression monitoring is performed
simultaneously on a number of genes. Multiple protein expression monitoring can be 
performed as well. Similarly, these assays may be performed on an individual basis as well. 

In this embodiment, the breast cancer nucleic acid probes are attached to 
biochips as outlined herein for the detection and quantification of breast cancer sequences in
a particular cell. The assays are further described below in the example. PCR techniques can
be used to provide greater sensitivity. 

In a preferred embodiment nucleic acids encoding the breast cancer protein are 
detected. Although DNA or RNA encoding the breast cancer protein may be detected, of
particular interest are methods wherein an mRNA encoding a breast cancer protein is
detected. Probes to detect mRNA can be a nucleotide/deoxynucleotide probe that is 
complementary to and hybridizes with the mRNA and includes, but is not limited to, 
oligonucleotides, cDNA or RNA. Probes also should contain a detectable label, as defined 
herein. In one method the mRNA is detected after immobilizing the nucleic acid to be
examined on a solid support such as nylon membranes and hybridizing the probe with the
sample. Following washing to remove the non-specifically bound probe, the label is 
detected. In another method detection of the mRNA is performed in situ. In this method 
permeabilized cells or tissue samples are contacted with a detectably labeled nucleic acid 
probe for sufficient time to allow the probe to hybridize with the target mRNA. Following
washing to remove the non-specifically bound probe, the label is detected. For example, a
digoxigenin labeled riboprobe (RNA probe) that is complementary to the mRNA encoding a
breast cancer protein is detected by binding the digoxigenin with an anti-digoxigenin
secondary antibody and developed with nitro blue tetrazolium and 5-bromo-4-chloro-3-
indolyl phosphate.

In a preferred embodiment, various proteins from the three classes of proteins
as described herein (secreted, transmembrane or intracellular proteins) are used in diagnostic 
assays. The breast cancer proteins, antibodies, nucleic acids, modified proteins and cells 
containing breast cancer sequences are used in diagnostic assays. This can be performed on 
an individual gene or corresponding polypeptide level. In a preferred embodiment, the
expression profiles are used, preferably in conjunction with high throughput screening
techniques to allow monitoring for expression profile genes and/or corresponding 
polypeptides. 

As described and defined herein, breast cancer proteins, including 
intracellular, transmembrane or secreted proteins, find use as markers of breast cancer. 
Detection of these proteins in putative breast cancer tissue allows for detection or diagnosis
of breast cancer. In one embodiment, antibodies are used to detect breast cancer proteins. A
preferred method separates proteins from a sample by electrophoresis on a gel (typically a 
denaturing and reducing protein gel, but may be another type of gel, including isoelectric 
focusing gels and the like). Following separation of proteins, the breast cancer protein is 
detected, e.g., by immunoblotting with antibodies raised against the breast cancer protein. 
Methods of immunoblotting are well known to those of ordinary skill in the art. 

In another preferred method, antibodies to the breast cancer protein find use in 
in situ imaging techniques, e.g., in histology (e.g., Methods in Cell Biology: Antibodies in 
Cell Biology, volume 37 (Asai, ed. 1993)). In this method cells are contacted with from one 
to many antibodies to the breast cancer protein(s). Following washing to remove non- 
specific antibody binding, the presence of the antibody or antibodies is detected. In one 
embodiment the antibody is detected by incubating with a secondary antibody that contains a 
detectable label. In another method the primary antibody to the breast cancer protein(s) 
contains a detectable label, e.g., an enzyme marker that can act on a substrate. In another
preferred embodiment each one of multiple primary antibodies contains a distinct and 
detectable label. This method finds particular use in simultaneous screening for a plurality of 
breast cancer proteins. As will be appreciated by one of ordinary skill in the art, many other 
histological imaging techniques are also provided by the invention. 

In a preferred embodiment the label is detected in a fluorometer which has the 
ability to detect and distinguish emissions of different wavelengths. In addition, a 
fluorescence activated cell sorter (FACS) can be used in the method. 

In another preferred embodiment, antibodies find use in diagnosing breast 
cancer from blood, serum, plasma, stool, and other samples. Such samples, therefore, are 
useful as samples to be probed or tested for the presence of breast cancer proteins. 
Antibodies can be used to detect a breast cancer protein by previously described 
immunoassay techniques including ELISA, immunoblotting (western blotting), 
immunoprecipitation, BIACORE technology and the like. Conversely, the presence of 
antibodies may indicate an immune response against an endogenous breast cancer protein. 

In a preferred embodiment, in situ hybridization of labeled breast cancer 
nucleic acid probes to tissue arrays is done. For example, arrays of tissue samples, including 

WO 02/059377 PCT/US02/02242 



breast cancer tissue and/or normal tissue, are made. In situ hybridization (see, e.g., Ausubel, 
supra) is then performed. When comparing the fingerprints between an individual and a 
standard, the skilled artisan can make a diagnosis, a prognosis, or a prediction based on the 
findings. It is further understood that the genes which indicate the diagnosis may differ from 
those which indicate the prognosis and molecular profiling of the condition of the cells may
lead to distinctions between responsive or refractory conditions or may be predictive of 
outcomes. 

In a preferred embodiment, the breast cancer proteins, antibodies, nucleic 
acids, modified proteins and cells containing breast cancer sequences are used in prognosis
assays. As above, gene expression profiles can be generated that correlate to breast cancer,
in terms of long term prognosis. Again, this may be done on either a protein or gene level, 
with the use of genes being preferred. As above, breast cancer probes may be attached to 
biochips for the detection and quantification of breast cancer sequences in a tissue or patient. 
The assays proceed as outlined above for diagnosis. PCR methods may provide more
sensitive and accurate quantification.

Assays for therapeutic compounds 

In a preferred embodiment members of the proteins, nucleic acids, and 
antibodies as described herein are used in drug screening assays. The breast cancer proteins, 
antibodies, nucleic acids, modified proteins and cells containing breast cancer sequences are
used in drug screening assays or by evaluating the effect of drug candidates on a "gene 
expression profile" or expression profile of polypeptides. In a preferred embodiment, the 
expression profiles are used, preferably in conjunction with high throughput screening 
techniques to allow monitoring for expression profile genes after treatment with a candidate 
agent (e.g., Zlokarnik, et al., Science 279:84-8 (1998); Heid, Genome Res 6:986-94, 1996).

In a preferred embodiment, the breast cancer proteins, antibodies, nucleic 
acids, modified proteins and cells containing the native or modified breast cancer proteins are 
used in screening assays. That is, the present invention provides novel methods for screening 
for compositions which modulate the breast cancer phenotype or an identified physiological 
function of a breast cancer protein. As above, this can be done on an individual gene level or
by evaluating the effect of drug candidates on a "gene expression profile." In a preferred
embodiment, the expression profiles are used, preferably in conjunction with high throughput 
screening techniques to allow monitoring for expression profile genes after treatment with a 
candidate agent, see Zlokarnik, supra.

Having identified the differentially expressed genes herein, a variety of assays
may be executed. In a preferred embodiment, assays may be run on an individual gene or
protein level. That is, having identified a particular gene as upregulated in breast cancer, test
compounds can be screened for the ability to modulate gene expression or for binding to the 
breast cancer protein. "Modulation" thus includes both an increase and a decrease in gene
expression. The preferred amount of modulation will depend on the original change of the
gene expression in normal versus tissue undergoing breast cancer, with changes of at least 
10%, preferably 50%, more preferably 100-300%, and in some embodiments 300-1000% or 
greater. Thus, if a gene exhibits a 4-fold increase in breast cancer tissue compared to normal 
tissue, a decrease of about four-fold is often desired; similarly, a 10-fold decrease in breast
cancer tissue compared to normal tissue often provides a target value of a 10-fold increase in
expression to be induced by the test compound. 
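
The arithmetic behind the target value in the preceding paragraph is simply the inverse of the observed fold change. A minimal sketch follows; the function names are hypothetical and the expression levels are whatever units the chosen assay reports.

```python
def fold_change(normal: float, cancer: float) -> float:
    """Cancer level divided by normal level; >1 means upregulated in cancer."""
    if normal <= 0 or cancer <= 0:
        raise ValueError("expression levels must be positive")
    return cancer / normal


def target_modulation(normal: float, cancer: float) -> tuple[str, float]:
    """Direction and fold magnitude a test compound would need to induce
    to bring the cancer-tissue level back to the normal-tissue level."""
    fc = fold_change(normal, cancer)
    if fc > 1:
        return ("decrease", fc)
    if fc < 1:
        return ("increase", 1 / fc)
    return ("none", 1.0)
```

With a 4-fold increase in cancer tissue (for example 10 versus 40 units), the sketch returns a 4-fold decrease as the target, matching the example in the text.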

The amount of gene expression may be monitored using nucleic acid probes 
and the quantification of gene expression levels, or, alternatively, the gene product itself can 
be monitored, e.g., through the use of antibodies to the breast cancer protein and standard
immunoassays. Proteomics and separation techniques may also allow quantification of
expression. 

In a preferred embodiment, gene expression or protein monitoring of a number 
of entities, i.e., an expression profile, is monitored simultaneously. Such profiles will 
typically involve a plurality of those entities described herein.

In this embodiment, the breast cancer nucleic acid probes are attached to
biochips as outlined herein for the detection and quantification of breast cancer sequences in 
a particular cell. Alternatively, PCR may be used. Thus, a series, e.g., of microtiter plates,
may be used with dispensed primers in desired wells. A PCR reaction can then be performed 
and analyzed for each well.

Expression monitoring can be performed to identify compounds that modify
the expression of one or more breast cancer-associated sequences, e.g., a polynucleotide
sequence set out in Table 17. Generally, in a preferred embodiment, a test modulator is added
to the cells prior to analysis. Moreover, screens are also provided to identify agents that
modulate breast cancer, modulate breast cancer proteins, bind to a breast cancer protein, or
interfere with the binding of a breast cancer protein and an antibody or other binding partner. 

The term "test compound" or "drug candidate" or "modulator" or grammatical 
equivalents as used herein describes any molecule, e.g., protein, oligopeptide, small organic 
molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or
indirectly alter the breast cancer phenotype or the expression of a breast cancer sequence,
e.g., a nucleic acid or protein sequence. In preferred embodiments, modulators alter 
expression profiles, or expression profile nucleic acids or proteins provided herein. In one 
embodiment, the modulator suppresses a breast cancer phenotype, e.g. to a normal tissue 
fingerprint. In another embodiment, a modulator induces a breast cancer phenotype.
Generally, a plurality of assay mixtures are run in parallel with different agent concentrations
to obtain a differential response to the various concentrations. Typically, one of these 
concentrations serves as a negative control, i.e., at zero concentration or below the level of 
detection. 
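
A parallel set of assay concentrations with a zero-concentration negative control might be laid out as follows. This is a sketch only: the specification does not prescribe a dilution scheme, so the serial dilution, its parameters, and the function name are assumptions made for illustration.

```python
def concentration_series(top: float, dilution_factor: float, n_points: int):
    """Serial dilution starting from `top`, returned lowest first, with a
    zero-concentration negative control prepended."""
    if top <= 0 or dilution_factor <= 1 or n_points < 1:
        raise ValueError("need top > 0, dilution_factor > 1, n_points >= 1")
    dilutions = [top / dilution_factor ** i for i in range(n_points)]
    return [0.0] + dilutions[::-1]
```

Each returned value would correspond to one assay mixture run in parallel; the first entry is the negative control.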

Drug candidates encompass numerous chemical classes, though typically they 
are organic molecules, preferably small organic compounds having a molecular weight of
more than 100 and less than about 2,500 daltons. Preferred small molecules are less than
2000, or less than 1500 or less than 1000 or less than 500 D. Candidate agents comprise
functional groups necessary for structural interaction with proteins, particularly hydrogen
bonding, and typically include at least an amine, carbonyl, hydroxyl or carboxyl group,
preferably at least two of the functional chemical groups. The candidate agents often
comprise cyclical carbon or heterocyclic structures and/or aromatic or polyaromatic
structures substituted with one or more of the above functional groups. Candidate agents are
also found among biomolecules including peptides, saccharides, fatty acids, steroids, purines,
pyrimidines, derivatives, structural analogs or combinations thereof. Particularly preferred 
are peptides.

In one aspect, a modulator will neutralize the effect of a breast cancer protein.
By "neutralize" is meant that activity of a protein is inhibited or blocked and the consequent
effect on the cell.

In certain embodiments, combinatorial libraries of potential modulators will be 
screened for an ability to bind to a breast cancer polypeptide or to modulate activity.
Conventionally, new chemical entities with useful properties are generated by identifying a 
chemical compound (called a "lead compound") with some desirable property or activity, 
e.g., inhibiting activity, creating variants of the lead compound, and evaluating the property 
and activity of those variant compounds. Often, high throughput screening (HTS) methods
are employed for such an analysis.

In one preferred embodiment, high throughput screening methods involve 
providing a library containing a large number of potential therapeutic compounds (candidate 
compounds). Such "combinatorial chemical libraries" are then screened in one or more 
assays to identify those library members (particular chemical species or subclasses) that
display a desired characteristic activity. The compounds thus identified can serve as
conventional "lead compounds" or can themselves be used as potential or actual therapeutics. 

A combinatorial chemical library is a collection of diverse chemical 
compounds generated by either chemical synthesis or biological synthesis by combining a 
number of chemical "building blocks" such as reagents. For example, a linear combinatorial
chemical library, such as a polypeptide (e.g., mutein) library, is formed by combining a set of
chemical building blocks called amino acids in every possible way for a given compound 
length (i.e., the number of amino acids in a polypeptide compound). Millions of chemical 
compounds can be synthesized through such combinatorial mixing of chemical building 
blocks (Gallop et al., J. Med. Chem. 37(9):1233-1251 (1994)).
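
The scale implied by "every possible way for a given compound length" can be made concrete with a short enumeration. The sketch below assumes the 20 standard amino acids as the building blocks; it illustrates the counting argument only and says nothing about how such a library would be synthesized.

```python
from itertools import product

# One-letter codes of the 20 standard amino acids (assumed building blocks).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def library_size(length: int, alphabet: str = AMINO_ACIDS) -> int:
    """Number of distinct linear sequences of the given length."""
    return len(alphabet) ** length


def enumerate_library(length: int, alphabet: str = AMINO_ACIDS):
    """Lazily yield every sequence of the given length."""
    for combo in product(alphabet, repeat=length):
        yield "".join(combo)
```

Under this assumption a pentapeptide library already contains 20^5 = 3,200,000 members, which is consistent with the "millions of chemical compounds" mentioned in the text.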

Preparation and screening of combinatorial chemical libraries is well known to
those of skill in the art. Such combinatorial chemical libraries include, but are not limited to,
peptide libraries (see, e.g., U.S. Patent No. 5,010,175, Furka, Pept. Prot. Res. 37:487-493
(1991), Houghton et al., Nature, 354:84-88 (1991)), peptoids (PCT Publication No. WO
91/19735), encoded peptides (PCT Publication WO 93/20242), random bio-oligomers (PCT
Publication WO 92/00091), benzodiazepines (U.S. Pat. No. 5,288,514), diversomers such as
hydantoins, benzodiazepines and dipeptides (Hobbs et al., Proc. Nat. Acad. Sci. USA
90:6909-6913 (1993)), vinylogous polypeptides (Hagihara et al., J. Amer. Chem. Soc.
114:6568 (1992)), nonpeptidal peptidomimetics with a Beta-D-Glucose scaffolding
(Hirschmann et al., J. Amer. Chem. Soc. 114:9217-9218 (1992)), analogous organic syntheses
of small compound libraries (Chen et al., J. Amer. Chem. Soc. 116:2661 (1994)),
oligocarbamates (Cho, et al., Science 261:1303 (1993)), and/or peptidyl phosphonates
(Campbell et al., J. Org. Chem. 59:658 (1994)). See, generally, Gordon et al., J. Med. Chem.
37:1385 (1994), nucleic acid libraries (see, e.g., Stratagene, Corp.), peptide nucleic acid
libraries (see, e.g., U.S. Patent 5,539,083), antibody libraries (see, e.g., Vaughn et al., Nature
Biotechnology 14(3):309-314 (1996), and PCT/US96/10287), carbohydrate libraries (see,
e.g., Liang et al., Science 274:1520-1522 (1996), and U.S. Patent No. 5,593,853), and small
organic molecule libraries (see, e.g., benzodiazepines, Baum, C&EN, Jan 18, page 33 (1993);
isoprenoids, U.S. Patent No. 5,569,588; thiazolidinones and metathiazanones, U.S. Patent No.
5,549,974; pyrrolidines, U.S. Patent Nos. 5,525,735 and 5,519,134; morpholino compounds,
U.S. Patent No. 5,506,337; benzodiazepines, U.S. Patent No. 5,288,514; and the like).

Devices for the preparation of combinatorial libraries are commercially 
available (see, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY, Symphony,
Rainin, Woburn, MA, 433A Applied Biosystems, Foster City, CA, 9050 Plus, Millipore, 
Bedford, MA). 

A number of well known robotic systems have also been developed for
solution phase chemistries. These systems include automated workstations like the 
automated synthesis apparatus developed by Takeda Chemical Industries, LTD, (Osaka, 
Japan) and many robotic systems utilizing robotic arms (Zymate II, Zymark Corporation, 
Hopkinton, Mass.; Orca, Hewlett-Packard, Palo Alto, Calif.), which mimic the manual
synthetic operations performed by a chemist. Any of the above devices are suitable for use
with the present invention. The nature and implementation of modifications to these devices 
(if any) so that they can operate as discussed herein will be apparent to persons skilled in the 
relevant art. In addition, numerous combinatorial libraries are themselves commercially 
available (see, e.g., ComGenex, Princeton, N.J., Asinex, Moscow, RU, Tripos, Inc., St. Louis,
MO, ChemStar, Ltd, Moscow, RU, 3D Pharmaceuticals, Exton, PA, Martek Biosciences,
Columbia, MD, etc.). 

The assays to identify modulators are amenable to high throughput screening. 
Preferred assays thus detect enhancement or inhibition of breast cancer gene transcription,
inhibition or enhancement of polypeptide expression, and inhibition or enhancement of
polypeptide activity. 

High throughput assays for the presence, absence, quantification, or other 
properties of particular nucleic acids or protein products are well known to those of skill in 
the art. Binding assays and reporter gene assays are similarly well known. Thus,
e.g., U.S. Patent No. 5,559,410 discloses high throughput screening methods for proteins,
U.S. Patent No. 5,585,639 discloses high throughput screening methods for nucleic acid 
binding (i.e., in arrays), while U.S. Patent Nos. 5,576,220 and 5,541,061 disclose high 
throughput methods of screening for ligand/antibody binding. 

In addition, high throughput screening systems are commercially available
(see, e.g., Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman
Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA, etc.). These systems 
typically automate entire procedures, including all sample and reagent pipetting, liquid 
dispensing, timed incubations, and final readings of the microplate in detector(s) appropriate 
for the assay. These configurable systems provide high throughput and rapid start up as well
as a high degree of flexibility and customization. The manufacturers of such systems provide
detailed protocols for various high throughput systems. Thus, e.g., Zymark Corp. provides 
technical bulletins describing screening systems for detecting the modulation of gene 
transcription, ligand binding, and the like. 

In one embodiment, modulators are proteins, often naturally occurring
proteins or fragments of naturally occurring proteins. Thus, e.g., cellular extracts containing
proteins, or random or directed digests of proteinaceous cellular extracts, may be used. In 
this way libraries of proteins may be made for screening in the methods of the invention. 
Particularly preferred in this embodiment are libraries of bacterial, fungal, viral, and 
mammalian proteins, with the latter being preferred, and human proteins being especially
preferred. Particularly useful test compounds will be directed to the class of proteins to which
the target belongs, e.g., substrates for enzymes or ligands and receptors. 

In a preferred embodiment, modulators are peptides of from about 5 to about 
30 amino acids, with from about 5 to about 20 amino acids being preferred, and from about 7
to about 15 being particularly preferred. The peptides may be digests of naturally occurring
proteins as is outlined above, random peptides, or "biased" random peptides. By 
"randomized" or grammatical equivalents herein is meant that each nucleic acid and peptide 
consists of essentially random nucleotides and amino acids, respectively. Since generally 
these random peptides (or nucleic acids, discussed below) are chemically synthesized, they
may incorporate any nucleotide or amino acid at any position. The synthetic process can be
designed to generate randomized proteins or nucleic acids, to allow the formation of all or 
most of the possible combinations over the length of the sequence, thus forming a library of 
randomized candidate bioactive proteinaceous agents. 

In one embodiment, the library is fully randomized, with no sequence
preferences or constants at any position. In a preferred embodiment, the library is biased.
That is, some positions within the sequence are either held constant, or are selected from a
limited number of possibilities. For example, in a preferred embodiment, the nucleotides or
amino acid residues are randomized within a defined class, e.g., of hydrophobic amino acids,
hydrophilic residues, sterically biased (either small or large) residues, towards the creation of
nucleic acid binding domains, the creation of cysteines, for cross-linking, prolines for SH-3
domains, serines, threonines, tyrosines or histidines for phosphorylation sites, etc., or to
purines, etc.
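
As an illustration only, the difference between a fully randomized library and a biased library can be sketched in a few lines of Python. The template notation, the residue grouping in HYDROPHOBIC, and all function names below are assumptions made for this sketch and are not taken from the specification.

```python
import random

# One-letter codes for the 20 common amino acids.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
# Illustrative residue class; this particular grouping is an assumption.
HYDROPHOBIC = "AVLIMFW"


def random_peptide(length, rng):
    """Fully randomized peptide: any residue may occur at any position."""
    return "".join(rng.choice(AMINO_ACIDS) for _ in range(length))


def biased_peptide(template, classes, rng):
    """Biased peptide built from a template string.

    Each template character is either 'X' (any residue), a key of
    `classes` (residue drawn from that limited class), or a literal
    residue that is held constant at that position.
    """
    out = []
    for ch in template:
        if ch == "X":
            out.append(rng.choice(AMINO_ACIDS))
        elif ch in classes:
            out.append(rng.choice(classes[ch]))
        else:
            out.append(ch)
    return "".join(out)


def make_library(size, template, classes, seed=0):
    """Generate `size` biased peptides from one template (hypothetical helper)."""
    rng = random.Random(seed)
    return [biased_peptide(template, classes, rng) for _ in range(size)]


if __name__ == "__main__":
    # Cysteines fixed at both ends, 'h' = hydrophobic class, 'X' = any residue.
    for peptide in make_library(5, "ChXXhXXC", {"h": HYDROPHOBIC}):
        print(peptide)
```

Lower-case class keys are used so that they cannot be confused with the upper-case residue codes that are held constant.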

Modulators of breast cancer can also be nucleic acids, as defined above.
As described above generally for proteins, nucleic acid modulating agents may
be naturally occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids.
For example, digests of procaryotic or eucaryotic genomes may be used as is outlined above
for proteins.

In a preferred embodiment, the candidate compounds are organic chemical
moieties, a wide variety of which are available in the literature.

After the candidate agent has been added and the cells allowed to incubate for
some period of time, the sample containing a target sequence to be analyzed is added to the
biochip. If required, the target sequence is prepared using known techniques. For example,
the sample may be treated to lyse the cells, using known lysis buffers, electroporation, etc.,
with purification and/or amplification such as PCR performed as appropriate. For example,
an in vitro transcription with labels covalently attached to the nucleotides is performed.
Generally, the nucleic acids are labeled with biotin-FITC or PE, or with cy3 or cy5.

In a preferred embodiment, the target sequence is labeled with, e.g., a
fluorescent, a chemiluminescent, a chemical, or a radioactive signal, to provide a means of
detecting the target sequence's specific binding to a probe. The label also can be an enzyme,
such as alkaline phosphatase or horseradish peroxidase, which when provided with an
appropriate substrate produces a product that can be detected. Alternatively, the label can be
a labeled compound or small molecule, such as an enzyme inhibitor, that binds but is not
catalyzed or altered by the enzyme. The label also can be a moiety or compound, such as an
epitope tag or biotin which specifically binds to streptavidin. For the example of biotin, the
streptavidin is labeled as described above, thereby providing a detectable signal for the
bound target sequence. Unbound labeled streptavidin is typically removed prior to analysis.

As will be appreciated by those in the art, these assays can be direct
hybridization assays or can comprise "sandwich assays", which include the use of multiple
probes, as is generally outlined in U.S. Patent Nos. 5,681,702, 5,597,909, 5,545,730,
5,594,117, 5,591,584, 5,571,670, 5,580,731, 5,571,670, 5,591,584, 5,624,802, 5,635,352,
5,594,118, 5,359,100, 5,124,246 and 5,681,697, all of which are hereby incorporated by
reference. In this embodiment, in general, the target nucleic acid is prepared as outlined
above, and then added to the biochip comprising a plurality of nucleic acid probes, under
conditions that allow the formation of a hybridization complex.

A variety of hybridization conditions may be used in the present invention,
including high, moderate and low stringency conditions as outlined above. The assays are
generally run under stringency conditions which allow formation of the label probe
hybridization complex only in the presence of target. Stringency can be controlled by
altering a step parameter that is a thermodynamic variable, including, but not limited to,
temperature, formamide concentration, salt concentration, chaotropic salt concentration, pH,
organic solvent concentration, etc.

These parameters may also be used to control non-specific binding, as is
generally outlined in U.S. Patent No. 5,681,697. Thus it may be desirable to perform certain
steps at higher stringency conditions to reduce non-specific binding.
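
To make the effect of these variables concrete, the following Python sketch uses a widely quoted empirical approximation for the melting temperature of long DNA:DNA hybrids. The formula and its coefficients are not part of this specification and vary somewhat between references; they are included here only as an assumption to show how salt, formamide and probe length shift stringency.

```python
import math


def estimate_tm_celsius(length, gc_percent, na_molar, formamide_percent=0.0):
    """Approximate Tm (deg C) of a long DNA:DNA hybrid.

    Assumed empirical form (not from the specification):
        Tm = 81.5 + 16.6*log10[Na+] + 0.41*(%GC) - 0.61*(%formamide) - 500/L
    """
    if length <= 0 or na_molar <= 0:
        raise ValueError("length and sodium concentration must be positive")
    return (81.5
            + 16.6 * math.log10(na_molar)
            + 0.41 * gc_percent
            - 0.61 * formamide_percent
            - 500.0 / length)


if __name__ == "__main__":
    # Lower salt or added formamide lowers Tm, so a fixed wash
    # temperature becomes more stringent.
    for na, formamide in [(1.0, 0.0), (0.1, 0.0), (0.1, 50.0)]:
        tm = estimate_tm_celsius(500, 50.0, na, formamide)
        print(f"[Na+]={na} M, formamide={formamide}% -> Tm ~ {tm:.1f} C")
```

Under this approximation, a hybridization or wash step run closer to the estimated Tm is more stringent, which is one way to reason about reducing non-specific binding.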

The reactions outlined herein may be accomplished in a variety of ways.
Components of the reaction may be added simultaneously, or sequentially, in different orders,
with preferred embodiments outlined below. In addition, the reaction may include a variety
of other reagents. These include salts, buffers, neutral proteins, e.g. albumin, detergents, etc.
which may be used to facilitate optimal hybridization and detection, and/or reduce
non-specific or background interactions. Reagents that otherwise improve the efficiency of the
assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may also be
used as appropriate, depending on the sample preparation methods and purity of the target.

The assay data are analyzed to determine the expression levels, and changes in
expression levels as between states, of individual genes, forming a gene expression profile.
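
One simple way to represent such a profile, offered here only as a sketch, is a table of log2 ratios between two states. The gene names, the pseudocount, and the two-fold cutoff below are hypothetical choices for illustration, not values given in the specification.

```python
import math


def expression_profile(state_a, state_b, pseudocount=1.0):
    """Return {gene: log2(level_b / level_a)} for genes measured in both states.

    A small pseudocount (an assumption of this sketch) avoids division
    by zero for genes with no detectable signal in one state.
    """
    profile = {}
    for gene in state_a.keys() & state_b.keys():
        a = state_a[gene] + pseudocount
        b = state_b[gene] + pseudocount
        profile[gene] = math.log2(b / a)
    return profile


def differentially_expressed(profile, min_abs_log2=1.0):
    """Genes whose level changes at least 2**min_abs_log2-fold, sorted by name."""
    return sorted(g for g, lfc in profile.items() if abs(lfc) >= min_abs_log2)


if __name__ == "__main__":
    # Hypothetical signal intensities for three placeholder genes.
    normal = {"GENE_A": 99, "GENE_B": 49, "GENE_C": 10}
    tumor = {"GENE_A": 399, "GENE_B": 49, "GENE_C": 0}
    profile = expression_profile(normal, tumor)
    print(profile)
    print(differentially_expressed(profile))
```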

Screens are performed to identify modulators of the breast cancer phenotype.
In one embodiment, screening is performed to identify modulators that can induce or
suppress a particular expression profile, thus preferably generating the associated phenotype.
In another embodiment, e.g., for diagnostic applications, having identified differentially
expressed genes important in a particular state, screens can be performed to identify
modulators that alter expression of individual genes. In another embodiment, screening is
performed to identify modulators that alter a biological function of the expression product of
a differentially expressed gene. Again, having identified the importance of a gene in a
particular state, screens are performed to identify agents that bind and/or modulate the
biological activity of the gene product.

In addition, screens can be done for genes that are induced in response to a
candidate agent. After identifying a modulator based upon its ability to suppress a breast
cancer expression pattern leading to a normal expression pattern, or to modulate a single
breast cancer gene expression profile so as to mimic the expression of the gene from normal
tissue, a screen as described above can be performed to identify genes that are specifically
modulated in response to the agent. Comparing expression profiles between normal tissue
and agent treated breast cancer tissue reveals genes that are not expressed in normal tissue or
breast cancer tissue, but are expressed in agent treated tissue. These agent-specific sequences
can be identified and used by methods described herein for breast cancer genes or proteins.
In particular these sequences and the proteins they encode find use in marking or identifying
agent treated cells. In addition, antibodies can be raised against the agent induced proteins
and used to target novel therapeutics to the treated breast cancer tissue sample.

Thus, in one embodiment, a test compound is administered to a population of
breast cancer cells that have an associated breast cancer expression profile. By
"administration" or "contacting" herein is meant that the candidate agent is added to the cells
in such a manner as to allow the agent to act upon the cell, whether by uptake and
intracellular action, or by action at the cell surface. In some embodiments, nucleic acid
encoding a proteinaceous candidate agent (i.e., a peptide) may be put into a viral construct
such as an adenoviral or retroviral construct, and added to the cell, such that expression of
the peptide agent is accomplished, e.g., PCT US97/01019. Regulatable gene therapy systems
can also be used.

Once the test compound has been administered to the cells, the cells can be
washed if desired and are allowed to incubate under preferably physiological conditions for
some period of time. The cells are then harvested and a new gene expression profile is
generated, as outlined herein.

Thus, e.g., breast cancer tissue may be screened for agents that modulate, e.g.,
induce or suppress the breast cancer phenotype. A change in at least one gene, preferably
many, of the expression profile indicates that the agent has an effect on breast cancer activity.
By defining such a signature for the breast cancer phenotype, screens for new drugs that alter
the phenotype can be devised. With this approach, the drug target need not be known and
need not be represented in the original expression screening platform, nor does the level of
transcript for the target protein need to change.
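
A signature-based screen of this kind could, for example, score each agent by how many signature genes move back toward their normal level after treatment. The scoring rule and all names below are assumptions made for illustration; the specification does not prescribe a particular score.

```python
def reversal_fraction(normal, tumor, treated, signature_genes):
    """Fraction of signature genes whose treated level is closer to normal.

    `normal`, `tumor` and `treated` map gene name -> expression level.
    This scoring rule is a hypothetical example, not a prescribed method.
    """
    if not signature_genes:
        raise ValueError("signature must contain at least one gene")
    moved = 0
    for gene in signature_genes:
        before = abs(tumor[gene] - normal[gene])
        after = abs(treated[gene] - normal[gene])
        if after < before:
            moved += 1
    return moved / len(signature_genes)


if __name__ == "__main__":
    # Placeholder levels for three hypothetical signature genes.
    normal = {"g1": 1.0, "g2": 5.0, "g3": 3.0}
    tumor = {"g1": 8.0, "g2": 1.0, "g3": 9.0}
    treated = {"g1": 2.0, "g2": 4.0, "g3": 9.0}
    score = reversal_fraction(normal, tumor, treated, ["g1", "g2", "g3"])
    print(f"{score:.2f} of signature genes moved toward normal")
```

Because the score depends only on the profile, it does not require the drug target itself to be among the measured genes.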

In a preferred embodiment, as outlined above, screens may be done on
individual genes and gene products (proteins). That is, having identified a particular
differentially expressed gene as important in a particular state, screening of modulators of
either the expression of the gene or the gene product itself can be done. The gene products of
differentially expressed genes are sometimes referred to herein as "breast cancer proteins" or
a "breast cancer modulatory protein". The breast cancer modulatory protein may be a
fragment, or alternatively, be the full length protein to the fragment encoded by the nucleic
acids of the Tables. Preferably, the breast cancer modulatory protein is a fragment. In a
preferred embodiment, the breast cancer amino acid sequence which is used to determine
sequence identity or similarity is encoded by a nucleic acid of Table 25. In another
embodiment, the sequences are naturally occurring allelic variants of a protein encoded by a
nucleic acid of Table 25. In another embodiment, the sequences are sequence variants as
further described herein.

Preferably, the breast cancer modulatory protein is a fragment
approximately 14 to 24 amino acids long. More preferably the fragment is a soluble
fragment. Preferably, the fragment includes a non-transmembrane region. In a preferred
embodiment, the fragment has an N-terminal Cys to aid in solubility. In one embodiment, the
C-terminus of the fragment is kept as a free acid and the N-terminus is a free amine to aid in
coupling, i.e., to cysteine.

In one embodiment the breast cancer proteins are conjugated to an 
immunogenic agent as discussed herein. In one embodiment the breast cancer protein is 
conjugated to BSA. 

Measurements of breast cancer polypeptide activity, or of breast cancer or the
breast cancer phenotype can be performed using a variety of assays. For example, the effects
of the test compounds upon the function of the breast cancer polypeptides can be measured
by examining parameters described above. A suitable physiological change that affects
activity can be used to assess the influence of a test compound on the polypeptides of this
invention. When the functional consequences are determined using intact cells or animals,
one can also measure a variety of effects such as, in the case of breast cancer associated with
tumors, tumor growth, tumor metastasis, neovascularization, hormone release, transcriptional
changes to both known and uncharacterized genetic markers (e.g., northern blots), changes in
cell metabolism such as cell growth or pH changes, and changes in intracellular second
messengers such as cGMP. In the assays of the invention, mammalian breast cancer
polypeptide is typically used, e.g., mouse, preferably human.

Assays to identify compounds with modulating activity can be performed in
vitro. For example, a breast cancer polypeptide is first contacted with a potential modulator
and incubated for a suitable amount of time, e.g., from 0.5 to 48 hours. In one embodiment,
the breast cancer polypeptide levels are determined in vitro by measuring the level of protein
or mRNA. The level of protein is measured using immunoassays such as western blotting,
ELISA and the like with an antibody that selectively binds to the breast cancer polypeptide or
a fragment thereof. For measurement of mRNA, amplification, e.g., using PCR, LCR, or
hybridization assays, e.g., northern hybridization, RNAse protection, dot blotting, are
preferred. The level of protein or mRNA is detected using directly or indirectly labeled
detection agents, e.g., fluorescently or radioactively labeled nucleic acids, radioactively or
enzymatically labeled antibodies, and the like, as described herein.

Alternatively, a reporter gene system can be devised using the breast cancer
protein promoter operably linked to a reporter gene such as luciferase, green fluorescent
protein, CAT, or β-gal. The reporter construct is typically transfected into a cell. After
treatment with a potential modulator, the amount of reporter gene transcription, translation, or
activity is measured according to standard techniques known to those of skill in the art.

In a preferred embodiment, as outlined above, screens may be done on
individual genes and gene products (proteins). That is, having identified a particular
differentially expressed gene as important in a particular state, screening of modulators of the
expression of the gene or the gene product itself can be done. The gene products of
differentially expressed genes are sometimes referred to herein as "breast cancer proteins."
The breast cancer protein may be a fragment, or alternatively, be the full length protein to a
fragment shown herein.

In one embodiment, screening for modulators of expression of specific genes
is performed. Typically, the expression of only one or a few genes is evaluated. In another
embodiment, screens are designed to first find compounds that bind to differentially
expressed proteins. These compounds are then evaluated for the ability to modulate
differentially expressed activity. Moreover, once initial candidate compounds are identified,
variants can be further screened to better evaluate structure-activity relationships.

In a preferred embodiment, binding assays are done. In general, purified or
isolated gene product is used; that is, the gene products of one or more differentially
expressed nucleic acids are made. For example, antibodies are generated to the protein gene
products, and standard immunoassays are run to determine the amount of protein present.
Alternatively, cells comprising the breast cancer proteins can be used in the assays.

Thus, in a preferred embodiment, the methods comprise combining a breast
cancer protein and a candidate compound, and determining the binding of the compound to
the breast cancer protein. Preferred embodiments utilize the human breast cancer protein,
although other mammalian proteins may also be used, e.g. for the development of animal
models of human disease. In some embodiments, as outlined herein, variant or derivative
breast cancer proteins may be used.

Generally, in a preferred embodiment of the methods herein, the breast cancer
protein or the candidate agent is non-diffusably bound to an insoluble support having isolated
sample receiving areas (e.g. a microtiter plate, an array, etc.). The insoluble supports may be
made of any composition to which the compositions can be bound, that is readily separated from
soluble material, and that is otherwise compatible with the overall method of screening. The
surface of such supports may be solid or porous and of any convenient shape. Examples of
suitable insoluble supports include microtiter plates, arrays, membranes and beads. These are
typically made of glass, plastic (e.g., polystyrene), polysaccharides, nylon or nitrocellulose,
Teflon™, etc. Microtiter plates and arrays are especially convenient because a large number
of assays can be carried out simultaneously, using small amounts of reagents and samples.
The particular manner of binding of the composition is not crucial so long as it is compatible
with the reagents and overall methods of the invention, maintains the activity of the
composition and is nondiffusable. Preferred methods of binding include the use of antibodies
(which do not sterically block either the ligand binding site or activation sequence when the
protein is bound to the support), direct binding to "sticky" or ionic supports, chemical
crosslinking, the synthesis of the protein or agent on the surface, etc. Following binding of
the protein or agent, excess unbound material is removed by washing. The sample receiving
areas may then be blocked through incubation with bovine serum albumin (BSA), casein or
other innocuous protein or other moiety.

In a preferred embodiment, the breast cancer protein is bound to the support, 
and a test compound is added to the assay. Alternatively, the candidate agent is bound to the 
support and the breast cancer protein is added. Novel binding agents include specific 
antibodies, non-natural binding agents identified in screens of chemical libraries, peptide 
analogs, etc. Of particular interest are screening assays for agents that have a low toxicity for 
human cells. A wide variety of assays may be used for this purpose, including labeled in 
vitro protein-protein binding assays, electrophoretic mobility shift assays, immunoassays for 
protein binding, functional assays (phosphorylation assays, etc.) and the like. 

The determination of the binding of the test modulating compound to the 
breast cancer protein may be done in a number of ways. In a preferred embodiment, the 
compound is labeled, and binding determined directly, e.g., by attaching all or a portion of 
the breast cancer protein to a solid support, adding a labeled candidate agent (e.g., a 
fluorescent label), washing off excess reagent, and determining whether the label is present 
on the solid support. Various blocking and washing steps may be utilized as appropriate. 

In some embodiments, only one of the components is labeled, e.g., the
proteins (or proteinaceous candidate compounds) can be labeled. Alternatively, more than
one component can be labeled with different labels, e.g., 125I for the proteins and a fluorophore
for the compound. Proximity reagents, e.g., quenching or energy transfer reagents, are also
useful.

In one embodiment, the binding of the test compound is determined by a
competitive binding assay. The competitor is a binding moiety known to bind to the target
molecule (i.e., a breast cancer protein), such as an antibody, peptide, binding partner, ligand,
etc. Under certain circumstances, there may be competitive binding between the compound
and the binding moiety, with the binding moiety displacing the compound. In one
embodiment, the test compound is labeled. Either the compound, or the competitor, or both,
is added first to the protein for a time sufficient to allow binding, if present. Incubations may
be performed at a temperature which facilitates optimal activity, typically between 4 and
40°C. Incubation periods are typically optimized, e.g., to facilitate rapid high throughput
screening. Typically between 0.1 and 1 hour will be sufficient. Excess reagent is generally
removed or washed away. The second component is then added, and the presence or absence
of the labeled component is followed, to indicate binding.

In a preferred embodiment, the competitor is added first, followed by the test
compound. Displacement of the competitor is an indication that the test compound is binding
to the breast cancer protein and thus is capable of binding to, and potentially modulating, the
activity of the breast cancer protein. In this embodiment, either component can be labeled.
Thus, e.g., if the competitor is labeled, the presence of label in the wash solution indicates
displacement by the agent. Alternatively, if the test compound is labeled, the presence of the
label on the support indicates displacement.

In an alternative embodiment, the test compound is added first, with
incubation and washing, followed by the competitor. The absence of binding by the
competitor may indicate that the test compound is bound to the breast cancer protein with a
higher affinity. Thus, if the test compound is labeled, the presence of the label on the
support, coupled with a lack of competitor binding, may indicate that the test compound is
capable of binding to the breast cancer protein.

In a preferred embodiment, the methods comprise differential screening to
identify agents that are capable of modulating the activity of the breast cancer proteins. In
this embodiment, the methods comprise combining a breast cancer protein and a competitor
in a first sample. A second sample comprises a test compound, a breast cancer protein, and a
competitor. The binding of the competitor is determined for both samples, and a change, or
difference in binding between the two samples indicates the presence of an agent capable of
binding to the breast cancer protein and potentially modulating its activity. That is, if the
binding of the competitor is different in the second sample relative to the first sample, the
agent is capable of binding to the breast cancer protein.
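
The comparison between the first sample (protein plus competitor) and the second sample (protein, competitor and test compound) can be summarized as a percent change in competitor signal. The sketch below assumes a labeled competitor and replicate readings in arbitrary units; the function name and the background-subtraction step are illustrative assumptions.

```python
from statistics import mean


def competitor_displacement(control_signal, test_signal, background=0.0):
    """Percent reduction in bound labeled competitor caused by a test compound.

    control_signal: replicate readings for protein + competitor.
    test_signal:    replicate readings for protein + competitor + compound.
    background:     reading with no protein (assumed constant here).
    """
    control = mean(control_signal) - background
    test = mean(test_signal) - background
    if control <= 0:
        raise ValueError("control signal must exceed background")
    return 100.0 * (control - test) / control


if __name__ == "__main__":
    # Hypothetical triplicate readings.
    pct = competitor_displacement([1000, 1100, 900], [500, 550, 450])
    print(f"competitor binding reduced by {pct:.1f}%")
```

A value near zero suggests no effect on competitor binding, while a clearly positive value is consistent with the test compound binding to the protein.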

Alternatively, differential screening is used to identify drug candidates that
bind to the native breast cancer protein, but cannot bind to modified breast cancer proteins.
The structure of the breast cancer protein may be modeled, and used in rational drug design
to synthesize agents that interact with that site. Drug candidates that affect the activity of a
breast cancer protein are also identified by screening drugs for the ability to either enhance or
reduce the activity of the protein.

Positive controls and negative controls may be used in the assays. Preferably
control and test samples are performed in at least triplicate to obtain statistically significant
results. Incubation of all samples is for a time sufficient for the binding of the agent to the
protein. Following incubation, samples are washed free of non-specifically bound material
and the amount of bound, generally labeled agent is determined. For example, where a
radiolabel is employed, the samples may be counted in a scintillation counter to determine the
amount of bound compound.
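
For radiolabeled agents counted in triplicate, the bound amount is often summarized as specific binding, i.e., the mean count in the test wells minus the mean count in the negative-control wells. The following sketch illustrates that arithmetic; the error propagation shown assumes independent measurements, and the names are hypothetical.

```python
from statistics import mean, stdev


def specific_binding(total_cpm, nonspecific_cpm):
    """Return (specific binding, approximate SD) from replicate counts.

    total_cpm:       counts per minute in test wells.
    nonspecific_cpm: counts per minute in negative-control wells.
    """
    if len(total_cpm) < 3 or len(nonspecific_cpm) < 3:
        raise ValueError("use at least triplicate measurements")
    specific = mean(total_cpm) - mean(nonspecific_cpm)
    # Combine the two sample SDs, assuming independent measurements.
    spread = (stdev(total_cpm) ** 2 + stdev(nonspecific_cpm) ** 2) ** 0.5
    return specific, spread


if __name__ == "__main__":
    # Hypothetical scintillation counts.
    value, sd = specific_binding([5000, 5200, 4800], [500, 520, 480])
    print(f"specific binding: {value:.0f} +/- {sd:.0f} cpm")
```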

A variety of other reagents may be included in the screening assays. These
include reagents like salts, neutral proteins, e.g. albumin, detergents, etc. which may be used
to facilitate optimal protein-protein binding and/or reduce non-specific or background
interactions. Also reagents that otherwise improve the efficiency of the assay, such as
protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may be used. The mixture
of components may be added in an order that provides for the requisite binding.

In a preferred embodiment, the invention provides methods for screening for a
compound capable of modulating the activity of a breast cancer protein. The methods
comprise adding a test compound, as defined above, to a cell comprising breast cancer
proteins. Preferred cell types include almost any cell. The cells contain a recombinant
nucleic acid that encodes a breast cancer protein. In a preferred embodiment, a library of
candidate agents is tested on a plurality of cells.

In one aspect, the assays are evaluated in the presence or absence or previous
or subsequent exposure of physiological signals, e.g. hormones, antibodies, peptides,
antigens, cytokines, growth factors, action potentials, pharmacological agents including
chemotherapeutics, radiation, carcinogenics, or other cells (i.e. cell-cell contacts). In another
example, the determinations are made at different stages of the cell cycle process.

In this way, compounds that modulate breast cancer agents are identified.
Compounds with pharmacological activity are able to enhance or interfere with the activity of
the breast cancer protein. Once identified, similar structures are evaluated to identify critical
structural features of the compound.

In one embodiment, a method of inhibiting breast cancer cell division is
provided. The method comprises administration of a breast cancer inhibitor. In another
embodiment, a method of inhibiting breast cancer is provided. The method comprises
administration of a breast cancer inhibitor. In a further embodiment, methods of treating cells
or individuals with breast cancer are provided. The method comprises administration of a
breast cancer inhibitor.

In one embodiment, a breast cancer inhibitor is an antibody as discussed 
above. In another embodiment, the breast cancer inhibitor is an antisense molecule. 

A variety of cell growth, proliferation, and metastasis assays are known to
those of skill in the art, as described below.

Soft agar growth or colony formation in suspension 

Normal cells require a solid substrate to attach and grow. When the cells are
transformed, they lose this phenotype and grow detached from the substrate. For example,
transformed cells can grow in stirred suspension culture or suspended in semi-solid media,
such as semi-solid or soft agar. The transformed cells, when transfected with tumor
suppressor genes, regenerate a normal phenotype and require a solid substrate to attach and
grow. Soft agar growth or colony formation in suspension assays can be used to identify
modulators of breast cancer sequences, which when expressed in host cells, inhibit abnormal
cellular proliferation and transformation. A therapeutic compound would reduce or eliminate
the host cells' ability to grow in stirred suspension culture or suspended in semi-solid media,
such as semi-solid or soft agar.

Techniques for soft agar growth or colony formation in suspension assays are
described in Freshney, Culture of Animal Cells: A Manual of Basic Technique (3rd ed., 1994),
herein incorporated by reference. See also the methods section of Garkavtsev et al. (1996),
supra, herein incorporated by reference.

Contact inhibition and density limitation of growth 

Normal cells typically grow in a flat and organized pattern in a petri dish until
they touch other cells. When the cells touch one another, they are contact inhibited and stop
growing. When cells are transformed, however, the cells are not contact inhibited and
continue to grow to high densities in disorganized foci. Thus, the transformed cells grow to a
higher saturation density than normal cells. This can be detected morphologically by the
formation of a disoriented monolayer of cells or rounded cells in foci within the regular
pattern of normal surrounding cells. Alternatively, labeling index with (3H)-thymidine at
saturation density can be used to measure density limitation of growth. See Freshney (1994),
supra. The transformed cells, when transfected with tumor suppressor genes, regenerate a
normal phenotype and become contact inhibited and would grow to a lower density.

In this assay, labeling index with (3H)-thymidine at saturation density is a
preferred method of measuring density limitation of growth. Transformed host cells are
transfected with a breast cancer-associated sequence and are grown for 24 hours at saturation
density in non-limiting medium conditions. The percentage of cells labeling with
(3H)-thymidine is determined autoradiographically. See Freshney (1994), supra.
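
The labeling index itself is a simple percentage. The short sketch below, with hypothetical names and counts, shows the calculation from cell counts scored on an autoradiograph.

```python
def labeling_index(labeled_cells, total_cells):
    """Percentage of scored cells that incorporated (3H)-thymidine."""
    if total_cells <= 0:
        raise ValueError("total_cells must be positive")
    if not 0 <= labeled_cells <= total_cells:
        raise ValueError("labeled_cells must be between 0 and total_cells")
    return 100.0 * labeled_cells / total_cells


if __name__ == "__main__":
    # Hypothetical counts from one scored field.
    print(f"labeling index: {labeling_index(30, 200):.1f}%")
```

Comparing this percentage between transfected and control cultures at saturation density gives a numerical readout of density limitation of growth.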

Growth factor or serum dependence 

Transformed cells have a lower serum dependence than their normal
counterparts (see, e.g., Temin, J. Natl. Cancer Inst. 37:167-175 (1966); Eagle et al., J. Exp.
Med. 131:836-879 (1970); Freshney, supra). This is in part due to release of various growth
factors by the transformed cells. Growth factor or serum dependence of transformed host
cells can be compared with that of control.

Tumor specific marker levels

Tumor cells release a greater amount of certain factors (hereinafter "tumor
specific markers") than their normal counterparts. For example, plasminogen activator (PA)
is released from human glioma at a higher level than from normal brain cells (see, e.g.,
Gullino, Angiogenesis, tumor vascularization, and potential interference with tumor growth,
in Biological Responses in Cancer, pp. 178-184 (Mihich (ed.) 1985)). Similarly, tumor
angiogenesis factor (TAF) is released at a higher level in tumor cells than their normal
counterparts. See, e.g., Folkman, Angiogenesis and Cancer, Sem. Cancer Biol. (1992).

Various techniques which measure the release of these factors are described in
Freshney (1994), supra. Also see Unkless et al., J. Biol. Chem. 249:4295-4305 (1974);
Strickland & Beers, J. Biol. Chem. 251:5694-5702 (1976); Whur et al., Br. J. Cancer 42:305-312
(1980); Gullino, Angiogenesis, tumor vascularization, and potential interference with
tumor growth, in Biological Responses in Cancer, pp. 178-184 (Mihich (ed.) 1985);
Freshney, Anticancer Res. 5:111-130 (1985).

Invasiveness into Matrigel

The degree of invasiveness into Matrigel or some other extracellular matrix
constituent can be used as an assay to identify compounds that modulate breast
cancer-associated sequences. Tumor cells exhibit a good correlation between malignancy and
invasiveness of cells into Matrigel or some other extracellular matrix constituent. In this
assay, tumorigenic cells are typically used as host cells. Expression of a tumor suppressor
gene in these host cells would decrease invasiveness of the host cells.

Techniques described in Freshney (1994), supra, can be used. Briefly, the
level of invasion of host cells can be measured by using filters coated with Matrigel or some
other extracellular matrix constituent. Penetration into the gel, or through to the distal side of
the filter, is rated as "invasiveness", and rated histologically by number of cells and distance
moved, or by prelabeling the cells with 125I and counting the radioactivity on the distal side of
the filter or bottom of the dish. See, e.g., Freshney (1984), supra.

Tumor growth in vivo 

Effects of breast cancer-associated sequences on cell growth can be tested in
transgenic or immune-suppressed mice. Knock-out transgenic mice can be made, in which
the breast cancer gene is disrupted or in which a breast cancer gene is inserted. Knock-out
transgenic mice can be made by insertion of a marker gene or other heterologous gene into
the endogenous breast cancer gene site in the mouse genome via homologous recombination.
Such mice can also be made by substituting the endogenous breast cancer gene with a
mutated version of the breast cancer gene, or by mutating the endogenous breast cancer gene,
e.g., by exposure to carcinogens.

A DNA construct is introduced into the nuclei of embryonic stem cells. Cells
containing the newly engineered genetic lesion are injected into a host mouse embryo, which
is re-implanted into a recipient female. Some of these embryos develop into chimeric mice
that possess germ cells partially derived from the mutant cell line. Therefore, by breeding the
chimeric mice it is possible to obtain a new line of mice containing the introduced genetic
lesion (see, e.g., Capecchi et al., Science 244:1288 (1989)). Chimeric targeted mice can be
derived according to Hogan et al., Manipulating the Mouse Embryo: A Laboratory Manual,
Cold Spring Harbor Laboratory (1988) and Teratocarcinomas and Embryonic Stem Cells: A
Practical Approach, Robertson, ed., IRL Press, Washington, D.C., (1987).

Alternatively, various immune-suppressed or immune-deficient host animals
can be used. For example, a genetically athymic "nude" mouse (see, e.g., Giovanella et al., J.
Natl. Cancer Inst. 52:921 (1974)), a SCID mouse, a thymectomized mouse, or an irradiated
mouse (see, e.g., Bradley et al., Br. J. Cancer 38:263 (1978); Selby et al., Br. J. Cancer
41:52 (1980)) can be used as a host. Transplantable tumor cells (typically about 10^6 cells)
injected into isogenic hosts will produce invasive tumors in a high proportion of cases, while
normal cells of similar origin will not. In hosts which developed invasive tumors, cells
expressing a breast cancer-associated sequence are injected subcutaneously. After a suitable
length of time, preferably 4-8 weeks, tumor growth is measured (e.g., by volume or by its two
largest dimensions) and compared to the control. Tumors that have a statistically significant
reduction (using, e.g., Student's t test) are said to have inhibited growth.
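
The comparison described above can be illustrated with a minimal sketch. It assumes tumor volume is approximated from the two largest dimensions with the ellipsoid formula V = 0.5 x length x width^2, which is a common convention but is not stated in the text, and it computes the pooled-variance (Student's) t statistic by hand. The function names and all measurements are hypothetical.

```python
import math
from statistics import mean, variance


def tumor_volume(length_mm, width_mm):
    """Approximate volume (mm^3) from the two largest dimensions.

    Assumption: ellipsoid approximation V = 0.5 * L * W^2.
    """
    return 0.5 * length_mm * width_mm ** 2


def student_t(treated, control):
    """Return the pooled-variance two-sample t statistic and its degrees of freedom."""
    n1, n2 = len(treated), len(control)
    df = n1 + n2 - 2
    pooled = ((n1 - 1) * variance(treated) + (n2 - 1) * variance(control)) / df
    t = (mean(treated) - mean(control)) / math.sqrt(pooled * (1 / n1 + 1 / n2))
    return t, df


# Hypothetical (length, width) measurements in mm; not data from this document.
treated_dims = [(6, 4), (7, 5), (5, 4), (6, 5)]
control_dims = [(10, 8), (11, 9), (9, 8), (12, 9)]

treated = [tumor_volume(l, w) for l, w in treated_dims]
control = [tumor_volume(l, w) for l, w in control_dims]

t, df = student_t(treated, control)
print(f"t = {t:.2f}, df = {df}")
```

A negative t statistic indicates smaller tumors in the treated group. Whether the reduction is significant would then be judged against the t distribution for the reported degrees of freedom at the chosen significance level.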

Polynucleotide modulators of breast cancer 

Antisense Polynucleotides

In certain embodiments, the activity of a breast cancer-associated protein is
down-regulated, or entirely inhibited, by the use of an antisense polynucleotide, i.e., a nucleic
acid complementary to, and which can preferably hybridize specifically to, a coding mRNA
nucleic acid sequence, e.g., a breast cancer protein mRNA, or a subsequence thereof.
Binding of the antisense polynucleotide to the mRNA reduces the translation and/or stability
of the mRNA.

In the context of this invention, antisense polynucleotides can comprise
naturally-occurring nucleotides, or synthetic species formed from naturally-occurring
subunits or their close homologs. Antisense polynucleotides may also have altered sugar
moieties or inter-sugar linkages. Exemplary among these are the phosphorothioate and other
sulfur-containing species which are known for use in the art. Analogs are comprehended by
this invention so long as they function effectively to hybridize with the breast cancer protein
mRNA. See, e.g., Isis Pharmaceuticals, Carlsbad, CA; Sequitor, Inc., Natick, MA.

Such antisense polynucleotides can readily be synthesized using recombinant
means, or can be synthesized in vitro. Equipment for such synthesis is sold by several
vendors, including Applied Biosystems. The preparation of other oligonucleotides such as
phosphorothioates and alkylated derivatives is also well known to those of skill in the art.

Antisense molecules as used herein include antisense or sense
oligonucleotides. Sense oligonucleotides can, e.g., be employed to block transcription by
binding to the anti-sense strand. The antisense and sense oligonucleotides comprise a single-stranded
nucleic acid sequence (either RNA or DNA) capable of binding to target mRNA
(sense) or DNA (antisense) sequences for breast cancer molecules. A preferred antisense
molecule is for a breast cancer sequence in Tables 1-25, or for a ligand or activator thereof.
Antisense or sense oligonucleotides, according to the present invention, comprise a fragment
generally at least about 14 nucleotides, preferably from about 14 to 30 nucleotides. The
ability to derive an antisense or a sense oligonucleotide, based upon a cDNA sequence
encoding a given protein is described in, e.g., Stein & Cohen (Cancer Res. 48:2659 (1988)) and
van der Krol et al. (BioTechniques 6:958 (1988)).
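
As a rough illustration of deriving an antisense oligonucleotide from a sense sequence, the sketch below takes the reverse complement of each window of a sense (mRNA or cDNA) sequence and restricts the window length to the 14 to 30 nucleotide range mentioned above. The function names and the example sequence are hypothetical, and no attempt is made to score hybridization specificity or secondary structure.

```python
# Base-pairing table for producing a DNA antisense strand from RNA or DNA input.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A", "U": "A"}


def antisense_dna(sense_segment):
    """Return the DNA antisense oligo (written 5'->3') for a sense segment."""
    seq = sense_segment.upper()
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def candidate_oligos(sense, length=20):
    """Yield (start, antisense_oligo) for every window of the given length."""
    if not 14 <= length <= 30:
        raise ValueError("length should be between 14 and 30 nucleotides")
    for start in range(len(sense) - length + 1):
        yield start, antisense_dna(sense[start:start + length])


# Hypothetical sense sequence for illustration only.
sense = "AUGGCUAGCUUGACCGAUCGUAAGCUUAGG"
for start, oligo in list(candidate_oligos(sense, 20))[:3]:
    print(start, oligo)
```

In practice, candidate oligonucleotides would still need to be screened experimentally, since complementarity alone does not guarantee effective down-regulation.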

Ribozymes

In addition to antisense polynucleotides, ribozymes can be used to target and
inhibit transcription of breast cancer-associated nucleotide sequences. A ribozyme is an
RNA molecule that catalytically cleaves other RNA molecules. Different kinds of ribozymes
have been described, including group I ribozymes, hammerhead ribozymes, hairpin
ribozymes, RNase P, and axhead ribozymes (see, e.g., Castanotto et al., Adv. in
Pharmacology 25:289-317 (1994) for a general review of the properties of different
ribozymes).

The general features of hairpin ribozymes are described, e.g., in Hampel et al.,
Nucl. Acids Res. 18:299-304 (1990); European Patent Publication No. 0 360 257; U.S. Patent
No. 5,254,678. Methods of preparing are well known to those of skill in the art (see, e.g.,
WO 94/26877; Ojwang et al., Proc. Natl. Acad. Sci. USA 90:6340-6344 (1993); Yamada et
al., Human Gene Therapy 1:39-45 (1994); Leavitt et al., Proc. Natl. Acad. Sci. USA 92:699-703
(1995); Leavitt et al., Human Gene Therapy 5:1151-120 (1994); and Yamada et al.,
Virology 205:121-126 (1994)).

Polynucleotide modulators of breast cancer may be introduced into a cell
containing the target nucleotide sequence by formation of a conjugate with a ligand binding
molecule, as described in WO 91/04753. Suitable ligand binding molecules include, but are
not limited to, cell surface receptors, growth factors, other cytokines, or other ligands that
bind to cell surface receptors. Preferably, conjugation of the ligand binding molecule does
not substantially interfere with the ability of the ligand binding molecule to bind to its
corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide
or its conjugated version into the cell. Alternatively, a polynucleotide modulator of breast
cancer may be introduced into a cell containing the target nucleic acid sequence, e.g., by
formation of a polynucleotide-lipid complex, as described in WO 90/10448. It is
understood that the use of antisense molecules or knock out and knock in models may also be
used in screening assays as discussed above, in addition to methods of treatment.

Thus, in one embodiment, methods of modulating breast cancer in cells or
organisms are provided. In one embodiment, the methods comprise administering to a cell an
anti-breast cancer antibody that reduces or eliminates the biological activity of an endogenous
breast cancer protein. Alternatively, the methods comprise administering to a cell or
organism a recombinant nucleic acid encoding a breast cancer protein. This may be
accomplished in any number of ways. In a preferred embodiment, e.g., when the breast
cancer sequence is down-regulated in breast cancer, such state may be reversed by increasing
the amount of breast cancer gene product in the cell. This can be accomplished, e.g., by
overexpressing the endogenous breast cancer gene or administering a gene encoding the
breast cancer sequence, using known gene-therapy techniques. In a preferred
embodiment, the gene therapy techniques include the incorporation of the exogenous gene
using enhanced homologous recombination (EHR), e.g., as described in PCT/US93/03868,
hereby incorporated by reference in its entirety. Alternatively, e.g., when the breast cancer
sequence is up-regulated in breast cancer, the activity of the endogenous breast cancer gene is
decreased, e.g., by the administration of a breast cancer antisense nucleic acid.

In one embodiment, the breast cancer proteins of the present invention may be
used to generate polyclonal and monoclonal antibodies to breast cancer proteins. Similarly,
the breast cancer proteins can be coupled, using standard technology, to affinity
chromatography columns. These columns may then be used to purify breast cancer
antibodies useful for production, diagnostic, or therapeutic purposes. In a preferred
embodiment, the antibodies are generated to epitopes unique to a breast cancer protein; that
is, the antibodies show little or no cross-reactivity to other proteins. The breast cancer
antibodies may be coupled to standard affinity chromatography columns and used to purify
breast cancer proteins. The antibodies may also be used as blocking polypeptides, as outlined
above, since they will specifically bind to the breast cancer protein.

Methods of identifying variant breast cancer-associated sequences 

Without being bound by theory, expression of various breast cancer sequences
is correlated with breast cancer. Accordingly, disorders based on mutant or variant breast
cancer genes may be determined. In one embodiment, the invention provides methods for
identifying cells containing variant breast cancer genes, e.g., determining all or part of the
sequence of at least one endogenous breast cancer gene in a cell. This may be accomplished
using any number of sequencing techniques. In a preferred embodiment, the invention
provides methods of identifying the breast cancer genotype of an individual, e.g., determining
all or part of the sequence of at least one breast cancer gene of the individual. This is
generally done in at least one tissue of the individual, and may include the evaluation of a
number of tissues or different samples of the same tissue. The method may include
comparing the sequence of the sequenced breast cancer gene to a known breast cancer gene,
i.e., a wild-type gene.

The sequence of all or part of the breast cancer gene can then be compared to
the sequence of a known breast cancer gene to determine if any differences exist. This can be
done using any number of known homology programs, such as Bestfit, etc. In a preferred
embodiment, the presence of a difference in the sequence between the breast cancer gene of
the patient and the known breast cancer gene correlates with a disease state or a propensity
for a disease state, as outlined herein.
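
The comparison step can be sketched as follows. This is a simplified illustration that assumes the sample and reference sequences have already been aligned to equal length; it is not a substitute for an alignment program such as Bestfit, and the function name and sequences are hypothetical.

```python
def find_differences(sample, reference):
    """List (position, reference_base, sample_base) mismatches, 1-based.

    Assumption: both sequences are pre-aligned and of equal length.
    """
    if len(sample) != len(reference):
        raise ValueError("sequences must be pre-aligned to equal length")
    return [
        (i + 1, ref, obs)
        for i, (ref, obs) in enumerate(zip(reference.upper(), sample.upper()))
        if ref != obs
    ]


# Hypothetical sequences for illustration only.
wild_type = "ATGGCCTTAGACCGT"
patient = "ATGGCCTTGGACCGT"

print(find_differences(patient, wild_type))
```

Each reported tuple identifies a position where the sample departs from the reference. Deciding whether such a difference correlates with a disease state requires separate evidence.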

In a preferred embodiment, the breast cancer genes are used as probes to
determine the number of copies of the breast cancer gene in the genome.

In another preferred embodiment, the breast cancer genes are used as probes to
determine the chromosomal localization of the breast cancer genes. Information such as
chromosomal localization finds use in providing a diagnosis or prognosis, in particular when
chromosomal abnormalities such as translocations and the like are identified in the breast
cancer gene locus.


Administration of pharmaceutical and vaccine compositions 

In one embodiment, a therapeutically effective dose of a breast cancer protein
or modulator thereof is administered to a patient. By "therapeutically effective dose" herein
is meant a dose that produces effects for which it is administered. The exact dose will depend
on the purpose of the treatment, and will be ascertainable by one skilled in the art using
known techniques (e.g., Ansel et al., Pharmaceutical Dosage Forms and Drug Delivery;
Lieberman, Pharmaceutical Dosage Forms (vols. 1-3, 1992), Dekker, ISBN 0824770846,
082476918X, 0824712692, 0824716981; Lloyd, The Art, Science and Technology of
Pharmaceutical Compounding (1999); and Pickar, Dosage Calculations (1999)). As is
known in the art, adjustments for breast cancer degradation, systemic versus localized
delivery, and rate of new protease synthesis, as well as the age, body weight, general health,
sex, diet, time of administration, drug interaction and the severity of the condition may be
necessary, and will be ascertainable with routine experimentation by those skilled in the art.
U.S. Patent Application No. 09/687,576, which further discloses the use of compositions and
methods of diagnosis and treatment in breast cancer, is hereby expressly incorporated by
reference.

A "patient" for the purposes of the present invention includes both humans 
and other animals, particularly mammals. Thus the methods are applicable to both human
therapy and veterinary applications. In the preferred embodiment the patient is a mammal,
preferably a primate, and in the most preferred embodiment the patient is human.

The administration of the breast cancer proteins and modulators thereof of the
present invention can be done in a variety of ways as discussed above, including, but not
limited to, orally, subcutaneously, intravenously, intranasally, transdermally,
intraperitoneally, intramuscularly, intrapulmonary, vaginally, rectally, or intraocularly. In
some instances, e.g., in the treatment of wounds and inflammation, the breast cancer proteins
and modulators may be directly applied as a solution or spray.

The pharmaceutical compositions of the present invention comprise a breast
cancer protein in a form suitable for administration to a patient. In the preferred embodiment,
the pharmaceutical compositions are in a water soluble form, such as being present as
pharmaceutically acceptable salts, which is meant to include both acid and base addition
salts. "Pharmaceutically acceptable acid addition salt" refers to those salts that retain the
biological effectiveness of the free bases and that are not biologically or otherwise
undesirable, formed with inorganic acids such as hydrochloric acid, hydrobromic acid,
sulfuric acid, nitric acid, phosphoric acid and the like, and organic acids such as acetic acid,
propionic acid, glycolic acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic
acid, fumaric acid, tartaric acid, citric acid, benzoic acid, cinnamic acid, mandelic acid,
methanesulfonic acid, ethanesulfonic acid, p-toluenesulfonic acid, salicylic acid and the like.
"Pharmaceutically acceptable base addition salts" include those derived from inorganic bases
such as sodium, potassium, lithium, ammonium, calcium, magnesium, iron, zinc, copper,
manganese, aluminum salts and the like. Particularly preferred are the ammonium,
potassium, sodium, calcium, and magnesium salts. Salts derived from pharmaceutically
acceptable organic non-toxic bases include salts of primary, secondary, and tertiary amines,
substituted amines including naturally occurring substituted amines, cyclic amines and basic
ion exchange resins, such as isopropylamine, trimethylamine, diethylamine, triethylamine,
tripropylamine, and ethanolamine.

The pharmaceutical compositions may also include one or more of the
following: carrier proteins such as serum albumin; buffers; fillers such as microcrystalline
cellulose, lactose, corn and other starches; binding agents; sweeteners and other flavoring
agents; coloring agents; and polyethylene glycol.

The pharmaceutical compositions can be administered in a variety of unit
dosage forms depending upon the method of administration. For example, unit dosage forms
suitable for oral administration include, but are not limited to, powder, tablets, pills, capsules
and lozenges. It is recognized that breast cancer protein modulators (e.g., antibodies,
antisense constructs, ribozymes, small organic molecules, etc.) when administered orally,
should be protected from digestion. This is typically accomplished either by complexing the
molecule(s) with a composition to render it resistant to acidic and enzymatic hydrolysis, or by
packaging the molecule(s) in an appropriately resistant carrier, such as a liposome or a
protection barrier. Means of protecting agents from digestion are well known in the art.

The compositions for administration will commonly comprise a breast cancer
protein modulator dissolved in a pharmaceutically acceptable carrier, preferably an aqueous
carrier. A variety of aqueous carriers can be used, e.g., buffered saline and the like. These
solutions are sterile and generally free of undesirable matter. These compositions may be
sterilized by conventional, well known sterilization techniques. The compositions may
contain pharmaceutically acceptable auxiliary substances as required to approximate
physiological conditions such as pH adjusting and buffering agents, toxicity adjusting agents
and the like, e.g., sodium acetate, sodium chloride, potassium chloride, calcium chloride,
sodium lactate and the like. The concentration of active agent in these formulations can vary
widely, and will be selected primarily based on fluid volumes, viscosities, body weight and
the like in accordance with the particular mode of administration selected and the patient's
needs (e.g., Remington's Pharmaceutical Science (15th ed., 1980) and Goodman & Gilman,
The Pharmacological Basis of Therapeutics (Hardman et al., eds., 1996)).

Thus, a typical pharmaceutical composition for intravenous administration
would be about 0.1 to 10 mg per patient per day. Dosages from 0.1 up to about 100 mg per
patient per day may be used, particularly when the drug is administered to a secluded site and
not into the blood stream, such as into a body cavity or into a lumen of an organ.
Substantially higher dosages are possible in topical administration. Actual methods for
preparing parenterally administrable compositions will be known or apparent to those skilled
in the art, e.g., Remington's Pharmaceutical Science and Goodman and Gilman, The
Pharmacological Basis of Therapeutics, supra.

The compositions containing modulators of breast cancer proteins can be
administered for therapeutic or prophylactic treatments. In therapeutic applications,
compositions are administered to a patient suffering from a disease (e.g., a cancer) in an
amount sufficient to cure or at least partially arrest the disease and its complications. An
amount adequate to accomplish this is defined as a "therapeutically effective dose." Amounts
effective for this use will depend upon the severity of the disease and the general state of the
patient's health. Single or multiple administrations of the compositions may be administered
depending on the dosage and frequency as required and tolerated by the patient. In any event,
the composition should provide a sufficient quantity of the agents of this invention to
effectively treat the patient. An amount of modulator that is capable of preventing or slowing
the development of cancer in a mammal is referred to as a "prophylactically effective dose."
The particular dose required for a prophylactic treatment will depend upon the medical
condition and history of the mammal, the particular cancer being prevented, as well as other
factors such as age, weight, gender, administration route, efficiency, etc. Such prophylactic
treatments may be used, e.g., in a mammal who has previously had cancer to prevent a
recurrence of the cancer, or in a mammal who is suspected of having a significant likelihood
of developing cancer.

It will be appreciated that the present breast cancer protein-modulating
compounds can be administered alone or in combination with additional breast cancer
modulating compounds or with other therapeutic agents, e.g., other anti-cancer agents or
treatments.

In numerous embodiments, one or more nucleic acids, e.g., polynucleotides
comprising nucleic acid sequences set forth in Tables 1-25, such as antisense polynucleotides
or ribozymes, will be introduced into cells, in vitro or in vivo. The present invention provides
methods, reagents, vectors, and cells useful for expression of breast cancer-associated
polypeptides and nucleic acids using in vitro (cell-free), ex vivo or in vivo (cell or
organism-based) recombinant expression systems.

The particular procedure used to introduce the nucleic acids into a host cell for
expression of a protein or nucleic acid is application specific. Many procedures for
introducing foreign nucleotide sequences into host cells may be used. These include the use
of calcium phosphate transfection, spheroplasts, electroporation, liposomes, microinjection,
plasmid vectors, viral vectors and any of the other well known methods for introducing cloned
genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell (see,
e.g., Berger & Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology
volume 152 (Berger), Ausubel et al., eds., Current Protocols (supplemented through 1999),
and Sambrook et al., Molecular Cloning - A Laboratory Manual (2nd ed., Vol. 1-3, 1989)).

In a preferred embodiment, breast cancer proteins and modulators are
administered as therapeutic agents, and can be formulated as outlined above. Similarly,
breast cancer genes (including both the full-length sequence, partial sequences, or regulatory
sequences of the breast cancer coding regions) can be administered in a gene therapy
application. These breast cancer genes can include antisense applications, either as gene
therapy (i.e., for incorporation into the genome) or as antisense compositions, as will be
appreciated by those in the art.

Breast cancer polypeptides and polynucleotides can also be administered as
vaccine compositions to stimulate HTL, CTL and antibody responses. Such vaccine
compositions can include, e.g., lipidated peptides (see, e.g., Vitiello, A. et al., J. Clin. Invest.
95:341 (1995)), peptide compositions encapsulated in poly(DL-lactide-co-glycolide) ("PLG")
microspheres (see, e.g., Eldridge et al., Molec. Immunol. 28:287-294 (1991); Alonso et al.,
Vaccine 12:299-306 (1994); Jones et al., Vaccine 13:675-681 (1995)), peptide compositions
contained in immune stimulating complexes (ISCOMS) (see, e.g., Takahashi et al., Nature
344:873-875 (1990); Hu et al., Clin. Exp. Immunol. 113:235-243 (1998)), multiple antigen
peptide systems (MAPs) (see, e.g., Tam, Proc. Natl. Acad. Sci. U.S.A. 85:5409-5413 (1988);
Tam, J. Immunol. Methods 196:17-32 (1996)), peptides formulated as multivalent peptides;
peptides for use in ballistic delivery systems, typically crystallized peptides, viral delivery
vectors (Perkus et al., In: Concepts in vaccine development (Kaufmann, ed., p. 379, 1996);
Chakrabarti et al., Nature 320:535 (1986); Hu et al., Nature 320:537 (1986); Kieny et al.,
AIDS Bio/Technology 4:790 (1986); Top et al., J. Infect. Dis. 124:148 (1971); Chanda et al.,
Virology 175:535 (1990)), particles of viral or synthetic origin (see, e.g., Kofler et al., J.
Immunol. Methods 192:25 (1996); Eldridge et al., Sem. Hematol. 30:16 (1993); Falo et al.,
Nature Med. 7:649 (1995)), adjuvants (Warren et al., Annu. Rev. Immunol. 4:369 (1986);
Gupta et al., Vaccine 11:293 (1993)), liposomes (Reddy et al., J. Immunol. 148:1585 (1992);
Rock, Immunol. Today 17:131 (1996)), or naked or particle absorbed cDNA (Ulmer et al.,
Science 259:1745 (1993); Robinson et al., Vaccine 11:957 (1993); Shiver et al., In: Concepts
in vaccine development (Kaufmann, ed., p. 423, 1996); Cease & Berzofsky, Annu. Rev.
Immunol. 12:923 (1994) and Eldridge et al., Sem. Hematol. 30:16 (1993)). Toxin-targeted
delivery technologies, also known as receptor mediated targeting, such as those of Avant
Immunotherapeutics, Inc. (Needham, Massachusetts) may also be used.



Vaccine compositions often include adjuvants. Many adjuvants contain a
substance designed to protect the antigen from rapid catabolism, such as aluminum hydroxide
or mineral oil, and a stimulator of immune responses, such as lipid A, Bordetella pertussis or
Mycobacterium tuberculosis derived proteins. Certain adjuvants are commercially available
as, e.g., Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit,
MI); Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline
Beecham, Philadelphia, PA); aluminum salts such as aluminum hydroxide gel (alum) or
aluminum phosphate; salts of calcium, iron or zinc; an insoluble suspension of acylated
tyrosine; acylated sugars; cationically or anionically derivatized polysaccharides;
polyphosphazenes; biodegradable microspheres; monophosphoryl lipid A and quil A.
Cytokines, such as GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be
used as adjuvants.

Vaccines can be administered as nucleic acid compositions wherein DNA or
RNA encoding one or more of the polypeptides, or a fragment thereof, is administered to a
patient. This approach is described, for instance, in Wolff et al., Science 247:1465 (1990) as
well as U.S. Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,118; 5,736,524; 5,679,647;
WO 98/04720; and in more detail below. Examples of DNA-based delivery technologies
include "naked DNA", facilitated (bupivacaine, polymers, peptide-mediated) delivery,
cationic lipid complexes, and particle-mediated ("gene gun") or pressure-mediated delivery
(see, e.g., U.S. Patent No. 5,922,687).

For therapeutic or prophylactic immunization purposes, the peptides of the
invention can be expressed by viral or bacterial vectors. Examples of expression vectors
include attenuated viral hosts, such as vaccinia or fowlpox. This approach involves the use of
vaccinia virus, e.g., as a vector to express nucleotide sequences that encode breast cancer
polypeptides or polypeptide fragments. Upon introduction into a host, the recombinant
vaccinia virus expresses the immunogenic peptide, and thereby elicits an immune response.
Vaccinia vectors and methods useful in immunization protocols are described in, e.g., U.S.
Patent No. 4,722,848. Another vector is BCG (Bacille Calmette Guerin). BCG vectors are
described in Stover et al., Nature 351:456-460 (1991). A wide variety of other vectors useful
for therapeutic administration or immunization, e.g., adeno and adeno-associated virus vectors,
retroviral vectors, Salmonella typhi vectors, detoxified anthrax toxin vectors, and the like,
will be apparent to those skilled in the art from the description herein (see, e.g., Shata et al.,
Mol. Med. Today 6:66-71 (2000); Shedlock et al., J. Leukoc. Biol. 68:793-806 (2000); Hipp et
al., In Vivo 14:571-85 (2000)).

Methods for the use of genes as DNA vaccines are well known, and include
placing a breast cancer gene or portion of a breast cancer gene under the control of a
regulatable promoter or a tissue-specific promoter for expression in a breast cancer patient.
The breast cancer gene used for DNA vaccines can encode full-length breast cancer proteins,
but more preferably encodes portions of the breast cancer proteins including peptides derived
from the breast cancer protein. In one embodiment, a patient is immunized with a DNA
vaccine comprising a plurality of nucleotide sequences derived from a breast cancer gene. For
example, breast cancer-associated genes or sequences encoding subfragments of a breast
cancer protein are introduced into expression vectors and tested for their immunogenicity in
the context of Class I MHC and an ability to generate cytotoxic T cell responses. This
procedure provides for production of cytotoxic T cell responses against cells which present
antigen, including intracellular epitopes.

In a preferred embodiment, the DNA vaccines include a gene encoding an
adjuvant molecule with the DNA vaccine. Such adjuvant molecules include cytokines that
increase the immunogenic response to the breast cancer polypeptide encoded by the DNA
vaccine. Additional or alternative adjuvants are available.

In another preferred embodiment, breast cancer genes find use in generating
animal models of breast cancer. When the breast cancer gene identified is repressed or
diminished in cancer tissue, gene therapy technology, e.g., wherein antisense RNA directed
to the breast cancer gene will also diminish or repress expression of the gene. Animal models
of breast cancer find use in screening for modulators of a breast cancer-associated sequence
or modulators of breast cancer. Similarly, transgenic animal technology including gene
knockout technology, e.g., as a result of homologous recombination with an appropriate gene
targeting vector, will result in the absence or increased expression of the breast cancer
protein. When desired, tissue-specific expression or knockout of the breast cancer protein
may be necessary.

It is also possible that the breast cancer protein is overexpressed in breast
cancer. As such, transgenic animals can be generated that overexpress the breast cancer
protein. Depending on the desired expression level, promoters of various strengths can be
employed to express the transgene. Also, the number of copies of the integrated transgene
can be determined and compared for a determination of the expression level of the transgene.
Animals generated by such methods find use as animal models of breast cancer and are
additionally useful in screening for modulators to treat breast cancer.


Kits for Use in Diagnostic and/or Prognostic Applications 

For use in diagnostic, research, and therapeutic applications suggested above,
kits are also provided by the invention. In the diagnostic and research applications such kits
may include any or all of the following: assay reagents, buffers, breast cancer-specific nucleic
acids or antibodies, hybridization probes and/or primers, antisense polynucleotides,
ribozymes, dominant negative breast cancer polypeptides or polynucleotides, small molecule
inhibitors of breast cancer-associated sequences, etc. A therapeutic product may include
sterile saline or another pharmaceutically acceptable emulsion and suspension base.

In addition, the kits may include instructional materials containing directions
(i.e., protocols) for the practice of the methods of this invention. While the instructional
materials typically comprise written or printed materials they are not limited to such. Any
medium capable of storing such instructions and communicating them to an end user is
contemplated by this invention. Such media include, but are not limited to, electronic storage
media (e.g., magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), and the
like. Such media may include addresses to internet sites that provide such instructional
materials.

The present invention also provides for kits for screening for modulators of
breast cancer-associated sequences. Such kits can be prepared from readily available
materials and reagents. For example, such kits can comprise one or more of the following
materials: a breast cancer-associated polypeptide or polynucleotide, reaction tubes, and
instructions for testing breast cancer-associated activity. Optionally, the kit contains
biologically active breast cancer protein. A wide variety of kits and components can be
prepared according to the present invention, depending upon the intended user of the kit and
the particular needs of the user. Diagnosis would typically involve evaluation of a plurality
of genes or products. The genes will be selected based on correlations with important
parameters in disease which may be identified in historical or outcome data.



EXAMPLES

Example 1: Tissue Preparation, Labeling Chips, and Fingerprints 

Purifying total RNA from tissue sample using TRIzol Reagent 

The sample weight is first estimated. The tissue samples are homogenized in
1 ml of TRIzol per 50 mg of tissue using a homogenizer (e.g., Polytron 3100). The size of
the generator/probe used depends upon the sample amount. A generator that is too large for
the amount of tissue to be homogenized will cause a loss of sample and lower RNA yield. A
larger generator (e.g., 20 mm) is suitable for tissue samples weighing more than 0.6 g.
Tubes should not be overfilled. If the working volume is greater than 2 ml and no greater than
10 ml, a 15 ml polypropylene tube (Falcon 2059) is suitable for homogenization.

Tissues should be kept frozen until homogenized. The TRIzol is added
directly to the frozen tissue before homogenization. Following homogenization, the insoluble
material is removed from the homogenate by centrifugation at 7500 x g for 15 min. in a
Sorvall superspeed or 12,000 x g for 10 min. in an Eppendorf centrifuge at 4°C. The cleared
homogenate is then transferred to a new tube(s). Samples may be frozen and stored at -60 to
-70°C for at least one month, or the purification may be continued.

The next process is phase separation. The homogenized samples are incubated
for 5 minutes at room temperature. Then, 0.2 ml of chloroform per 1 ml of TRIzol reagent is
added to the homogenization mixture. The tubes are securely capped and shaken vigorously
by hand (do not vortex) for 15 seconds. The samples are then incubated at room temperature for
2-3 minutes and next centrifuged at 6500 rpm in a Sorvall superspeed for 30 min. at 4°C.

The next process is RNA precipitation. The aqueous phase is transferred to a
fresh tube. The organic phase can be saved if isolation of DNA or protein is desired. Then
0.5 ml of isopropyl alcohol is added per 1 ml of TRIzol reagent used in the original
homogenization. Then, the tubes are securely capped and inverted to mix. The samples are
then incubated at room temperature for 10 minutes and centrifuged at 6500 rpm in a Sorvall for 20
min. at 4°C.

The RNA is then washed. The supernatant is poured off and the pellet washed
with cold 75% ethanol. 1 ml of 75% ethanol is used per 1 ml of the TRIzol reagent used in
the initial homogenization. The tubes are capped securely and inverted several times to
loosen the pellet without vortexing. They are next centrifuged at <8000 rpm (<7500 x g) for 5
minutes at 4°C.
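
By way of illustration only, the reagent volumes recited above (1 ml of TRIzol per 50 mg of tissue; 0.2 ml of chloroform, 0.5 ml of isopropyl alcohol, and 1 ml of 75% ethanol per 1 ml of TRIzol) may be scaled to the estimated sample weight as in the following Python sketch. The function name and dictionary keys are hypothetical and form no part of the described procedure.

```python
def trizol_reagent_volumes(tissue_mg: float) -> dict:
    """Scale reagent volumes (in ml) from tissue mass using the ratios in the text."""
    if tissue_mg <= 0:
        raise ValueError("tissue_mg must be positive")
    trizol_ml = tissue_mg / 50.0  # 1 ml TRIzol per 50 mg of tissue
    return {
        "trizol_ml": trizol_ml,
        "chloroform_ml": 0.2 * trizol_ml,   # phase separation
        "isopropanol_ml": 0.5 * trizol_ml,  # RNA precipitation
        "ethanol_75_ml": 1.0 * trizol_ml,   # RNA wash
    }


# Example: a 100 mg tissue sample
for name, ml in trizol_reagent_volumes(100).items():
    print(f"{name}: {ml:.2f}")
```

For a 100 mg sample the TRIzol volume is 2 ml, which sits at the lower boundary of the working-volume range for which the 15 ml polypropylene tube is described as suitable.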

The RNA wash is decanted. The pellet is carefully transferred to an
Eppendorf tube (sliding down the tube into the new tube by use of a pipet tip to help guide it
in if necessary). The tube size for precipitating the RNA depends on the working volume.
Larger tubes may take too long to dry. The pellet is dried. The RNA is then resuspended in an
appropriate volume (e.g., to 2-5 ug/ul) of DEPC H2O. The absorbance is then measured.
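
The absorbance reading may be converted to an RNA concentration as sketched below. The sketch assumes the commonly used conversion of one A260 unit to approximately 40 ug/ml of RNA; that factor, the function names, and the example readings are assumptions for illustration and are not recited in this Example.

```python
# Assumption: 1 A260 unit corresponds to about 40 ug/ml RNA (common convention).
RNA_UG_PER_ML_PER_A260 = 40.0


def rna_concentration_ug_per_ul(a260: float, dilution_factor: float = 1.0) -> float:
    """Concentration of the undiluted RNA stock in ug/ul."""
    return a260 * dilution_factor * RNA_UG_PER_ML_PER_A260 / 1000.0


def volume_for_target_ul(rna_ug: float, target_ug_per_ul: float) -> float:
    """Resuspension volume (ul) that gives the target concentration."""
    if target_ug_per_ul <= 0:
        raise ValueError("target concentration must be positive")
    return rna_ug / target_ug_per_ul


# Hypothetical reading: A260 of 0.5 on a 1:100 dilution
conc = rna_concentration_ug_per_ul(0.5, dilution_factor=100)
print(f"stock concentration: {conc:.2f} ug/ul")
print(f"volume for 100 ug at 2 ug/ul: {volume_for_target_ul(100, 2.0):.1f} ul")
```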

The poly A+ mRNA may next be purified from total RNA by other methods
such as Qiagen's RNeasy kit. The poly A+ mRNA is purified from total RNA by adding the
Oligotex suspension, which has been heated to 37°C and mixed prior to adding to the RNA.
The Elution Buffer is incubated at 70°C. If there is precipitate in the buffer, warm up the 2x
Binding Buffer at 65°C. The total RNA is mixed with DEPC-treated water, 2x Binding
Buffer, and Oligotex according to Table 2 on page 16 of the Oligotex Handbook and next
incubated for 3 minutes at 65°C and 10 minutes at room temperature.

The preparation is centrifuged for 2 minutes at 14,000 to 18,000 g, preferably
at a "soft setting." The supernatant is removed without disturbing the Oligotex pellet. A little bit
of solution can be left behind to reduce the loss of Oligotex. The supernatant is saved until
satisfactory binding and elution of poly A+ mRNA has been confirmed.

Then, the preparation is gently resuspended in Wash Buffer OW2 and pipetted
onto the spin column and centrifuged at full speed (soft setting if possible) for 1 minute.

Next, the spin column is transferred to a new collection tube and gently
resuspended in Wash Buffer OW2 and centrifuged as described herein.

Then, the spin column is transferred to a new tube and eluted with 20 to 100 ul
of preheated (70°C) Elution Buffer. The Oligotex resin is gently resuspended by pipetting up
and down. The centrifugation is repeated as above and the elution repeated with fresh elution
buffer or first eluate to keep the elution volume low.

The absorbance is next read to determine the yield, using diluted Elution
Buffer as the blank.

Before proceeding with cDNA synthesis, the mRNA is precipitated, as
components left over from, or present in the Elution Buffer of, the
Oligotex purification procedure will inhibit downstream enzymatic reactions of the mRNA.
0.4 vol. of 7.5 M NH4OAc + 2.5 vol. of cold 100% ethanol is added and the preparation
precipitated at -20°C for 1 hour to overnight (or 20-30 min. at -70°C), and centrifuged at
14,000-16,000 x g for 30 minutes at 4°C. Next, the pellet is washed with 0.5 ml of 80%
ethanol (-20°C) and then centrifuged at 14,000-16,000 x g for 5 minutes at room temperature.
The 80% ethanol wash is then repeated. The last bit of ethanol is then dried from the pellet
without use of a speed vacuum and the pellet is then resuspended in DEPC H2O at 1 ug/ul
concentration.
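
The precipitation volumes may be computed from the eluate volume as in the following sketch. It assumes that both the 0.4 vol. of NH4OAc and the 2.5 vol. of ethanol are taken relative to the starting sample volume; the text does not state whether the ethanol volume is instead relative to the sample plus salt, so this reading, like the function name, is an assumption.

```python
def ethanol_precipitation_volumes(sample_ul: float,
                                  salt_vol_ratio: float = 0.4,
                                  ethanol_vol_ratio: float = 2.5,
                                  salt_stock_molar: float = 7.5) -> dict:
    """Volumes (ul) of 7.5 M NH4OAc and 100% ethanol for a given sample volume.

    Assumption: both ratios are relative to the starting sample volume.
    """
    if sample_ul <= 0:
        raise ValueError("sample_ul must be positive")
    salt_ul = salt_vol_ratio * sample_ul
    ethanol_ul = ethanol_vol_ratio * sample_ul
    return {
        "nh4oac_ul": salt_ul,
        "ethanol_ul": ethanol_ul,
        "total_ul": sample_ul + salt_ul + ethanol_ul,
        # NH4OAc molarity after adding salt but before adding ethanol
        "nh4oac_molar_before_ethanol": salt_stock_molar * salt_ul / (sample_ul + salt_ul),
    }


# Example: a 100 ul eluate
for key, value in ethanol_precipitation_volumes(100).items():
    print(f"{key}: {value:.2f}")
```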

Alternatively, the RNA may be purified using other methods (e.g., Qiagen's RNeasy kit).
No more than 100 ug is added to the RNeasy column. The sample volume is
adjusted to 100 ul with RNase-free water. 350 ul Buffer RLT and then 250 ul ethanol
(100%) are added to the sample. The preparation is then mixed by pipetting and applied to an
RNeasy mini spin column for centrifugation (15 sec at >10,000 rpm). If yield is low, the
flowthrough is reapplied to the column and centrifuged again.

Then, the column is transferred to a new 2 ml collection tube, 500 ul Buffer
RPE is added, and the column is centrifuged for 15 sec at >10,000 rpm. The flowthrough is
discarded. 500 ul Buffer RPE is then added and the preparation is centrifuged for 15 sec at
>10,000 rpm. The flowthrough is discarded, and the column membrane dried by centrifuging
for 2 min at maximum speed. The column is transferred to a new 1.5-ml collection tube. 30-50 ul of
RNase-free water is applied directly onto the column membrane. The column is then centrifuged
for 1 min at >10,000 rpm and the elution step repeated.

The absorbance is then read to determine yield. If necessary, the material may 
be ethanol precipitated with ammonium acetate and 2.5X volume 100% ethanol. 


First Strand cDNA Synthesis 

The first strand can be made using Gibco's "Superscript Choice System
for cDNA Synthesis" kit. The starting material is 5 ug of total RNA or 1 ug of poly A+
mRNA. For total RNA, 2 ul of Superscript RT is used; for poly A+ mRNA, 1 ul of
Superscript RT is used. The final volume of the first strand synthesis mix is 20 ul. The RNA
should be in a volume no greater than 10 ul. The RNA is incubated with 1 ul of 100 pmol
T7-T24 oligo for 10 min at 70°C, followed by addition on ice of 7 ul of: 4 ul 5X 1st Strand
Buffer, 2 ul of 0.1 M DTT, and 1 ul of 10 mM dNTP mix. The preparation is then incubated at
37°C for 2 min before addition of the Superscript RT, followed by incubation at 37°C for 1
hour.
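
The volume bookkeeping for the 20 ul first strand reaction may be checked as in the sketch below. The text does not state what makes up the remaining volume when the RNA is supplied in less than the maximum volume; making up the difference with water is an assumption of this sketch, and the function name is hypothetical.

```python
FINAL_VOLUME_UL = 20.0
OLIGO_UL = 1.0            # T7-T24 oligo
BUFFER_MIX_UL = 7.0       # 4 ul 5X buffer + 2 ul 0.1 M DTT + 1 ul 10 mM dNTP mix
RT_UL = {"total": 2.0, "polyA": 1.0}  # Superscript RT per RNA type


def first_strand_setup(rna_type: str, rna_volume_ul: float) -> dict:
    """Return component volumes (ul) for one first strand reaction."""
    if rna_type not in RT_UL:
        raise ValueError("rna_type must be 'total' or 'polyA'")
    if not 0 < rna_volume_ul <= 10:
        raise ValueError("RNA volume must be above 0 and no greater than 10 ul")
    rt_ul = RT_UL[rna_type]
    # Assumption: water makes up the remainder of the 20 ul reaction.
    water_ul = FINAL_VOLUME_UL - (rna_volume_ul + OLIGO_UL + BUFFER_MIX_UL + rt_ul)
    return {"rna_ul": rna_volume_ul, "oligo_ul": OLIGO_UL,
            "buffer_mix_ul": BUFFER_MIX_UL, "rt_ul": rt_ul, "water_ul": water_ul}


print(first_strand_setup("total", 10.0))
print(first_strand_setup("polyA", 8.0))
```

With total RNA supplied in the maximum 10 ul, the listed components already sum to 20 ul and no water is needed.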

Second Strand Synthesis 

For the second strand synthesis, place the 1st strand reactions on ice and add: 91
ul DEPC H2O; 30 ul 5X 2nd Strand Buffer; 3 ul 10 mM dNTP mix; 1 ul 10 U/ul E. coli DNA
Ligase; 4 ul 10 U/ul E. coli DNA Polymerase; and 1 ul 2 U/ul RNase H. Mix and incubate 2
hours at 16°C. Add 2 ul T4 DNA Polymerase. Incubate 5 min at 16°C. Add 10 ul of 0.5 M
EDTA.

Cleaning up cDNA 

The cDNA is purified using Phenol:Chloroform:Isoamyl Alcohol (25:24:1)
and Phase-Lock gel tubes. The PLG tubes are centrifuged for 30 sec at maximum speed.
The cDNA mix is then transferred to a PLG tube. An equal volume of
phenol:chloroform:isoamyl alcohol is then added, the preparation shaken vigorously (no
vortexing), and centrifuged for 5 minutes at maximum speed. The top aqueous solution is
transferred to a new tube and ethanol precipitated by adding 7.5X 5M NH4OAc and 2.5X
volume of 100% ethanol. Next, it is centrifuged immediately at room temperature for 20
min at maximum speed. The supernatant is removed, and the pellet washed 2X with cold
80% ethanol. As much ethanol wash as possible should be removed before air drying the
pellet and resuspending it in 3 ul RNase-free water.


In vitro Transcription (IVT) and labeling with biotin

In vitro Transcription (IVT) and labeling with biotin is performed as follows:
Pipet 1.5 ul of cDNA into a thin-wall PCR tube. Make NTP labeling mix by combining 2 ul
T7 10x ATP (75 mM) (Ambion); 2 ul T7 10x GTP (75 mM) (Ambion); 1.5 ul T7 10x CTP (75
mM) (Ambion); 1.5 ul T7 10x UTP (75 mM) (Ambion); 3.75 ul 10 mM Bio-11-UTP
(Boehringer-Mannheim/Roche or Enzo); 3.75 ul 10 mM Bio-16-CTP (Enzo); 2 ul 10x T7
transcription buffer (Ambion); and 2 ul 10x T7 enzyme mix (Ambion). The final volume is
20 ul. Incubate 6 hours at 37°C in a PCR machine. The RNA can be further cleaned.
Clean-up follows the previous instructions for RNeasy columns or Qiagen's RNeasy protocol
handbook. The cRNA often needs to be ethanol precipitated and resuspended in a volume
compatible with the fragmentation step.

Fragmentation is performed as follows. 15 ug of labeled RNA is usually
fragmented. Try to minimize the fragmentation reaction volume; a 10 ul volume is
recommended but 20 ul is all right. Do not go higher than 20 ul because the magnesium in
the fragmentation buffer contributes to precipitation in the hybridization buffer. Fragment
RNA by incubation at 94°C for 35 minutes in 1x Fragmentation buffer (5x Fragmentation
buffer is 200 mM Tris-acetate, pH 8.1; 500 mM KOAc; 150 mM MgOAc). The labeled
RNA transcript can be analyzed before and after fragmentation. Samples can be heated to
65°C for 15 minutes and electrophoresed on 1% agarose/TBE gels to get an approximate idea
of the transcript size range.
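
The dilution of the 5x Fragmentation buffer to 1x may be worked out as in the following sketch, which also reports the resulting component concentrations. The stock concentrations and the 20 ul ceiling come from the text; the function name and the returned keys are hypothetical.

```python
# 5x Fragmentation buffer composition from the text (mM)
STOCK_5X_MM = {"Tris-acetate": 200.0, "KOAc": 500.0, "MgOAc": 150.0}


def fragmentation_setup(reaction_ul: float) -> dict:
    """Volumes (ul) and final concentrations (mM) for a 1x fragmentation reaction."""
    if not 0 < reaction_ul <= 20:
        raise ValueError("reaction volume should be above 0 and at most 20 ul")
    buffer_ul = reaction_ul / 5.0  # 5x stock diluted to 1x
    return {
        "buffer_5x_ul": buffer_ul,
        "rna_plus_water_ul": reaction_ul - buffer_ul,
        "final_mM": {name: conc / 5.0 for name, conc in STOCK_5X_MM.items()},
    }


print(fragmentation_setup(10))
print(fragmentation_setup(20))
```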

For hybridization, 200 ul (10 ug cRNA) of a hybridization mix is put on the
chip. If multiple hybridizations are to be done (such as cycling through a 5 chip set), then it
is recommended that an initial hybridization mix of 300 ul or more be made. The
hybridization mix is: fragmented labeled RNA (50 ng/ul final conc.); 50 pM 948-b control
oligo; 1.5 pM BioB; 5 pM BioC; 25 pM BioD; 100 pM CRE; 0.1 mg/ml herring sperm DNA;
0.5 mg/ml acetylated BSA; and 1x MES hyb buffer to 300 ul.
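
The amount of fragmented cRNA required for a given volume of hybridization mix follows directly from the 50 ng/ul final concentration, as in this short sketch (the function name is hypothetical):

```python
def crna_needed_ug(hyb_volume_ul: float, final_ng_per_ul: float = 50.0) -> float:
    """Mass of fragmented labeled RNA (ug) for a hybridization mix of the given volume."""
    if hyb_volume_ul <= 0:
        raise ValueError("hyb_volume_ul must be positive")
    return hyb_volume_ul * final_ng_per_ul / 1000.0


for volume in (200, 300):
    print(f"{volume} ul mix needs {crna_needed_ug(volume):.1f} ug cRNA")
```

The 200 ul applied to the chip corresponds to the stated 10 ug of cRNA, and a 300 ul initial mix corresponds to the 15 ug of labeled RNA that is usually fragmented.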

The hybridization reaction is conducted with non-biotinylated IVT (purified
by RNeasy columns) (see example 1 for steps from tissue to IVT). The following mixture is
prepared:

IVT antisense RNA, 4 ug: __ ul
Random Hexamers (1 ug/ul): 4 ul
H2O: __ ul

14 ul

Incubate the above 14 ul mixture at 70°C for 10 min.; then put on ice.
The reverse transcription procedure uses the following mixture:

0.1 M DTT: 3 ul
50X dNTP mix: 0.6 ul
H2O: 2.4 ul
Cy3 or Cy5 dUTP (1 mM): 3 ul
SS RT II (BRL): 1 ul

16 ul

The above solution is added to the hybridization reaction and incubated for 30 min. at 42°C.

Then, 1 ul SSII is added and the reaction incubated for another hour before being placed on ice.

The 50X dNTP mix contains 25 mM each of cold dATP, dCTP, and dGTP and 10 mM
of dTTP, and is made by adding 25 ul each of 100 mM dATP, dCTP, and dGTP and 10 ul of
100 mM dTTP to 15 ul H2O.
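
The stated composition of the 50X dNTP mix can be checked against its recipe with the usual dilution relation (final concentration = stock concentration x added volume / total volume), as sketched below. The variable and function names are hypothetical.

```python
def final_mM(stock_mM: float, added_ul: float, total_ul: float) -> float:
    """Final concentration after diluting added_ul of stock into total_ul."""
    return stock_mM * added_ul / total_ul


# Recipe from the text: volumes (ul) of each 100 mM nucleotide stock, plus water
stock_mM = 100.0
components_ul = {"dATP": 25.0, "dCTP": 25.0, "dGTP": 25.0, "dTTP": 10.0}
water_ul = 15.0

total_ul = sum(components_ul.values()) + water_ul
mix_mM = {name: final_mM(stock_mM, vol, total_ul) for name, vol in components_ul.items()}

print(f"total volume: {total_ul:.0f} ul")
for name, conc in mix_mM.items():
    print(f"{name}: {conc:.0f} mM")
```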

RNA degradation is performed as follows. Add 86 ul H2O and 1.5 ul 1M NaOH/
2 mM EDTA and incubate at 65°C for 10 min. For U-Con 30, add 500 ul TE/sample, spin at 7000 g
for 10 min, and save the flow through for purification. For Qiagen purification, suspend the u-con
recovered material in 500 ul buffer PB and proceed using the Qiagen protocol. For DNAse
digestion, add 1 ul of a 1/100 dilution of DNAse per 30 ul Rx and incubate at 37°C for 15 min.
Incubate for 5 min at 95°C to denature the DNAse.


Sample preparation 

For sample preparation, add Cot-1 DNA, 10 ul; 50X dNTPs, 1 ul; 20X SSC, 2.3 ul; Na
pyrophosphate, 7.5 ul; and 10 mg/ml herring sperm DNA, 1 ul of a 1/10 dilution, to 21.8 ul final vol. Dry
in speed vac. Resuspend in 15 ul H2O. Add 0.38 ul 10% SDS. Heat at 95°C for 2 min and slow
cool at room temp. for 20 min. Put on slide and hybridize overnight at 64°C. Washing after
the hybridization: 3X SSC/0.03% SDS: 2 min., 37.5 ml 20X SSC + 0.75 ml 10% SDS in
250 ml H2O; 1X SSC: 5 min., 12.5 ml 20X SSC in 250 ml H2O; 0.2X SSC: 5 min., 2.5 ml 20X
SSC in 250 ml H2O. Dry slides and scan at appropriate PMTs and channels.
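
The wash buffer recipes can be reproduced from the stock concentrations as in the sketch below. It treats 250 ml as the nominal final volume, which is a simplification: the text adds the stocks to 250 ml of H2O, so the actual final volume is slightly larger. The function name is hypothetical.

```python
def stock_volume_ml(target: float, stock: float, final_volume_ml: float) -> float:
    """Volume of stock (ml) giving `target` strength in `final_volume_ml`.

    `target` and `stock` share a unit (X for SSC, % for SDS).
    Assumption: final_volume_ml is the nominal final volume.
    """
    if target > stock:
        raise ValueError("target strength cannot exceed stock strength")
    return target * final_volume_ml / stock


FINAL_ML = 250.0
for label, ssc_x in (("3X SSC", 3.0), ("1X SSC", 1.0), ("0.2X SSC", 0.2)):
    print(f"{label}: {stock_volume_ml(ssc_x, 20.0, FINAL_ML):.2f} ml of 20X SSC")

# 0.03% SDS in the first wash, from a 10% SDS stock
print(f"0.03% SDS: {stock_volume_ml(0.03, 10.0, FINAL_ML):.2f} ml of 10% SDS")
```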
TABLE 1: Figure 1 from BRCA 001 US

Table 1 shows genes (incorporated in their entirety here and throughout the application
where primekeys are provided) downregulated in tumor tissue compared to normal breast
tissue.

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of normal breast tissue to tumor

Pkey ExAccn UnigeneID Unigene Title R1

100472 D90084 Hs.1023 pyruvate dehydrogenase (lipoamide) alpha
100499 T51986 Hs.283108 hemoglobin, gamma G
100545 M55405 gb:Homo sapiens mucin (MUC-3) mRNA, part
100549 BE142019 Hs.222056 Homo sapiens cDNA FLJ11572 fis, clone HE
100613 X52078 Hs.101047 transcription factor 3 (E2A immunoglobul
100635 BE259039 Hs.129953 Ewing sarcoma breakpoint region 1
100645 X16841 Hs.167988 neural cell adhesion molecule 1
100654 A03758 NM_000477 Homo sapiens albumin (ALB), m
100702 L27065 gb:Human neurofibromatosis 2 (NF2) mRNA,
100915 M60832 Hs.249239 collagen, type VIII, alpha 2
100971 BE379727 Hs.83213 fatty acid binding protein 4, adipocyte
101125 AJ250562 Hs.82749 transmembrane 4 superfamily member 2
101165 M90424 Hs.2099 lipocalin 1 (protein migrating faster th
101184 NM_001674 Hs.460 activating transcription factor 3
101336 NM_006732 Hs.75678 FBJ murine osteosarcoma viral oncogene h
101367 X03350 Hs.4 alcohol dehydrogenase 1B (class I), beta
101447 M21305 gb:Human alpha satellite and satellite 3
101461 N98569 Hs.76422 phospholipase A2, group IIA (platelets,
101511 M27826 Hs.267319 endogenous retroviral protease
101634 AV650262 Hs.75765 GRO2 oncogene
101736 M74447 Hs.502 transporter 2, ATP-binding cassette, sub
102208 U22961 gb:Human mRNA clone with similarity to L
102297 NM_001504 Hs.198252 G protein-coupled receptor 9
102450 U48251 Hs.75871 protein kinase C binding protein 1
102515 U89337 Hs.169886 tenascin XB
102571 U60115 Hs.239069 four and a half LIM domains 1
102800 AA313538 gb:EST185419 Colon carcinoma (HCC) cell
102857 NM_006744 Hs.76461 retinol-binding protein 4, interstitial
102990 AA829286 Hs.332053 serum amyloid A1
103434 X98085 Hs.54433 tenascin R (restrictin, janusin)
103747 AA081995 gb:zn26d06.r1 Stratagene neuroepithelium
103750 AA126129 gb:zm78c07.r1 Stratagene neuroepithelium
103812 AA137107 Hs.326391 Homo sapiens, clone MGC:16638, mRNA, com
103851 AA326216 Hs.8719 hypothetical protein MGC1136
104080 AB041036 Hs.57771 kallikrein 11 (KLK11; TLSP; PRSS20; hipp
104093 R50727 Hs.336970 ESTs
104106 AA422123 gb:zv26MZr1 Soares_NhHMPu_S1 Homo sapi
104109 AL353957 Hs.284181 hypothetical protein DKFZp434P0531
104250 F06638 Hs.12440 Homo sapiens clone 24734 mRNA sequence
104340 AA426189 gb:zw11e09.r1 Soares_NhHMPu_S1 Homo sapi
104492 N73185 Hs.94285 EST
104506 N91071 Hs.109650 ESTs
104511 N99542 Hs.572 orosomucoid 1
104532 AM98763 Hs.203013 hypothetical protein FLJ12748

104536 R24024 Hs.158101 Homo sapiens cDNA FLJ14673 fis, clone NT 5
104572 Y11312 Hs.132463 phosphoinositide-3-kinase, class 2, beta 5
104659 AW969769 Hs.105201 ESTs 5
104677 AA009764 Hs.190380 ESTs 10
104711 AA017245 Hs.32794 ESTs 10
104731 AA019300 Hs.125070 ESTs, Moderately similar to I54374 gene 10
104764 AI039243 Hs.278585 ESTs 5
105005 AI298208 Hs.28805 ESTs 10
105036 AA130390 Hs.25549 hypothetical protein FLJ20898 10
105105 R61532 Hs.87016 hypothetical protein FLI99Q38 5
105231 AW970043 Hs.238039 hypothetical protein FLJ11090 5
105239 AA221036 gb:7rQ3i19.r1 Stratagene NT2 neuronal pr 10
105921 AA421973 Hs.169119 ESTs, Weakly similar to T25731 hypotheti 5


105957 BF949R57 Hs.77091 hypothetical protein PlJ11t W c
108059 M7Q8R5 10
1081 1Q Al 35QR94 t-l«111A7 KIAA14<v1 protein c
1081fl1 AIRfYlfiM Hs.1Q1ROR ESTs 10
10R1Q4 AWQ7R171 Hs.9RA1Q4 hypothetical protein PI 199911 0
106283 A108584B Hs.9^599 KIAA1&AA protein 10
M837Q Al nZ9TlKQ Hs.11Q091 nk'P7P414MnR1 protein 10
106451 AW9T5928 Hs.^1^182 ESTs 10
1084Q1 AA13**888 Hs.IflflRI Homo sapiens clone IMARP'41^07Rfi mRNA 10
106700 AAQ08434 Hs.*^77fi zinc finger protein 91 R 5
10S7B2 AW05488G Hs.25682 Homo sapiens mRNA for KIAA1863 protein 10
108851 AI45RR93 nK-fWUnOQ v1 NH CAAP I n94 Homo sapiens E
lUOOrU MiyOO/ OU no.zooov serum deprivation response (phosphatidyl 0
10R8Q9 AI3A7578 Hs.19401 hypothetical protein MAr9R0<) c
106954 AF128847 Hs.9040 18 indolethylamine N-methyltransferase 5
1flRQQ1 A 199^11 Hs.97 hypothetical protein c
107103 AI446183 Hs.9572 ESTs, Highly similar to CYA5_HUMAN ADENY 5
107194 AB008532 Hs.31449 RecQ protein-like 4 10
107148 A1QQ5036 Hs.334305 GS1^9fulI 10
107214 AF197026 Hs.S3Q4 rnvnetn IA 10
107242 AB020672 Hs.175411 KIAA0885 protein 10
1073^1 A 10.05085 Hs.IHROJi ESTs 10
107351 1151704 Hs.393498 ESTs, Moderately similar to Al M8 HUMANA c
107491 \A/9fifi59 e
107447 W98518 Hs.1 Q9 1 0 hypothetical protein MfifM 1 308 10
107451 At 049495 Hs.983Q7R hypothetical protein PR093RQ 10
107411 AI0Q97QO Hs.334703 hypothetical protein Ft H459Q e
107450 W38009 Empirically selected from AFFX single pr 10
107883 N531R7 Hs.47fi93 ESTs 10
107711 WQR141 Hs.990(587 ESTs 10
Hs.^oy244 ESTs 10
107757 Bto217^1 Hs.260794 hypothetical protein RJl23o7 similar to 10
Hs.olZ4o ESTs 10
107872 Dci/i/Uo Hs.95 1 10 ESTs, Weakly similar to A55943 i-pnospna 10
107000 AA0258JD Hs.19ioo7 ESTs 0
107997 AL049170 Hs.82223 chordin-like 10
Iflftnrc AA0A3R75 Hs.RORIT. ESTs 10


108081 AA093668 Hs.28578 muscleblind (Drosophila)-like 5
108113 AA012881 Hs.72531 hypothetical protein FLJ11838 10
108238 AA059473 Hs.66783 EST 10
108257 AA677927 Hs.144269 ESTs 5
108335 AA070500 gb:zm70h03^1 Stratagene neuroepithelium 5
108351 AA071193 gb:zf79b12.s1 Soares_pineal_gland_N3HPG 10
108382 NM_006770 Hs.67726 macrophage receptor with collagenous str 5
108392 AA075124 gb:zm85a01.s1 Stratagene ovarian cancer 10
108441 AA079079 gb:zm97c09^1 Stratagene colon HT29 (937 10
108446 AA085383 gb:zn13g03^1 Stratagene hNT neuron (937 10
108497 AA074897 gb:zm85a05.ii Stratagene ovarian cancer 10
108604 AA934589 Hs.49696 ESTs 5
108662 AF117646 Hs.156637 Cas-Br-M (murine) ectropic retroviral tr 5
108706 AA121820 Hs.74569 KIAA0842 protein 10
108738 AA126583 Hs.158725 ESTs 10
108827 AI273692 Hs.110470 ESTs 10
109123 AI028376 HSJ3232 ESTs 10
109389 AA101325 Hs.86154 hypothetical protein FLJ12457

109546 F01449 Hs.26954 Homo sapiens mRNA; cDNA DKFZp762G123 (fr
109919 R40604 Hs.129539 ESTs, Weakly similar to MCAT_HUMAN MITOC
110006 AI094674 Hs.30524 ring finger protein 24
110141 H46749 Hs.31540 ESTs
110354 W22165 Hs.22586 ESTs
110433 AW294162 Hs.301062 UDP-N-acetyl-alpha-D-galactosamine:polyp
110448 H51276 Hs.13526 hypothetical protein FLJ12688
110455 H52576 gb:yt85e08.r1 Soares_pineal_gland_N3HPG
110540 H72639 Hs.167608 ESTs
110553 H60593 Hs.124990 ESTs
110976 AL044174 Hs.159526 patched (Drosophila) homolog
110987 AI753316 Hs.26034 ESTs
111158 N66616 Hs.138629 H.sapiens mRNA for subtelomeric repeat s
111168 AI798376 gb:tr34b07.x1 NCI_CGAP_Ov23 Homo sapiens
111187 AJ224864 Hs.9688 leukocyte membrane antigen
111307 AA641636 Hs.37477 ESTs, Weakly similar to T46908 hypotheti
111400 R00144 Hs.189771 ESTs
111498 AI168511 gb:ow90h09.s1 Soares_fetal_liver_spleen
111651 R16733 Hs.20499 ESTs
111738 R26065 gb:yh39d03.s1 Soares placenta Nb2HP Homo
111803 AA593731 Hs.325823 ESTs, Moderately similar to ALU5_HUMAN A
111995 R42333 Hs.302292 ESTs
112071 AL117490 Hs.47225 Ras-associated protein Rap1
112204 NM_006668 Hs.25121 cytochrome P450, subfamily 46 (cholester
112258 R51889 Hs.24990 ESTs
112490 R31094 Hs.24378 ESTs
112588 R77302 gb:yi75h08.s1 Soares placenta Nb2HP Homo
112654 BE618629 Hs.268809 ESTs
112784 T98628 Hs.191290 ESTs
112817 AI057205 Hs.14584 ESTs
112885 AA581426 Hs.5021 EST
112913 T16837 Hs.4241 ESTs
113149 T51588 gb:yb27e06.s1 Stratagene fetal spleen (9
113174 T54659 Hs.301755 Homo sapiens cDNA FLJ11465 fis, clone HE
113203 AA743563 Hs.10305 ESTs
113299 AW207424 Hs.332594 ESTs
113367 N92359 Hs.14518 ESTs, Moderately similar to A48752 B-cel
113457 R16763 Hs.268679 ESTs
113563 AA913635 Hs.326413 Homo sapiens cDNA FLJ20812 fis, clone AO
113574 R06874 Hs.268628 ESTs, Moderately similar to ALU1_HUMAN A
113776 AI791905 Hs.95549 hypothetical protein
113790 AI244311 Hs.26912 ESTs
113807 W07586 Hs.8045 ESTs
113958 W86195 gb:zh54e05.s1 Soares_fetal_liver_spleen_
114211 Z39319 Hs.27347 EST
114254 AB018263 Hs.180338 tumor necrosis factor receptor superfami
114349 AA745978 Hs.28273 ESTs
114449 AA020736 gb:ze63b11.s1 Soares retina N2b4HR Homo
114484 AA034378 Hs.267319 endogenous retroviral protease
114576 AA065096 gb:zm50a02.s1 Stratagene fibroblast (937
114624 AA081507 gb:zn05b10.r1 Stratagene hNT neuron (937
114844 AA234826 Hs.87386 EST
114906 AA234462 Hs.87350 ESTs
115624 AK000725 Hs.50579 hypothetical protein FLJ20718
115666 AF173081 Hs.178215 Vertebrate LIN7 homolog 1, Tax interacti
115712 AB020649 Hs.74569 KIAA0842 protein
115889 AA398841 Hs.39850 hypothetical protein FLJ20517
115949 AI478427 Hs.43125 esophageal cancer related gene 4 protein
116107 AL133916 Hs.172572 hypothetical protein FLJ20093
116180 AA463902 Hs.13522 ESTs, Weakly similar to I38022 hypotheti
116267 AW968703 Hs.30085 hypothetical protein FLJ23186
116291 AW410377 Hs.41502 hypothetical protein FLJ21276
116527 AW194253 Hs.68607 ESTs
116659 BE314852 Hs.168694 Homo sapiens clone 23763 unknown mRNA, p
116708 F10528 Hs.70001 ESTs, Moderately similar to JC6169 nucle
117058 AW801806 gb:IL5-UM0070-110400-062-g07 UM0070 Homo
Pkey:
117151, 117226, 117323, 117571, 117624, 117673, 117847, 117877, 117919, 118049,
118413, 118613, 118664, 118858, 118902, 119039, 119159, 119216, 119317, 119366,
119378, 119528, 119792, 119800, 119817, 119835, 119923, 119961, 120379, 120931,
121037, 121282, 121382, 121764, 122034, 122441, 122756, 122771, 123601, 123623,
123941, 124215, 124276, 124680, 125099, 125121, 125188, 125284, 125906, 128484,
128511, 128538, 128606, 128850, 128870, 128903, 128931, 129001, 129091, 129101,
129146, 129213, 129228, 129265, 129285, 129346, 129368

ExAccn:
AI803656, N20468, AI472863, N34417, N26627, N40551, N49285, AW263476, BE222341,
N53145, AW955696, AKJ78236, N70907, AL122040, AA993527, AI160570, AF142419,
AA514422, AK002001, T77892, T81824, W38051, AL049798, AF086332, AF088061,
AF086429, AW803308, U34249, AL042725, AW136934, AA907743, AA401695, AA405763,
AA421452, AK000229, AA447555, AA458945, AW135093, AA609122, AI024595, AA621529,
H62570, H83465, AK001527, NM_014312, Hs.42373 ESTs, BE299567, NM_002666,
BE256206, AA485421, NM_002250, R44214, C16161, AA193106, H39537, AW150717,
AA443323, AA056483, NM_013403, AL117472, AI146494, U40714, AA530892, BE617015,
AF110141, NM_003877

UnigeneID:
Hs.43387, Hs.44584, Hs.82364, Hs.184043, Hs.182391, Hs.44268, Hs.279472,
Hs.90960, Hs.49688, Hs.230619, Hs.102981, Hs.293907, Hs.252097, Hs.15020,
Hs.221849, Hs.51305, Hs.90949, Hs.80552, Hs.58314, Hs.159590, Hs.58429,
Hs.62954, Hs.337461, Hs.97162, Hs.142373, Hs.97334, Hs.111939, Hs.164851,
Hs.98017, Hs.99116, Hs.95898, Hs.97282, Hs.112645, Hs.97508, Hs.163953,
Hs.112377, Hs.48403, Hs.271749, Hs.103253, Hs.17775, Hs.270503, Hs.10082,
Hs.101189, Hs.283040, Hs.180817, Hs.75309, Hs.296176, Hs.107242, Hs.107812,
Hs.301463, Hs.108665, Hs.108924, Hs.109525, Hs.239307, Hs.171695, Hs.11006,
Hs.288908, Hs.110776

Unigene Title:
gb:yx39b10.s1 Soares melanocyte 2NbHM Ho
ESTs
ESTs
ESTs, Weakly similar to JC4124 pregnancy
Homo sapiens Ets-1 binding protein (E1B)
ESTs
myelin gene expression factor 2
ESTs
gb:yv55f09.s1 Soares fetal liver spleen
ESTs
ESTs
EST
Homo sapiens mRNA; cDNA DKFZp434G1972 (f
hypothetical protein FLJ23403
pregnancy specific beta-1-glycoprotein 6
homolog of mouse quaking QKI (KH domain
ESTs
v-maf musculoaponeurotic fibrosarcoma (a
gb:yd20f04.s1 Soares fetal liver spleen
EST
Empirically selected from AFFX single pr
dermatopontin
ESTs
ESTs
ESTs
ferritin, heavy polypeptide 1
Human putative zinc finger protein (ZNFB
gb:DKFZp434B1822_r1 434 (synonym: htes3)
ESTs
ESTs
ESTs
Homo sapiens cDNA FLJ20470 fis, clone KA
ESTs, Weakly similar to KIAA0926 protein
Homo sapiens cDNA FLJ20222 fis, clone CO
EST
ESTs
ESTs, Highly similar to G100..HUMAN 110 K
Homo sapiens mRNA; cDNA DKFZp434D2472 (f
a disintegrin and metalloproteinase doma
gb:af47a02.s1 Soares_total_fetus_Nb2HF8_
gb:yr44a01.r1 Soares fetal liver spleen
gb:ys91a11.s1 Soares retina N2b5HR Homo
hypothetical protein FLJ10665
cortical thymocyte receptor (X. laevis
hypothetical protein FLJ10847
ESTs, Moderately similar to ALU8_HUMAN A
perilipin
p75NTR-associated cell death executor; o
ESTs, Weakly similar to ALU7_HUMAN ALU S
potassium intermediate/small conductance
ESTs
hypothetical protein PRO2543
chromosome 11 open reading frame 23
eukaryotic translation elongation factor
STAT induced STAT inhibitor 3
Homo sapiens cDNA FLJ12965 fis, clone NT
BPOZ protein
Human Chromosome 16 BAC clone CIT987SK-A
zinedin
SH3-domain protein 5 (ponsin)
ESTs, Weakly similar to IRX2_HUMAN IROQU
dual specificity phosphatase 1
ESTs, Moderately similar to T17372 plasm
WAS protein family, member 2
STAT induced STAT inhibitor-2
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129371 X06828 Hs.1 10802 von Willebrand factor 5 

129381 AW245805 Hs.1 10903 daudin 5 (transmembrane protein deleted 10 

129440 W37944 Hs.4007 Sarcolemmal-assodated protein 5 

129441 BE061069 Hs.301943 KIAA0467 protein 10 
5 129516 AF020038 Hs.1 1223 isodtrate dehydrogenase 1 (NADP+), so!u 10 

12S554 BE222078 Hs.1 13069 ESTs ~ 10 

129684 BE622468 Hs.11924 ESTs, WeaWy similar to I36022 hypolheti 5 

129702 AI304966 Hs.12035 ESTs, Weakly similar to I38022 hypotheti 5 

129// 6 AK001676 Hs.i24t)/ hypoiheUcal protein FU 1081 4 10 

10 129893 AK000956 Hs.1 3209 hypothetical protein FU10094 5 

129928 AI338993 Hs.1 34535 ESTs 5 

129973 AJ251760 Hs.273385 guanine nucleotide binding protein (G pr 5 

129977 NMJW0399 Hs.1395 early growth response 2 (Krox-20 (Drosop 5 

130014 NMJJ01 158 Hs.143102 amine oxidase, copper containing 2 (reti 5 

15 130085 M62402 Hs.274313 Insuiin-fike growth factor binding prote 10 

130089 AA452006 Hs.333199 ESTs 5 

130162 W80711 Hs.319946 Homo sapiens mRNA for KIAA1727 protein, 5 

130243 D88435 Hs.153227 cydin G associated kinase 10 

130315 AI241084 Hs.154353 nonselective sodium potassium/proton exc 5 

20 130339 AA435746 gb:zt79e03.s1 SoaresJesfe^NHT Homo sap 5 

130400 V00517 Hs.283108 hemoglobin, gamma G 10 

130436 NMJ01928 Hs.155597 D component of complement (adipsin) 10 

130478 X72308 Hs.251526 small inducible cytokine A7 (monocyte ch 5 

130480 BE222978 Hs.15760 MYG1 protein 10 

25 130494 AW390834 Hs.75874 pregnancy-associated plasma protein A 5 

130563 BE270472 Hs.279900 HSPC015 protein 10 

130589 AL110226 Hs.16441 DKFZP434H204 protein 10 

130606 AI652143 Hs.288382 hypothetical protein FU131 11 5 

130634 AI769067 Hs.127824 ESTs, WeaWy similar to T28770 hypoffietl 3 

30 130683 AA993269 Hs.17872 Homo sapiens, clone IMAGE:3875012, mRNA 10 

130689 NM_006691 Hs.17917 extracellular link domain-containing 1 10 

130716 AA232075 Hs.18259 XPA binding protein 1; putative ATP(GTP) 5 

130718 AF263462 Hs.18376 K1AA1319 protein 10 

130722 1441322 Hs.18441 ESTs 5 

35 130798 M81349 Hs.1955 serum amyloid A4, constitutive 10 

130840 BE048821 Hs.20144 small inducible cytokine subfamily A (Cy 10 

131184 AB040935 Hs.23954 cerebral cell adhesion molecule • 10 

131261 AA360419 Hs.171776 inositol(myo)-1(or4)-monophosphatase 1 10 

131282 X03350 Hs.4 alcohol dehydrogenase 1B (class I), beta 10 

40 131328 AW939251 Hs.25647 v-fos FBJ murine osteosarcoma viral onco 10 

131340 AK000393 Hs.25817 BTB (POZ) domain containing 2 5 

131341 AF110908 Hs.297660 TNF receptor-assodated factor 3 5 
131406 H83294 Hs.284122 Wnt inhibitory factor-1 5
131489 BE394648 Hs.27414 hypothetical protein 5
131543 AW966881 Hs.41639 programmed cell death 2 10
131692 BE559681 Hs.30736 KIAA0124 protein 5
131753 AA829286 Hs.332053 serum amyloid A1 10
131756 AA443966 Hs.31595 ESTs 10
131785 H69342 Hs.26320 TRABID protein 10
131815 AA021258 Hs.32753 ESTs 5
131819 BE244981 Hs.173103 FE65-LIKE 2 5
131828 AJ000263 Hs.278658 keratin, hair, basic, 6 (monilethrix) 10
131688 AW294659 Hs.34054 Homo sapiens cDNA: FLJ22488 fis, clone H 5
131927 AJ003112 Hs.34780 doublecortex; lissencephaly, X-linked (d 5
131949 AK000010 Hs.258798 hypothetical protein FLJ20003 10
132115 H81604 Hs.178471 KIAA0798 gene product 5
132177 X80818 Hs.178078 glutamate receptor, metabotropic 4 5
132296 AA467752 Hs.195161 ESTs 5
132426 AW118072 Hs.89981 diacylglycerol kinase, zeta (104kD) 10
132477 S68874 Hs.170917 prostaglandin E receptor 3 (subtype EP3) 5
132675 AI291496 Hs.5476 Homo sapiens, clone IMAGE:3530123, mRNA, 10
132796 NM_006283 Hs.173159 transforming, acidic coiled-coil contain 10
132898 W28548 Hs.224829 ESTs 10
132905 NM_004235 Hs.7934 Kruppel-like factor 4 (gut) 10
132953 BE175645 Hs.321264 LBP protein 32 5
133116 BE563966 Hs.6529 ESTs, Weakly similar to I78885 serine/th 5
133120 NM_003278 Hs.65424 tetranectin (plasminogen-binding protein 10
133139 AF052138 Hs.6580 Homo sapiens cDNA: FLJ23227 fis, clone C 5
133163 AA668224 Hs.6634 Homo sapiens cDNA: FLJ22547 fis, clone H 5
133258 AW956781 Hs.293937 ESTs, Weakly similar to FXD2_HUMAN FORKH 5
133272 NM_002776 Hs.69423 kallikrein 10 (KLK10) (PRSSL1) (nes1) 5
133379 AA207059 gb:zq80h09.s1 Stratagene hNT neuron (937 5
133407 AF017987 Hs.7306 secreted frizzled-related protein 1 5
133552 H21497 Hs.7471 BBP-like protein 1 5
133702 L02321 Hs.75652 glutathione S-transferase M5 5
133719 H26904 Hs.75736 apolipoprotein D 5
133731 N71725 Hs.272572 hemoglobin, alpha 2 10
133789 T85626 Hs.76239 hypothetical protein FLJ20608 5
134007 AF072441 Hs.7840 calcineurin binding protein 1 10
134055 D86062 Hs.182423 ES1 (zebrafish) protein, human homolog o 10
134111 AI372588 Hs.8022 TU3A protein 10
134117 AA081846 Hs.7921 Homo sapiens mRNA; cDNA DKFZp566E183 (fr 10
134177 BE243319 Hs.79672 KIAA0652 gene product 5
134308 AW905827 Hs.81454 ketohexokinase (fructokinase) 10
134361 BE549343 Hs.82208 acyl-Coenzyme A dehydrogenase, very long 5
134369 AF207664 Hs.8230 a disintegrin-like and metalloprotease ( 5
134449 L34155 Hs.83450 laminin, alpha 3 (nicein (150kD), kalini 5
134467 AI190413 Hs.8373 ESTs 10
134496 M64936 gb:Homo sapiens retinoic acid-inducible 10
134510 NM_002757 Hs.250870 mitogen-activated protein kinase kinase 10
134550 M26315 Hs.85258 CD8 antigen, alpha polypeptide (p32) 10
134577 BE244323 Hs.85951 exportin, tRNA (nuclear export receptor 5
134591 U73394 Hs.166085 killer cell immunoglobulin-like receptor 5
134678 AL008583 Hs.182595 dynein, axonemal, light polypeptide 4 5
134728 D10216 Hs.89394 POU domain, class 1, transcription facto 5
134758 NM_000078 Hs.89538 cholesteryl ester transfer protein, plas 10
134786 T29618 Hs.89640 TEK tyrosine kinase, endothelial (venous 10
134912 T87521 Hs.261457 ESTs 5
134963 NM_003394 Hs.91985 wingless-type MMTV integration site fami 10
134969 H22570 Hs.172572 hypothetical protein FLJ20093 5
135001 AA302517 Hs.92732 KIAA1444 protein 5
135066 X04430 Hs.93913 interleukin 6 (interferon, beta 2) 10
135173 AL036557 Hs.95910 putative lymphocyte G0/G1 switch gene 10
135197 U76456 Hs.190787 tissue inhibitor of metalloproteinase 4 5
135219 AB002361 Hs.96633 KIAA0363 protein 5
135250 U83171 Hs.97203 small inducible cytokine subfamily A (Cy 5
135304 AA416829 Hs.191597 ESTs 5
135337 AA905406 Hs.9905 ESTs, Weakly similar to unnamed protein 3
135417 X55019 Hs.99975 cholinergic receptor, nicotinic, delta p 10

101367 X03350 Hs.4 alcohol dehydrogenase 1B (class I), beta 5
128870 H39537 Hs.75309 eukaryotic translation elongation factor 5
129381 AW245805 Hs.110903 claudin 5 (transmembrane protein deleted 5
130085 M62402 Hs.274313 insulin-like growth factor binding prote 5
130689 NM_006691 Hs.17917 extracellular link domain-containing 1 10
133120 NM_003278 Hs.65424 tetranectin (plasminogen-binding protein 3
133407 AF017987 Hs.7306 secreted frizzled-related protein 1 5
133731 N71725 Hs.272572 hemoglobin, alpha 2 5
134369 AF207664 Hs.8230 a disintegrin-like and metalloprotease ( 5
135066 X04430 Hs.93913 interleukin 6 (interferon, beta 2) 10
135173 AL036557 Hs.95910 putative lymphocyte G0/G1 switch gene 5
322580 AK001852 Hs.274151 ligatin 5
408790 AW580227 Hs.47860 neurotrophic tyrosine kinase, receptor, type 2 10
418043 AW377752 Hs.83341 AXL receptor tyrosine kinase 5
427458 BE208364 Hs.29283 ESTs, Weakly similar to LKHU proteoglycan link 5
446674 AA563892 Hs.306000 solute carrier family 4 (anion exchanger), memb 10
449826 U85642 Hs.138506 ESTs 5
RCJi15814_s Human apM1 mRNA for GS3109 (novel adipose specific collagen 10
YEL024w/RIP1 EST:YEL024W/RIP1 3






WO 02/059377 






PCT/US02/02242 



TABLE 1A 






Table 1A shows the accession numbers for those pkeys lacking UnigeneIDs for Table 1.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey CAT number Accessions



108448 112224_1 AA085383 AA126091 AA074174 AA075373 AA079120 AA070831 AA075978 AA075372 AA128503

108497 11007912 AA074897 AA113914 AA084871 AA079329 AA071309 AA084710 AA129030 AA075042 AA074794 AA071453 AA078803
AA148628 AA122204 AA074159 AA126185 AA079117 AA127089 AA070912 AA079280 AA131372 AA078833 AA071087
AA076131 AA071047 AA079401 AA083070 AA102076 AA115163 AA074198 AA134725 AA113889 AA121103 AA075041
AA065148 AA071310 AA101144 AA079659 AA078931 AA079209 AA070928 AA068994 AA069817 AA076187 AA069053
AA131489 AA071308 AA063317 AA070156 AA071430 AA076056 AA075684 AA070053 AA126283 AA126078 AA075895
AA079208 AA074583 AA071086 AA079623 AA070627 AA078802 AA076622 AA065051 AA079143 AA071110 AA079434
AA148748 AA079230 AA085188 AA074485 AA070580 AA076151 AA083166 AA085118 AA079450 AA085044 AA120938
AA079200 AA100188 AA081472 AA122355 AA129031 AA085362 AA069220 AA070940 AA075968 AA074563 AA084027
AA115929

124215 1597154_1 H62570 H59063
117058 1219924_1 AW801806 H90434 BE086530
110455 46874_1 H52576 AF085971 H52172

111168 38585_1 AI798376 S46400 AW811617 AW811616 W00557 BE142245 AW858232 AW861851 AW858362 AA232351 AA218567
AA055556 AW858231 AW857541 AW814172 H66214 AW814398 AF134164 AA243093 AA173345 AA199942 AA223384
AA227092 AA227080 T12379 AA092174 T61139 AA149776 AA699829 AW879188 AW813567 AW813538 AI267168
AA157718 AA157719 AA100472 AA100774 AA130756 AA157705 AA157730 AA157715 AA053524 AW849581 AW854566
C05254 AW882836 T92637 AW812621 AA206583 AA209204 BE156909 AA226824 AI829309 AW991957 N66951 AA527374
H66215 AA045564 AI694265 H60808 AA149726 AW195620 BE081333 BE073424 AW817662 AW817705 AW817703
AW817659 BE081531 H59570

111498 411008_1 AI168511 AI022712 AA700366 R07371 R07324
104340 46289.10 AA426189 F15201
103747 117944_1 AA081995 AA101099

134496 46501.1 M64936 AI025512 AI382987 BE061777 AA089966 BE169930 T41176 AW594624 BE502415 AA121893 AI269283 T40311
AI684569 AA257011 AI079277 AI241318 BE327710 AW975215 AW896268 AA884990 BE327514

103750 118365_1 AA126129 AA126033 AA082561

105239 34624_1 AA221036 R87170 BE537068 BE544757 C18935 AW812058 T92565 AA227415 AA233942 AA223237 AA668403 AA601627
AW869639 BE061833 BE000620 AW961170 AW847519 AA308542 AW821833 AW945688 C04699 AA205504 AA377241
AW821667 AA055720 AW817981 AW856468 AA155719 AA179928 T03007 AW754298 AA227407 AA113928 AA307904
C16859

AL042725 BE063316 AW975610 AA457591 BE062092 AI655202 AA714296 AI267264 AI075321 AA223286 AA071122
AA227849 AA216700 AI696002 AA101867 AA099426 AA135997 AL041698 T02815 T51824 AA207189 T59230 T51868
AA663341 BE165757 AW818104 AW392886 AA584918 AA099408 AW856395 AW861859 AA053045

114624 111686_1 AA081507 AA070071 AA070840 AA084362
106851 322947.1 AI458623 AA639708 AA485409 R22065 AA485570
108392 113549_1 AA075124 AA075208
100545 22955 J 1 M55405 AW752552

100654 tigr_HT2969 A03758 A06977 A15293 D17029 D17107 D17171 L00132 L00133 M12523 M13075 M13076 M92816 U22961 V00494 V00495
X51363 X51364 X51365

100702 tigr_HT3413 L27065

102208 6735_9 U22961 AA203623 AA503337 AI174733 AI192802 C06092 AA035357 AI190619 AI199244 AI828450 AA602296 AI378195
AI209170 AI186653 AI127795 AI183846 H77389 AI589465 AA629390 H94306 AI018388 R68584 AA027196 AI745413
AI685092 AI093426 AI623873 AI074570 N50096 AA047486 N25060 AA327614 AI042512 AI383957 AA156873 AI333101
N70808 AI141254 AI383191 AI401237 AI080709 AI093400 W84549 T90806 R00012 W01413 AA630557 AI378348 AI559265
AA877103 W84464 AA625146 R68379 AI133207 AI132980 AI133214 AI064826 AI061615 AI133473 AI174852 AI133404
AI133272 V00494 M12523 M12523 AI207526 AI133120 AI064802 AI174993 AI114729 AI061645 AI064716 AI064959 H77388
T85706 AF075298 AI110799 D17107 NM_000477 AF190168 R50724 AI248416 AI207432 AI133684 AI133345 AI174710
AI133290 AI133304 AI174948 AI207484 AI110717 AF074624 AI114515 AF063516 AI110642 AI114559 AI114498 AI114759
AI207568 AI064960 AI174753 AI114666 R69184 R00011 AI064997 T60501 AI207701 T71735 AA385318 H73569 T60496
H94399 AI133158 T74675 AA484750 T73413 T55909 R50261 T72061 N80533 T51189 T74936 AI207490 AI132925 AI064701
AI174748 AI114663 AI133104 AI132999 AI133100 AI064925 AI064979 AI133063 AA343347 T69091 AA233989 T39772
AI444620 T52290 D16931 T40012 T48403 T58926 T69195 AI133061 T50850 AI400677 AI091136 AA334608 T57411 Z20979
N56507 T87485 AI133622 AA343370 T40075 T69671 T53849 T74820 AF075316 AI110818 T40121 T57381 AI114468
AA332728 T51362 AI114589 R06691 AI110629 AF063503 AI140543 AA334661 AA332720 AA343262 T73513 T86549
AI114840 T57284 T39981 T61407 T72757 T51749 T56630 AA343125 T72126 R94135 T83028 T39972 T39896 AI174786
AI132926 R09237 AI064838 AI133660 T60398 T88753 T55930 T92126 AI444602 T60996 AI114792 H93911 AI133106
R10779 AI065020 T90925 T50889 D17029 AI133703 AA333805 AI133040 AI133017 AI064857 AI110730 AF074637 AI207567
H71080 T73217 AA343950 AI174743 AA334224 AA334281 R06692 T64739 T40163 T60628 T81661 T73179 R01842
AA501730 T39931 T39662 T40136 AA334904 T71425 H77784 R00874 AI065049 T84512 T55918 AI207595 T39951
AA005016 T60361 T69176 T73356 T58795 T61233 T39955 T60612 AI114676 AI064778 AA035710 W52763 AI114786
T83564 AA341859 T81684 T55769 AI114710 T51776 AA343213 AI114714 T58102 AI110809 R28984 AI174854 AA305675
AA343592 T53836 T46869 T64721 T55508 W05241 T54019 T57945 T60513 T48364 AF075308 W86731 T82851 T48269
H54053 T73211 AI114590 T48317 T55965 T74857 R84226 T56552 T52231 T74946 T76976 R02576 T95666 AI203974
AI189471 AA005147 AI478102 AI207662 AI192792 AI768421 AI064737 AW051713 AA936693 AI133117 AI766232 AI913646
T83962 AI065112 AI207689 AI174684 AI207702 T81475 AI133325 AI032512 AA701169 AI936354 AI114720 AI433289
AA046980 AI823482 AI114536 AA860651 AW242644 R07469 AW300438 AI133416 AW271670 AI991363 T78943 AI823481
AA845518 AA719124 AA883454 T68850 T69115 AI935509 AI150977 T62890 T71374 T68294 AI174774 T67411 T68318
AI064689 T56624 T69010 T68982 T68302 AI332829 T72908 AI064819 AI205880 T62895 T69430 T95111 AA025050 T73330
W52657 T71984 T69118 W92684 AI114860 T62093 T61797 AI522333 T73322 H92981 T56018 T61811 T57232 AI336158
T61821 T69457 T62900 T62912 T72917 T46885 AI702448 T57212 T57203 R94581 T71311 T61819 T89358 T67708 T70918
T59166 AI187111 T64308 T62071 T69427 AI114750 T60430 R09734 T69033 T69141 T69453 T67908 R16809 T69394
AI207729 T55839 T90273 T73339 AW194909 T75486 T71850 T71305 T71287 T53877 T73452 T68852 N75290 AI312890
T67751 AI174983 T51679 T54851 H69880 N73734 AA443453 T73466 H69672 N53869 T68447 D11809 D12412 T64300
T28321 T55864

123941 genbank_AA621529 AA621529
118049 genbank_N53145 N53145
102800 14782_20 AA313538 U88895 U88902
104106 AA422123__LatAA422123_i
111738 genbank_R26065 R26065
113149 genbank_T51588 T51588
113958 genbank_W86195 W86195
108335 genbank_AA070500 AA070500
108351 genbank_AA071193 AA071193
108441 genbank_AA079079 AA079079
124276 genbank_H83465 H83465
101447 entrez_M21305 M21305
117226 genbank_N20468 N20468
133379 genbank_AA207059 AA207059 AA207241
119366 genbank_T77892 T77892
119528 NOT_FOUND_entrez_W38051 W38051
112588 genbank_R77302 R77302
114449 genbank_AA020736 AA020736
114576 genbank_AA055096 AA065096
107459 W38003j5Lfli W38002js
130339 genbank_AA435746 AA435746






TABLE 2: Figure 2 from BRCA 001 US
Table 2 shows genes downregulated in tumor tissue compared to normal breast tissue.

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of normal breast tissue to tumor
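As a rough illustration of how rows laid out according to the key above might be read programmatically, the following Python sketch splits one row into its fields. The helper name `parse_row` and the `Row` type are hypothetical and not part of this disclosure. The sketch assumes that fields are whitespace-separated, that R1 is the last field, and that a UnigeneID, when present, begins with "Hs."; rows that lack an R1 value are outside its scope.

```python
from typing import NamedTuple, Optional


class Row(NamedTuple):
    pkey: str
    ex_accn: str
    unigene_id: Optional[str]  # None when the probeset has no UnigeneID
    title: str
    r1: float


def parse_row(line: str) -> Row:
    """Split one table row into fields (assumed layout, see lead-in)."""
    tokens = line.split()
    pkey, ex_accn = tokens[0], tokens[1]
    r1 = float(tokens[-1])          # assumption: R1 is always the last token
    middle = tokens[2:-1]           # optional UnigeneID followed by the title
    unigene_id = None
    if middle and middle[0].startswith("Hs."):
        unigene_id = middle[0]
        middle = middle[1:]
    return Row(pkey, ex_accn, unigene_id, " ".join(middle), r1)


rows = [parse_row(text) for text in [
    "100499 T51966 Hs.283108 hemoglobin, gamma G 10",
    "102208 U22961 gb:Human mRNA clone with similarity to L 10",
    "110976 AL044174 Hs.159526 patched (Drosophila) homolog 5",
]]

# Pkeys lacking a UnigeneID are the ones that the companion "A" table resolves.
lacking_unigene = [row.pkey for row in rows if row.unigene_id is None]
```

Under these assumptions, `lacking_unigene` collects the pkeys that would need to be looked up by gene cluster number instead of by UnigeneID.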



Pkey ExAccn UnigeneID Unigene Title R1

100499 T51966 Hs.283108 hemoglobin, gamma G 10
100549 BE142019 Hs.222056 Homo sapiens cDNA FLJ11572 fis, clone HE 10
100654 A03758 NM_000477*:Homo sapiens albumin (ALB), m 10
100971 BE379727 Hs.83213 fatty acid binding protein 4, adipocyte 10
101184 NM_001674 Hs.460 activating transcription factor 3 10
101336 NM_006732 Hs.75678 FBJ murine osteosarcoma viral oncogene h 10
101367 X03350 Hs.4 alcohol dehydrogenase 1B (class I), beta 10
101447 M21305 gb:Human alpha satellite and satellite 3 10
101461 N98569 Hs.76422 phospholipase A2, group IIA (platelets, 10
101511 M27826 Hs.267319 endogenous retroviral protease 10
101736 M74447 Hs.502 transporter 2, ATP-binding cassette, sub-famil 10
102208 U22961 gb:Human mRNA clone with similarity to L 10
102450 U48251 Hs.75871 protein kinase C binding protein 1 10
102800 AA313538 gb:EST185419 Colon carcinoma (HCC) cell 10
102857 NM_006744 Hs.76461 retinol-binding protein 4, interstitial 10
102990 AA829286 Hs.332053 serum amyloid A1 10
103747 AA081995 gb:zn26d06.r1 Stratagene neuroepithelium 10
103812 AA137107 Hs.326391 Homo sapiens, clone MGC:16638, mRNA, com 10
104093 R50727 Hs.336970 ESTs 10
104109 AL353957 Hs.284181 hypothetical protein DKFZp434P0531 10
104250 F06638 Hs.12440 Homo sapiens clone 24734 mRNA sequence 10
104492 N73185 Hs.94285 EST 10
104506 N91071 Hs.109650 ESTs 10
104532 AI498763 Hs.203013 hypothetical protein FLJ12748 10
104677 AA009764 Hs.190380 ESTs 10
104711 AA017245 Hs.32794 ESTs 10
104731 AA019300 Hs.125070 ESTs, Moderately similar to I54374 gene 10
105005 AI298208 Hs.28805 ESTs 10
105036 AA130390 Hs.25549 hypothetical protein FLJ20898 10
105239 AA221036 gb:zr03f12.r1 Stratagene NT2 neuronal pr 10
106052 N79885 Hs.6382 ESTs, Highly similar to T00391 hypotheti 10
106181 AJ803651 Hs.191608 ESTs 10
106283 AI085846 Hs.25522 KIAA1808 protein 10
106379 AL042069 Hs.119021 DKFZP434N061 protein 10
106451 AW235928 Hs.313182 ESTs 10
106491 AA135688 Hs.10083 Homo sapiens, clone IMAGE:4139786, mRNA, 10
106782 AW054886 Hs.25682 Homo sapiens mRNA for KIAA1863 protein, 10
107124 AB006532 Hs.31442 RecQ protein-like 4 10
107148 AI005036 Hs.334305 GS1999full 10
107214 AF127026 Hs.5394 myosin IA 10
107242 AB020672 Hs.175411 KIAA0865 protein 10
107331 AI905985 Hs.111805 ESTs 10
107447 W28516 Hs.19210 hypothetical protein MGC11308 10
107451 AL042425 Hs.283976 hypothetical protein PRO2389 10
107872 BE271708 Hs.95110 ESTs, Weakly similar to A55943 1-phospha 10
108351 AA071193 gb:zf79b12.s1 Soares_pineal_gland_N3HPG 10
109546 F01449 Hs.26954 Homo sapiens mRNA; cDNA DKFZp762G123 (fr 10
110433 AW294162 Hs.301062 UDP-N-acetyl-alpha-D-galactosamine:polyp 10
110976 AL044174 Hs.159526 patched (Drosophila) homolog 5
111168 AI798376 gb:tr34b07.x1 NCI_CGAP_Ov23 Homo sapiens
111651 R16733 Hs.20499 ESTs
111803 AA593731 Hs.325823 ESTs, Moderately similar to ALU5_HUMAN A
114484 AA034378 Hs.267319 endogenous retroviral protease
125284 NM_002666 Hs.103253 perilipin
128850 AA193106 Hs.180817 chromosome 11 open reading frame 23
128903 AW150717 Hs.296176 STAT induced STAT inhibitor 3
129346 AF110141 Hs.288908 WAS protein family, member 2
129381 AW245805 Hs.110903 claudin 5 (transmembrane protein deleted
129516 AF020038 Hs.11223 isocitrate dehydrogenase 1 (NADP+), solu
129554 BE222078 Hs.113069 ESTs
130085 M62402 Hs.274313 insulin-like growth factor binding prote
130243 D88435 Hs.153227 cyclin G associated kinase
130400 V00517 Hs.283108 hemoglobin, gamma G
130436 NM_001928 Hs.155597 D component of complement (adipsin)
130563 BE270472 Hs.279900 HSPC015 protein
130589 AL110226 Hs.16441 DKFZP434H204 protein
130683 AA993269 Hs.17872 Homo sapiens, clone IMAGE:3875012, mRNA
130689 NM_006691 Hs.17917 extracellular link domain-containing 1
130689 AA046747 Hs.17917 extracellular link domain-containing 1
130718 N70196 Hs.18376 KIAA1319 protein
130798 M81349 Hs.1955 serum amyloid A4, constitutive
130840 BE048821 Hs.20144 small inducible cytokine subfamily A (Cy
131184 AB040935 Hs.23954 cerebral cell adhesion molecule
131282 X03350 Hs.4 alcohol dehydrogenase 1B (class I), beta
131328 AW939251 Hs.25647 v-fos FBJ murine osteosarcoma viral onco
131543 AW966881 Hs.41639 programmed cell death 2
131753 AA829286 Hs.332053 serum amyloid A1
131785 H69342 Hs.26320 TRABID protein
131828 AJ000263 Hs.278658 keratin, hair, basic, 6 (monilethrix)
132426 AW118072 Hs.89981 diacylglycerol kinase, zeta (104kD)
132675 AI291496 Hs.5476 Homo sapiens, clone IMAGE:3530123, mRNA,
132898 W28548 Hs.224829 ESTs
132905 NM_004235 Hs.7934 Kruppel-like factor 4 (gut)
133120 NM_003278 Hs.65424 tetranectin (plasminogen-binding protein
133407 AF017987 Hs.7306 secreted frizzled-related protein 1
133719 H26904 Hs.75736 apolipoprotein D
134007 AF072441 Hs.7840 calcineurin binding protein 1
134055 D86062 Hs.182423 ES1 (zebrafish) protein, human homolog o
134111 AI372588 Hs.8022 TU3A protein
134117 AA081846 Hs.7921 Homo sapiens mRNA; cDNA DKFZp566E183 (fr
134177 BE243319 Hs.79672 KIAA0652 gene product
134369 AF207664 Hs.8230 a disintegrin-like and metalloprotease (
134496 M64936 gb:Homo sapiens retinoic acid-inducible
134510 NM_002757 Hs.250870 mitogen-activated protein kinase kinase
134550 M26315 Hs.85258 CD8 antigen, alpha polypeptide (p32)
134758 NM_000078 Hs.89538 cholesteryl ester transfer protein, plas
134963 NM_003394 Hs.91985 wingless-type MMTV integration site fami
135066 X04430 Hs.93913 interleukin 6 (interferon, beta 2)
408790 AW580227 Hs.47860 neurotrophic tyrosine kinase, receptor, type 2
446674 AA563892 Hs.306000 solute carrier family 4 (anion exchanger), memb

TABLE 2A

Table 2A shows the accession numbers for those pkeys lacking UnigeneIDs for Table 2.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey CAT number Accessions

111168 38585_1 AI798376 S46400 AW811617 AW811616 W00557 BE142245 AW858232 AW861851 AW858362 AA232351 AA218567
AA055556 AW858231 AW857541 AW814172 H66214 AW814398 AF134164 AA243093 AA173345 AA199942 AA223384
AA227092 AA227080 T12379 AA092174 T61139 AA149776 AA699829 AW879188 AW813567 AW813538 AI267168
AA157718 AA157719 AA100472 AA100774 AA130756 AA157705 AA157730 AA157715 AA053524 AW849581 AW854566
C05254 AW882836 T92637 AW812621 AA206583 AA209204 BE156909 AA226824 AI829309 AW991957 N66951 AA527374
H66215 AA045564 AI694265 H60808 AA149726 AW195620 BE081333 BE073424 AW817662 AW817705 AW817703
AW817659 BE081531 H59570

103747 117944_1 AA081995 AA101099

134496 46501.1 M64936 AI025512 AI382987 BE061777 AA089966 BE169930 T41176 AW594624 BE502415 AA121893 AI269283 T40311
AI684569 AA257011 AI079277 AI241318 BE327710 AW975215 AW896268 AA884990 BE327514

105239 34624_1 AA221036 R87170 BE537068 BE544757 C18935 AW812058 T92565 AA227415 AA233942 AA223237 AA668403 AA601627
AW869639 BE061833 BE000620 AW961170 AW847519 AA308542 AW821833 AW945688 C04699 AA205504 AA377241
AW821667 AA055720 AW817981 AW856468 AA155719 AA179928 T03007 AW754298 AA227407 AA113928 AA307904
C16859

100654 tigr_HT2969 A03758 A06977 A15293 D17029 D17107 D17171 L00132 L00133 M12523 M13075 M13076 M92816 U22961 V00494 V00495
X51363 X51364 X51365

102208 6735_9 U22961 AA203623 AA503337 AI174733 AI192802 C06092 AA035357 AI190619 AI199244 AI828450 AA602296 AI378195
AI209170 AI186653 AI127795 AI183846 H77389 AI589465 AA629390 H94306 AI018388 R68584 AA027196 AI745413
AI685092 AI093426 AI623873 AI074570 N50096 AA047486 N25060 AA327614 AI042512 AI383957 AA156873 AI333101
N70806 AI141254 AI383191 AI401237 AI080709 AI093400 W84549 T90806 R00012 W01413 AA630557 AI378348 AI559265
AA877103 W84464 AA625146 R68379 AI133207 AI132980 AI133214 AI064826 AI061615 AI133473 AI174852 AI133404
AI133272 V00494 M12523 M12523 AI207526 AI133120 AI064802 AI174993 AI114729 AI061645 AI064716 AI064959 H77388
T85706 AF075298 AI110799 D17107 NM_000477 AF190168 R50724 AI248416 AI207432 AI133684 AI133345 AI174710
AI133290 AI133304 AI174948 AI207484 AI110717 AF074624 AI114515 AF063516 AI110642 AI114559 AI114498 AI114759
AI207568 AI064960 AI174753 AI114666 R69184 R00011 AI064997 T60501 AI207701 T71735 AA385318 H73569 T60496
H94399 AI133158 T74675 AA484750 T73413 T55909 R50261 T72061 N80533 T51189 T74936 AI207490 AI132925 AI064701
AI174748 AI114663 AI133104 AI132999 AI133100 AI064925 AI064979 AI133063 AA343347 T69091 AA233989 T39772
AI444620 T52290 D16931 T40012 T48403 T58926 T69195 AI133061 T50850 AI400677 AI091136 AA334608 T57411 Z20979
N56507 T87485 AI133622 AA343370 T40075 T69671 T53849 T74820 AF075316 AI110818 T40121 T57381 AI114468
AA332728 T51362 AI114589 R06691 AI110629 AF063503 AI140543 AA334661 AA332720 AA343262 T73513 T86549
AI114840 T57284 T39981 T61407 T72757 T51749 T56630 AA343125 T72126 R94135 T83028 T39972 T39896 AI174786
AI132926 R09237 AI064838 AI133660 T60398 T88753 T55930 T92126 AI444602 T60996 AI114792 H93911 AI133106
R10779 AI065020 T90925 T50889 D17029 AI133703 AA333805 AI133040 AI133017 AI064857 AI110730 AF074637 AI207567
H71080 T73217 AA343950 AI174743 AA334224 AA334281 R06692 T64739 T40163 T60628 T81661 T73179 R01842
AA501730 T39931 T39662 T40136 AA334904 T71425 H77784 R00874 AI065049 T84512 T55918 AI207595 T39951
AA005016 T60361 T69176 T73356 T58795 T61233 T39955 T60612 AI114676 AI064778 AA035710 W52763 AI114786
T83564 AA341859 T81684 T55769 AI114710 T51776 AA343213 AI114714 T58102 AI110809 R28984 AI174854 AA305675
AA343592 T53836 T46869 T64721 T55508 W05241 T54019 T57945 T60513 T48364 AF075308 W86731 T82851 T48269
H54053 T73211 AI114590 T48317 T55965 T74857 R84226 T56552 T52231 T74946 T76976 R02576 T95666 AI203974
AI189471 AA005147 AI478102 AI207662 AI192792 AI768421 AI064737 AW051713 AA936693 AI133117 AI766232 AI913646
T83962 AI065112 AI207689 AI174684 AI207702 T81475 AI133325 AI032512 AA701169 AI936354 AI114720 AI433289
AA046980 AI823482 AI114536 AA860651 AW242644 R07469 AW300438 AI133416 AW271670 AI991363 T78943 AI823481
AA845518 AA719124 AA883454 T68850 T69115 AI935509 AI150977 T62890 T71374 T68294 AI174774 T67411 T68318
AI064689 T56624 T69010 T68982 T68302 AI332829 T72908 AI064819 AI205880 T62895 T69430 T95111 AA025050 T73330
W52657 T71984 T69118 W92684 AI114860 T62093 T61797 AI522333 T73322 H92981 T56018 T61811 T57232 AI336158
T61821 T69457 T62900 T62912 T72917 T46885 AI702448 T57212 T57203 R94581 T71311 T61819 T89358 T67708 T70918
T59166 AI187111 T64308 T62071 T69427 AI114750 T60430 R09734 T69033 T69141 T69453 T67908 R16809 T69394
AI207729 T55839 T90273 T73339 AW194909 T75486 T71850 T71305 T71287 T53877 T73452 T68852 N75290 AI312890
T67751 AI174983 T51679 T54851 H69880 N73734 AA443453 T73466 H69672 N53869 T68447 D11809 D12412 T64300
T28321 T55864

102800 14782_20 AA313538 U88895 U88902
108351 genbank_AA071193 AA071193

101447 entrez_M21305 M21305

TABLE 3: Figure 3 from BRCA 001 US
Table 3 shows genes downregulated in tumor tissue compared to normal breast tissue.

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of normal breast tissue to tumor

Pkey ExAccn UnigeneID Unigene Title R1

101336 NM_006732 Hs.75678 FBJ murine osteosarcoma viral oncogene h 10.0
102208 U22961 gb:Human mRNA clone with similarity to L 10.0
102990 AA829286 Hs.332053 serum amyloid A1 10.0
111168 AI798376 gb:tr34b07.x1 NCI_CGAP_Ov23 Homo sapiens 10.0
111803 AA593731 Hs.325823 ESTs, Moderately similar to ALU5_HUMAN A 10.0
130085 M62402 Hs.274313 insulin-like growth factor binding prote 10.0
130840 BE048821 Hs.20144 small inducible cytokine subfamily A (Cy 10.0
131543 AW966881 Hs.41639 programmed cell death 2 10.0
133120 NM_003278 Hs.65424 tetranectin (plasminogen-binding protein 10.0
134758 NM_000078 Hs.89538 cholesteryl ester transfer protein, plas 10.0

TABLE 3A

Table 3A shows the accession numbers for those pkeys lacking UnigeneIDs for Table 3. For
each probeset, we have listed the gene cluster number from which the oligonucleotides were
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and
mRNAs. These sequences were clustered based on sequence similarity using Clustering and
Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers for
sequences comprising each cluster are listed in the "Accession" column. 
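
The relationship described above (pkey, gene cluster number, member accessions) can be sketched as a simple lookup structure. The following Python fragment is illustrative only: the function names are hypothetical, the sample rows are short entries of the kind listed in Table 1A, and the underscore separator in the CAT numbers is an assumption about the original notation. It does not reproduce the sequence-similarity clustering itself, which was performed with the DoubleTwist tools.

```python
from typing import Dict, List, Tuple


def parse_cluster_rows(lines: List[str]) -> Dict[str, Tuple[str, List[str]]]:
    """Map each pkey to (CAT number, list of Genbank accessions)."""
    clusters: Dict[str, Tuple[str, List[str]]] = {}
    for line in lines:
        tokens = line.split()
        if len(tokens) < 3:
            continue  # skip rows that lack a pkey, CAT number, or accession
        clusters[tokens[0]] = (tokens[1], tokens[2:])
    return clusters


def index_by_accession(
    clusters: Dict[str, Tuple[str, List[str]]]
) -> Dict[str, List[str]]:
    """Reverse index: accession -> pkeys whose cluster contains it."""
    index: Dict[str, List[str]] = {}
    for pkey, (_cat, accessions) in clusters.items():
        for accession in accessions:
            index.setdefault(accession, []).append(pkey)
    return index


# Sample rows; CAT separators written with "_" by assumption.
sample = [
    "103747 117944_1 AA081995 AA101099",
    "102800 14782_20 AA313538 U88895 U88902",
    "101447 entrez_M21305 M21305",
]
clusters = parse_cluster_rows(sample)
by_accession = index_by_accession(clusters)
```

A reverse index of this kind would let a reader start from any Genbank accession in the "Accession" column and recover the probeset it belongs to.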

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey CAT number Accessions

111168 38585_1 AI798376 S46400 AW811617 AW811616 W00557 BE142245 AW858232 AW861851 AW858362 AA232351 AA218567
AA055556 AW858231 AW857541 AW814172 H66214 AW814398 AF134164 AA243093 AA173345 AA199942 AA223384
AA227092 AA227080 T12379 AA092174 T61139 AA149776 AA699829 AW879188 AW813567 AW813538 AI267168
AA157718 AA157719 AA100472 AA100774 AA130756 AA157705 AA157730 AA157715 AA053524 AW849581 AW854566
C05254 AW882836 T92637 AW812621 AA206583 AA209204 BE156909 AA226824 AI829309 AW991957 N66951 AA527374
H66215 AA045564 AI694265 H60808 AA149726 AW195620 BE081333 BE073424 AW817662 AW817705 AW817703
AW817659 BE081531 H59570

102208 6735_9 U22961 AA203623 AA503337 AI174733 AI192802 C06092 AA035357 AI190619 AI199244 AI828450 AA602296 AI378195
AI209170 AI186653 AI127795 AI183846 H77389 AI589465 AA629390 H94306 AI018388 R68584 AA027196 AI745413
AI685092 AI093426 AI623873 AI074570 N50096 AA047486 N25060 AA327614 AI042512 AI383957 AA156873 AI333101
N70805 AI141254 AI383191 AI401237 AI080709 AI093400 W84549 T90806 R00012 W01413 AA630557 AI378348 AI559265
AA877103 W84464 AA625146 R68379 AI133207 AI132980 AI133214 AI064826 AI061615 AI133473 AI174852 AI133404
AI133272 V00494 M12523 M12523 AI207526 AI133120 AI064802 AI174993 AI114729 AI061645 AI064716 AI064959 H77388
T85706 AF075298 AI110799 D17107 NM_000477 AF190168 R50724 AI248416 AI207432 AI133684 AI133345 AI174710
AI133290 AI133304 AI174948 AI207484 AI110717 AF074624 AI114515 AF063516 AI110642 AI114559 AI114498 AI114759
AI207568 AI064960 AI174753 AI114666 R69184 R00011 AI064997 T60501 AI207701 T71735 AA385318 H73569 T60496
H94399 AI133158 T74675 AA484750 T73413 T55909 R50261 T72061 N80533 T51189 T74936 AI207490 AI132925 AI064701
AI174748 AI114663 AI133104 AI132999 AI133100 AI064925 AI064979 AI133063 AA343347 T69091 AA233989 T39772
AI444620 T52290 D16931 T40012 T48403 T58926 T69195 AI133061 T50850 AI400677 AI091136 AA334608 T57411 Z20979
N56507 T87485 AI133622 AA343370 T40075 T69671 T53849 T74820 AF075316 AI110818 T40121 T57381 AI114468
AA332728 T51362 AI114589 R06691 AI110629 AF063503 AI140543 AA334661 AA332720 AA343262 T73513 T86549
AI114840 T57284 T39981 T61407 T72757 T51749 T56630 AA343125 T72126 R94135 T83028 T39972 T39896 AI174786
AI132926 R09237 AI064838 AI133660 T60398 T88753 T55930 T92126 AI444602 T60996 AI114792 H93911 AI133106
R10779 AI065020 T90925 T50889 D17029 AI133703 AA333805 AI133040 AI133017 AI064857 AI110730 AF074637 AI207567
H71080 T73217 AA343950 AI174743 AA334224 AA334281 R06692 T64739 T40163 T60628 T81661 T73179 R01842
AA501730 T39931 T39662 T40136 AA334904 T71425 H77784 R00874 AI065049 T84512 T55918 AI207595 T39951
AA005016 T60361 T69176 T73356 T58795 T61233 T39955 T60612 AI114676 AI064778 AA035710 W52763 AI114786
T83564 AA341859 T81684 T55769 AI114710 T51776 AA343213 AI114714 T58102 AI110809 R28984 AI174854 AA305675
AA343592 T53836 T46869 T64721 T55508 W05241 T54019 T57945 T60513 T48364 AF075308 W86731 T82851 T48269
H54053 T73211 AI114590 T48317 T55965 T74857 R84226 T56552 T52231 T74946 T76976 R02576 T95666 AI203974
AI189471 AA005147 AI478102 AI207662 AI192792 AI768421 AI064737 AW051713 AA936693 AI133117 AI766232 AI913646
T83962 AI065112 AI207689 AI174684 AI207702 T81475 AI133325 AI032512 AA701169 AI936354 AI114720 AI433289
AA046980 AI823482 AI114536 AA860651 AW242644 R07469 AW300438 AI133416 AW271670 AI991363 T78943 AI823481
AA845518 AA719124 AA883454 T68850 T69115 AI935509 AI150977 T62890 T71374 T68294 AI174774 T67411 T68318
AI064689 T56624 T69010 T68982 T68302 AI332829 T72908 AI064819 AI205880 T62895 T69430 T95111 AA025050 T73330
W52657 T71984 T69118 W92684 AI114860 T62093 T61797 AI522333 T73322 H92981 T56018 T61811 T57232 AI336158
T61821 T69457 T62900 T62912 T72917 T46885 AI702448 T57212 T57203 R94581 T71311 T61819 T89358 T67708 T70918
T59166 AI187111 T64308 T62071 T69427 AI114750 T60430 R09734 T69033 T69141 T69453 T67908 R16809 T69394
AI207729 T55839 T90273 T73339 AW194909 T75486 T71850 T71305 T71287 T53877 T73452 T68852 N75290 AI312890
T67751 AI174983 T51679 T54851 H69880 N73734 AA443453 T73466 H69672 N53869 T68447 D11809 D12412 T64300
T28321 T55864

TABLE 4: Figure 4 from BRCA 001 US
Table 4 shows genes upregulated in tumor tissue compared to normal breast tissue.

Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: Ratio of tumor to normal breast tissue 
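
By way of a hedged illustration of the R1 convention, the short Python sketch below forms the ratio in both directions. The intensity values are hypothetical and are not taken from the tables, and the function name `expression_ratio` is an assumption introduced for this example; the sketch only shows that Table 4 reports tumor over normal, whereas Tables 2 and 3 report normal over tumor.

```python
def expression_ratio(numerator: float, denominator: float) -> float:
    """Return numerator / denominator for two expression measurements."""
    if denominator <= 0:
        raise ValueError("denominator expression level must be positive")
    return numerator / denominator


# Hypothetical expression levels for one probeset (not data from the tables).
tumor_level = 840.0
normal_level = 200.0

# Table 4 convention: tumor relative to normal breast tissue (upregulated genes).
r1_upregulated = expression_ratio(tumor_level, normal_level)

# Tables 2 and 3 convention: normal breast tissue relative to tumor.
r1_downregulated = expression_ratio(normal_level, tumor_level)
```

With these made-up inputs the probeset would appear with an R1 above 1 under the Table 4 convention and below 1 under the convention of Tables 2 and 3.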



15 Pkey ExAccn UnlgenelD Unigene Title 



R1 



100113 NM_001269Hs.84746 chromosome condensation 1 2 3 

100114 X02308 Hs.82962 thymidybte synthetase 29 
100131 D12485 Hs.11951 ectoniideotkJe pyroptoKpte^ 19 

100146 BE185499 Hs.2471 KIAA0020 gene product 19 

100147 D13666 Hs.136348 osteoblast specific factor 2 (fasciclln Wke) (pertostin) 7 5 
100154 H60720 Hs.81892 KIAA0101 gene product 92 
100163 W44671 Hs.124 gene predicted from cDNA with a complete coding sequence ifi 
100220 AW015534 Hs.217493 annexinA2 20 
100265 D38521 Hs.1 12396 WAA0077 protein 15 
100271 BE160081 Hs.256290 S100 calcium-binding protein A11 (catgizzarin) 135 
100275 BE242802 Hs.154797 WAA0090 protein 5 1 
100323 D50920 Hs.23106 KIAA01 30 gene product 19 
100335 AW247529 Hs.6793 platetetectivating factor acetylhydrolase, Isoform lb, gamma subunit (29kD) 27 
100364 N1VL004341HS.154868 carbamoyt-phosphate synthetase 2, aspartate transcarbamylase, and dihydroorotase 2.0 
I00372 NM.014791HS.184339 WAA0175 gene product 26 
100393 D84145 Hs.39913 novel RGO-containing protein 32 
I00400 AW954324 Hs.75790 phosphatidylinositol glycan, class C - 15 
100418 D86978 Hs.84790 KIAA0225 protein 2 !o 
100482 M65028 Hs.81361 heterogeneous nuclear rtbonucleoprotelnA/B 29 
100518 NM_004415Hs.74316 desmoplaWn (DPI, DPII) 19 

100666 L05424 Hs.169610 CD44 antigen (homing function and Indian blood group system) 57 

100667 L05424 Hs.169610 CD44 antigen (homing function and Indian blood group system) 9.0 

100668 L05424 Hs.169610 CD44 antigen (homing function and Indian blood group system) 7.6
100678 AW502935 Hs.740 PTK2 protein tyrosine kinase 2 53.2
100685 AA328229 Hs.184582 ribosomal protein L24 18

100690 AA363256 Hs.1657 estrogen receptor 1 16

100783 AF078847 Hs.191356 general transcription factor IIH, polypeptide 2 (44kD subunit) 59

100850 AA836472 Hs.297939 cathepsin B 17

100892 BE245294 Hs.180769 S164 protein jj
100945 AF002225 Hs.180686 ubiquitin protein ligase E3A (human papilloma virus E6-associated protein, Angelman syndrome) 15

100969 AA157634 Hs.79172 solute carrier family 25 (mitochondrial carrier; adenine nucleotide translocator), member 5 6.3

100988 AK000405 Hs.76480 ubiquitin-like 4 114

100999 H38765 Hs.80706 diaphorase (NADH/NADPH) (cytochrome b-5 reductase) 1.6

101031 J05070 Hs.151738 matrix metalloproteinase 9 (gelatinase B, 92kD gelatinase, 92kD type IV collagenase) 8.2

101045 J05614 gb:Human proliferating cell nuclear antigen (PCNA) gene, promoter region. 5.0

101077 N99692 Hs.75227 Empirically selected from AFFX single probeset v 2.6

! !! 10 ?? ^l^JHS 93 procollagen-lysine, 2-oxoglutarate 5-dioxygenase (lysine hydroxylase, Ehlers-Danlos syndrome type VI) 1.4

101161 NM_006262 Hs.37044 peripherin 169

101186 AA020956 Hs.179681 core-binding factor, beta subunit 20

101216 AA284166 Hs.84113 cyclin-dependent kinase inhibitor 3 (CDK2-associated dual specificity phosphatase) 18

101228 AA333387 Hs.82916 chaperonin containing TCP1, subunit 6A (zeta 1) 17

101247 AA132666 Hs.78802 glycogen synthase kinase 3 beta 19

101249 L18964 Hs.1904 protein kinase C, iota 1*5

101332 J04088 Hs.156346 topoisomerase (DNA) II alpha (170kD) 52

101332 J04088 Hs.156346 topoisomerase (DNA) II alpha (170kD) 3 4

101352 AI494299 Hs.16297 COX17 (yeast) homolog, cytochrome c oxidase assembly protein 63

101396 BE267931 Hs.78996 proliferating cell nuclear antigen 4.2

101445 M21259 gb:Human Alu repeats in the region 5' to the small nuclear rib 19

101470 NM_000546 Hs.1846 tumor protein p53 (Li-Fraumeni syndrome) u
101478 NM_002890 Hs.758 RAS p21 protein activator (GTPase activating protein) 1 2.5
101483 M24486 Hs.76768 procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline 4-hydroxylase), alpha polypeptide I 5.5
101540 J04977 Hs.84981 X-ray repair complementing defective repair in Chinese hamster cells 5 (double-strand-break rejoining 2.1 

101573 AW248421 Hs.250758 proteasome (prosome, macropain) 26S subunit, ATPase, 3 1.6

101580 NM_012151 Hs.83363 coagulation factor VIII-associated (intronic transcript) 5.7

101592 AF064853 Hs.91299 guanine nucleotide binding protein (G protein), beta polypeptide 2 1.8

101592 AF064853 Hs.91299 guanine nucleotide binding protein (G pr 5.6

101621 BE391804 Hs.62661 guanylate binding protein 1, interferon-inducible, 67kD 2.4

101702 AW504089 Hs.179574 protein phosphatase 2 (formerly 2A), regulatory subunit B (PR 52), alpha isoform 1.3

101734 M74099 Hs.147049 cut (Drosophila)-like 1 (CCAAT displacement protein) 2.1

101759 M80244 Hs.184601 solute carrier family 7 (cationic amino acid transporter, y+ system), member 5 5.0

101767 M81057 Hs.180884 carboxypeptidase B1 (tissue) 14.4

101782 AA306495 Hs.1869 phosphoglucomutase 1 5.2 

101805 AW409747 Hs.75612 stress-induced-phosphoprotein 1 (Hsp70/Hsp90-organizing protein) 8.6

101806 AA586894 Hs.112408 S100 calcium-binding protein A7 (psoriasin 1) 8.9

101810 NM_000318 Hs.180612 peroxisomal membrane protein 3 (35kD, Zellweger syndrome) 3.2

101879 AA176374 Hs.243686 nuclear autoantigenic sperm protein (histone-binding) 1.6

101911 AA441787 Hs.119689 glycoprotein hormones, alpha polypeptide 31.3

101920 AF182645 Hs.8024 IK cytokine, down-regulator of HLA II 1.8 

101973 U41514 Hs.80120 UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase 1 (GalNAc-T1) 2.4

101983 AI904232 Hs.75323 prohibitin 8.4

102009 BE245149 Hs.82643 protein tyrosine kinase 9 1.3

102036 BE250127 Hs.82906 CDC20 (cell division cycle 20, S. cerevisiae, homolog) 2.0

102083 T35901 Hs.75117 interleukin enhancer binding factor 2, 45kD 16

102083 T35901 Hs.75117 interleukin enhancer binding factor 2, 4 13

102107 BE258602 Hs.182366 heat shock protein 75 14

102123 NM_001809 Hs.1594 centromere protein A (17kD) 1.8

102165 BE313280 Hs.159627 death associated protein 3 4.6

102198 AW950852 Hs.74598 polymerase (DNA directed), delta 2, regulatory subunit (50kD) 4.3

102217 AA829978 Hs.301613 JTV1 gene 6.7

102220 U24389 Hs.65436 lysosomal 4.3

102234 AW163390 Hs.278554 heterochromatin-like protein 1 19

102260 AL039104 Hs.159557 karyopherin alpha 2 (RAG cohort 1, importin alpha 1) 4.4

102302 AA306342 Hs.69171 protein kinase C-like 2 2.7

102330 BE298063 Hs.77254 chromobox homolog 1 (Drosophila HP1 beta) 15

102339 BE378432 Hs.95577 cyclin-dependent kinase 4 2.3

102348 U37519 Hs.87539 aldehyde dehydrogenase 3 family, member B2 2.0

102349 AU077055 Hs.289107 baculoviral IAP repeat-containing 2 3.2
102369 U39840 Hs.299867 hepatocyte nuclear factor 3, alpha 2.0

102374 U33635 Hs.90572 PTK7 protein tyrosine kinase 7 6.2

102391 AA296874 Hs.77494 deoxyguanosine kinase 15

102455 U48705 Hs.75562 discoidin domain receptor family, member 1 6.9

102465 NM_001359 Hs.81548 2,4-dienoyl CoA reductase 1, mitochondrial 18

102488 U50939 Hs.61828 amyloid beta precursor protein-binding protein 1, 59kD 15

102489 AL080116 Hs.74420 origin recognition complex, subunit 3 (yeast homolog)-like 3.3

102494 AI188137 Hs.75193 COP9 homolog 2.1

102501 AF217197 Hs.74562 siah binding protein 1; FBP interacting repressor, pyrimidine tract binding splicing 3.2

102522 BE250944 Hs.183556 solute carrier family 1 (neutral amino acid transporter), member 5 18

102532 AF040253 Hs.70186 suppressor of Ty (S.cerevisiae) 5 homolog 5.7

102564 U59423 Hs.79087 MAD (mothers against decapentaplegic, Drosophila) homolog 1 2.3

102568 W81489 Hs.223025 RAB31, member RAS oncogene family 5.3

102580 U60808 Hs.152981 CDP-diacylglycerol synthase (phosphatidate cytidylyltransferase) 1 2.1

102581 AU077228 Hs.77256 enhancer of zeste (Drosophila) homolog 2 16

102582 U61232 Hs.32675 tubulin-specific chaperone e 2.1
102617 AW161453 Hs.198767 COP9 (constitutive photomorphogenic, Arabidopsis, homolog) s^ 1.8

102618 AL037672 Hs.81071 extracellular matrix protein 1 5.8

102627 AL021918 Hs.158174 zinc finger protein 184 (Kruppel-like) 13

102663 NM_002270 Hs.168075 karyopherin (importin) beta 2 18

102676 BE262989 Hs.12045 putative protein 2.3

102687 NM_007019 Hs.93002 ubiquitin carrier protein E2-C 4.3

102689 U96132 Hs.171280 hydroxyacyl-Coenzyme A dehydrogenase, type II 6.0

102696 BE540274 Hs.239 forkhead box M1 4.2

102704 AU077058 Hs.54089 BRCA1 associated RING domain 1 19

102705 T97490 Hs.50002 small inducible cytokine subfamily A (Cys-Cys), member 19 2.3
102750 AB014460 Hs.66196 nth (E.coli endonuclease III)-like 1 U

102801 BE252241 Hs.38041 pyridoxal (pyridoxine, vitamin B6) kinase 6.4

102812 U90549 Hs^36774 high-mobility group (nonhistone chromosomal) protein 17-like 3 16
102827 BE244588 Hs.6456 chaperonin containing TCP1, subunit 2 (beta) 56

102831 AA262170 Hs.80917 adaptor-related protein complex 3, sigma 1 subunit 20

102844 AV653790 Hs.324275 WW domain-containing protein 1 13

102868 X02419 Hs.77274 plasminogen activator, urokinase 44
102925 BE440142 Hs.2943 signal recognition particle 19kD

102935 BE561850 Hs.80506 small nuclear ribonucleoprotein polypeptide A 1 14
102968 AU076611 Hs.154672 methylene tetrahydrofolate dehydrogenase (NAD+ dependent), methenyltetrahydrofolate cyclohydrolase 27

102983 BE387202 Hs.118638 non-metastatic cells 1, protein (NM23A) expressed in 31

102985 U95742 Hs.2707 G1 to S phase transition 1 52

103023 AW500470 Hs.117950 mtiltifuncttonal pdyp^ 1 6 

103038 AA926960 Hs.334883 CDC28 protein kinase 1 25

103060 NM_005940 Hs.155324 matrix metalloproteinase 11 (MMP11; stromelysin 3) 4*5

103080 AU077231 Hs.82932 cyclin D1 (PRAD1: parathyroid adenomatosis 1) 3.1

103089 D31152 Hs.179729 collagen, type X, alpha 1 (Schmid metaphyseal chondrodysplasia) 2 4

103177 BE244377 Hs.48876 farnesyl-diphosphate farnesyltransferase 1 * 35

103178 AA205475 Hs.275865 ribosomal protein S18 9 9

103179 NM_001777 Hs.82685 CD47 antigen (Rh-related antigen, integrin-associated signal transducer) U
103181 X69636 Hs.334731 Homo sapiens, clone IMAGE:3448306, mRNA, partial cds Z0
103185 NM_006825 Hs.74368 transmembrane protein (63kD), endoplasmic reticulum/Golgi intermediate compartment 1.6
103191 AA401039 Hs.2903 protein phosphatase 4 (formerly X), catalytic subunit 25

103193 NM_004766 Hs.75724 coatomer protein complex, subunit beta 2 (beta prime) 22

103194 NM_004939 Hs.78580 DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 1 6 3
103206 X72755 Hs.77367 monokine induced by gamma interferon 88
103223 BE275607 Hs.1708 chaperonin containing TCP1, subunit 3 (gamma) 30
103232 X75962 Hs.129780 tumor necrosis factor receptor superfamily, member 4 18
103238 AI369285 Hs.75189 death-associated protein 56
103297 NM_001545 Hs.9078 immature colon carcinoma transcript 1 19
103330 AI803447 Hs.77496 small nuclear ribonucleoprotein polypeptide G 2.5
103349 X89059 gb:H.sapiens mRNA for unknown protein expressed in macrophage 1 6
103376 AL036166 Hs.323378 coated vesicle membrane protein 1.8

103391 X94453 Hs.114366 pyrroline-5-carboxylate synthetase (glutamate gamma-semialdehyde synthetase) 2.3

103392 X94563 gb:H.sapiens dbi/acbp gene exon 1 & 2. 4.0
103430 BE564090 Hs.20716 translocase of inner mitochondrial membrane 17 (yeast) homolog A 1.3
103491 AF264750 Hs.288971 myeloid/lymphoid or mixed-lineage leukemia 3 5.6
103505 AL031224 Hs.33102 transcription factor AP-2 beta (activating enhancer-binding protein 2 beta) 5.1
103547 AI376722 Hs.180062 proteasome (prosome, macropain) subunit, beta type, 8 (large multifunctional protease 7) 9.7
103588 NM_006218 Hs.85701 phosphoinositide-3-kinase, catalytic alpha polypeptide 2.0
103613 NM_000346 Hs.2316 SRY (sex determining region Y)-box 9 (campomelic dysplasia, autosomal sex-reversal) U
103621 BE379766 Hs.150675 polymerase (RNA) II (DNA directed) polypeptide K (7.0kD) 2.0
103622 AA609685 Hs.278672 membrane component, chromosome 11, surface marker 1 23
103727 AI878883 Hs.296381 growth factor receptor-bound protein 2 13
103749 AL135301 Hs.8768 hypothetical protein FLJ10849 1 8
103754 AI015709 Hs.172089 Homo sapiens mRNA; cDNA DKFZp586I2022 (from clone DKFZp586I2022) U
103780 AA094752 Hs.169992 hypothetical 43.2 Kd protein 73
103795 H26531 Hs.7367 Homo sapiens BTB domain protein (BDPL) mRNA, partial cds U
103797 AA080912 gb:zn04d03.r1 Stratagene hNT neuron (937233) Homo sapiens cDNA clone 5' similar 1.5
103813 AI042582 Hs.181271 CGM20 protein 15
103855 W02363 Hs.302267 hypothetical protein FLJ10330 15
103886 AK001278 Hs.105737 hypothetical protein FLJ10416 similar to constitutive photomorphogenic protein 1 6^5
104052 NM_002407 Hs.97644 mammaglobin 2 29
104079 AA251242 Hs.103238 ESTs 14
104174 AA478984 Hs.6451 PRO0659 protein 56
104227 AB002343 Hs.98938 protocadherin alpha 9 16
104275 AI751970 Hs.101067 GCN5 (general control of amino-acid synthesis, yeast, homolog)-like 2 54
104325 BE379766 Hs.150675 polymerase (RNA) II (DNA directed) polypeptide K (7.0kD) 63
104370 AA324597 Hs.21651 Homo sapiens cDNA FLJ12900 fis, clone NT2RP2004321 16
104423 R83113 Hs.1432 protein kinase C substrate 80K-H 52
104482 AB037762 Hs.44268 myelin gene expression factor 2 12
104532 AI498763 Hs.203013 hypothetical protein FLJ12748 2*1
104563 AL117403 Hs.306189 DKFZP434F1735 protein \2
104667 AI239923 Hs.30098 ESTs 13
104757 AI694413 Hs.332649 olfactory receptor, family 2, subfamily 1, member 6 Z3
104804 AI858702 Hs.31803 ESTs, Weakly similar to N-WASP [H.sapiens] U
104806 AB023175 Hs.22982 KIAA0958 protein 2.3
104827 AW052006 Hs.8551 PRP4/STK/WD splicing factor 109
104846 AI250789 Hs.32478 ESTs 56
104854 AA041276 Hs.154729 3-phosphoinositide dependent protein kinase-1 123
104867 AA278898 Hs.225979 hypothetical protein similar to small G proteins, especially RAP-2A 2 0

104896 AWI5318 f 23165 ****** ^ DKFZp56402364 {ff0m DK FZp56402364) 1.3 

104909 AW408164 Hs^49184 transcription factor 19 (SC1)

104916 AW958157 Hs.155489 NS1-associated protein 1 17
104919 AA026880 Hs.25252 prolactin receptor \ a

104930 AF043467 Hs.32893 neurexophilin 2 2 2

104973 NM_015310Hs.6763 KIAA0942 protein 5 0

104974 Y12059 Hs.278675 bromodomain-containing 4 ™ A

104975 AL136877 Hs.50758 SMC4 (structural maintenance of chromoso 2*4
104975 AL136877 Hs.50758 SMC4 (structural maintenance of chromoso 2*3
!2JS ^ll 92 ! 8 Hs - 19322 Homo sapiens, Similar to RIKEN cDNA 2010317E24 gene, clone IMAGE:3502019, mRNA, partial cds 7.2
104979 AA937934 Hs.321062 ESTs 4 3
104994 AI499930 Hs.334885 mitochondrial GTP binding protein 35
105009 BE379584 Hs.34789 doljchyMiptaph^ 5 ' 5
105012 AF098158 Hs.9329 chromosome 20 open reading frame 1 33
105028 AI050715 Hs.2331 E2F transcription factor 5, p130-binding 2*2
105032 AA127818 gb:zl12a01.s1 Soares_pregnant_uterus_NbHPU Homo sapiens cDNA clone IMAGE:501674 3' 68

105039 AA907305 Hs.36475 ESTs 95

105041 AB037716 Hs.26204 KIAA1295 protein 2*9

105045 BE242899 Hs.129951 speckle-type POZ protein 38

105079 AA151342 Hs.12677 CGM47 protein 95

105087 AA147884 Hs.9812 Homo sapiens cDNA FLJ14388 fis, clone HEMBA1002716 5 6

105088 H58589 Hs.35156 Homo sapiens cDNA FLJ11027 fis, clone PLACE1004114 22
105095 Z78407 Hs.27023 vesicle transport-related protein 22
105110 BE367350 Hs.33122 KIAA1160 protein 16
105126 AW975433 Hs.36288 ESTs 63
105127 AA045648 Hs.301957 nudix (nucleoside diphosphate linked moiety X)-type motif 5 21
105141 AA164687 Hs.177576 mannosyl (alpha-1,3-)-glycoprotein beta-1,4-N-acetylglucosaminyltransferase, isoenzyme A 27
105158 AW976357 Hs.234545 hypothetical protein NUF2R ig
105169 BE245294 Hs.180789 S164 protein \y
105186 AA191512 Hs.28005 Homo sapiens cDNA FLJ11309 fis, clone PLACE1010076 4 8
105254 AA071276 Hs.19469 KIAA0859 protein i'g
105281 AA263143 Hs.24596 RAD51-interacting protein 2.8
105288 N99673 Hs.3585 ESTs, Weakly similar to AF126743 1 DNAJ domain-containing protein MCJ [H.sapiens] 1.9
105302 AA700122 Hs.3355 sentrin-specific protease 80
105331 AW270037 Hs.179507 KIAA0779 protein 18
105359 NM_016015 Hs.8054 CGI-68 protein 82
105366 BE264645 Hs.282093 hypothetical protein FLJ21918 50
105373 AW887701 Hs.32356 hypothetical protein FLJ20628 25
105374 BE242803 Hs.262823 hypothetical protein FLJ10326 22
105387 AW592146 Hs.108636 membrane protein CH1 23
105393 AF167570 Hs.256583 interleukin enhancer binding factor 3, 90kD 54

105399 BE386877 Hs.334811 Npw38-binding protein NpwBP 16

105400 AF198620 Hs.65648 RNA binding motif protein 8A \fi

gb:zs12g10.s1 NCI_CGAP_GCB1 Homo sapiens cDNA clone IMAGE:685026 3', mRNA sequence. 5.0

105507 BE268348 Hs.226318 CCR4-NOT transcription complex, subunit 7 16

105529 AA113449 Hs.32471 hypothetical protein FLJ20364 13

105530 AB023179 Hs.9059 KIAA0962 protein 34

105547 AA262640 Hs.27445 unknown 93

105564 BE616694 Hs.288042 hypothetical protein FLJ14299 u

105596 AA579535 Hs.18490 hypothetical protein FLJ20452 10 9

105597 AF054284 Hs.334826 splicing factor 3b, subunit 1, 155kD 19
105608 AI808201 Hs.287863 hypothetical protein FLJ12475 17
105610 AA280072 Hs.99872 fetal Alzheimer antigen 14
105617 AK000892 Hs.4069 glucocorticoid modulatory element binding protein 1 17
105620 AW302245 Hs.181390 casein kinase 1, gamma 2 55
105658 AA985190 Hs.246875 hypothetical protein FLJ20059 94
105697 AW499988 Hs.27801 zinc finger protein 278 zO
105708 R26944 Hs.180777 Homo sapiens mRNA; cDNA DKFZp564M0264 (from clone DKFZp564M0264) 1 7
!S2 sema domain, immunoglobulin domain (Ig), transmembrane domain (TM) and short 2.6
105746 AW151952 Hs.46679 hypothetical protein FLJ20739 1i5
105759 AI123118 Hs.15159 chemokine-like factor, alternatively spliced 13
105771 AI267720 Hs.153221 synovial sarcoma, translocated to X chromosome 1jG
105820 AA741336 Hs.152108 transcriptional unit N143 22
105826 AA478756 Hs.194477 E3 ubiquitin ligase SMURF2 13
105856 AI262105 Hs.12653 ESTs 2.4
105858 AF151066 Hs.281428 hypothetical protein 2 '\

105875 AK001708 Hs.32271 hypothetical protein FLJ10846 ™

105930 AF016371 Hs.9880 peptidyl prolyl isomerase H (cyclophilin H) J*

106000 AW194426 Hs.20726 ESTs I*'

106011 AW081202 Hs.12284 Homo sapiens, clone IMAGE:2989556, mRNA, partial cds 2.B

106017 AA477956 Hs.26268 ESTs "

S SSSS IKS E?!^Sto^^tfrrto^D ALU8_HUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 1L6
106094 AA533491 Hs.23317 hypothetical protein FLJ14681

106271 M251393 Hs.289052 Homo sapiens, Similar to RIKEN cDNA 5430429M05 gene, clone MGC:13155, mRNA, complete cds 10.8
106288 AB037742 Hs.24336 KIAA1321 protein

106300 Y10043 Hs.19114 high-mobility group (nonhistone chromosomal) protein 4 o-o

106333 AL043114 Hs.22410 ESTs, Weakly similar to A54849 collagen alpha 1(VII) chain precursor [H.sapiens] 5.4

106350 AK001404 Hs.194698 cyclin B2 \l

106359 AW390282 Hs.31130 transmembrane 7 superfamily member 2 Jjj

106381 AB040916 Hs.24106 KIAA1483 protein ? ?

106389 AW748420 Hs.6236 Homo sapiens cDNA: FLJ21487 fis, clone COL05419 ^
106457 AF119256 Hs.27801 zinc finger protein 278

106470 D63078 Hs.186180 Homo sapiens cDNA: FLJ23038 fis, clone LNG02039 2.3

106531 AA454036 Hs.8832 ESTs \\ 

106586 AA243837 Hs.57787 ESTs 04 

106589 AK000933 Hs.28661 Homo sapiens cDNA FLJ10071 fis, clone HEMBA1001702 ™

106610 AA458882 Hs.79732 fibulin 1 l Jt

106624 NM_003595 Hs.26350 tyrosylprotein sulfotransferase 2 cejlrtn< „.

106650 AI049951 Hs.22370 Homo sapiens mRNA; cDNA DKFZp564O0122 (from clone DKFZp564O0122) 1.6

106669 AV657117 Hs.184164 ESTs, Moderately similar to S65657 alpha-1C-adrenergic receptor splice form 2 [H.sapiens] 1.4
106713 BE614802 Hs.184352 hypothetical protein FLJ12549

106717 AA600357 Hs.239489 TIA1 cytotoxic granule-associated RNA-binding protein y

106723 BE388094 Hs.21857 ESTs Vl
106795 AF174487 Hs.293753 Bcl-2-related ovarian killer protein-like

106829 AW959893 Hs.27099 hypothetical protein FLJ23293 similar to ARL-6 interacting protein-2 Vof
106831 BE564871 Hs.29463 centrin, EF-hand protein, 3 (CDC31 yeast homolog)

106846 AB037744 Hs.34892 KIAA1323 protein l ' 1

106852 AF151031 Hs.300631 hypothetical protein

106873 N49809 Hs.11197 Homo sapiens, clone IMAGE:3343149, mRNA, partial cds 1W»

106886 W79171 Hs.9567 GL002 protein \~

106906 AA861271 Hs.222024 transcription factor BMAL2 "

106920 AK001838 Hs.296323 serum/glucocorticoid regulated kinase *J

106945 AK000511 Hs.6294 hypothetical protein DKFZp434L1435 similar to valyl tRNA synthetase b.e

106973 BE156256 Hs.11923 hypothetical protein JJ

106977 AL043152 Hs.50421 KIAA0203 gene product

106978 AW631480 Hs.8688 ESTs Vt
107004 AA146872 Hs.300700 hypothetical protein FLJ20727 \ °

107029 AF264750 Hs.288971 myeloid/lymphoid or mixed-lineage leukemia 3 J*

107071 AW385224 Hs.35198 ectonucleotkle pyrojihosprra^^ J-' 

107113 AK000733 Hs.23900 GTPase activating protein " 
107125 AK000512 Hs.69388 hypothetical protein FLJ20505

107136 AV661958 Hs.8207 GK001 protein ™

107136 AV661958 Hs.8207 GK001 protein *J
107146 AK001455 Hs.5198 Down syndrome critical region gene 2 

107151 AW378065 Hs.8687 ESTs ?? s 

107155 AW391927 Hs.7946 KIAA1288 protein ?T

107174 BE122762 Hs.25338 ESTs ~<

107197 W15477 Hs.64639 glioma pathogenesis-related protein w

107221 AW88M11 Hs.81915 leukemfe^^ 1™

107243 BE219716 Hs.34727 ESTs, Moderately similar to 138759 zinc finger/leucine zipper protein [H.sapiens] 7.4

107248 AW263124 Hs.315111 nuclear receptor co-repressor/HDAC3 complex subunit « 
107263 D60341 Hs.21198 translocase of outer mitochondrial membrane 70 (yeasQ ^"wtog ^ ^Kn-A ^im ati^ki o c 

107265 BE379594 Hs.49136 ESTs, Moderately similar to ALU7_HUMAN ALU SUBFAMILY

107298 N95657 Hs.6820 ESTs, Moderately similar to YOJ1_CAEEL HYPOTHETICAL 63.5 KD PROTEIN ZK353.1 IN 2.5

107298 N95657 Hs.6820 ESTs, Moderately similar to YOJ1_CAEEL H y

107299 BE277457 Hs.30661 hypothetical protein MGC4606 „'
107316 T63174 Hs.193700 Homo sapiens mRNA; cDNA DKFZp586I0324 (from clone DKFZp586I0324) 2.0

107354 NM_006299 Hs.96448 zinc finger protein 193 "J

107392 AW299900 Hs.267632 TATA element modulatory factor 1 \i

107481 AA307703 Hs.279766 kinesin family member 4A 10
107529 BE515065 Hs.296585 nucleolar protein (KKE/D repeat) 30
107554 AA001385 Hs.59844 ESTs j 3

107681 BE379594 Hs.49136 ESTs, Moderately similar to ALU7_HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 22
107772 AA018587 Hs.303055 ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 21
107859 AW732573 Hs.47584 potassium voltage-gated channel, delayed-rectifier, subfamily S, member 3 8 4

107901 L42612 Hs.335952 keratin 6B Js
107901 L42612 Hs.335952 keratin 6B ft
107922 BE153855 Hs.61460 Ig superfamily receptor LNIR £ 2
107974 AW956103 Hs.81712 pyruvate dehydrogenase kinase, isoenzyme 1 67
108040 AL121031 Hs.159971 SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily b, member 1 1.5
108230 AA054224 Hs.59847 ESTs <o
108274 AF129535 Hs.272027 F-box only protein 5 71
108296 N31256 Hs.161623 ESTs '
108496 AA083069 Hs.339659 ESTs 35
108607 BE300380 Hs.69476 Homo sapiens cDNA FLJ12758 fis, clone NT2RP2001328 3*4
108621 AA101809 Hs.182685 ESTs Je
108634 AW022410 Hs.69507 ESTs « 7

108647 BE546947 Hs.44276 homeo box C10 go

108695 AB029000 Hs.70823 KIAA1077 protein 70
108717 AA122393 Hs.70811 hypothetical protein FLJ20516 1<3

108740 AKJ89575 Hs.9071 progesterone membrane binding protein 2.7
108828 AK001693 Hs.273344 DKFZP564O0463 protein i fl
108859 AL121500 Hs.178904 ESTs \i

108872 H06720 Hs.111680 endosulfine alpha 21

108891 AI801235 Hs.48480 ESTs 53

108894 AK001431 Hs.5105 hypothetical protein FLJ10569 4 0

108955 AA149754 Hs.195155 Homo sapiens amino acid transport system N2 (SN2) mRNA, complete cds 56

108982 AA151708 Hs.171980 homeo box (expressed in ES cells) 1 i'r

108987 AA152178 Hs.23467 hypothetical protein FLJ10633 6 2

109002 AB028987 Hs.72134 KIAA1064 protein < 7

109011 AA156542 Hs.72127 ESTs {\

gb:zo35d07.s1 Stratagene colon (937204) Homo sapiens cDNA clone 3' similar to contains Alu repetitive 5.3

109068 AA164293 Hs.72545 ESTs 2.9

109101 AW608930 Hs.52184 hypothetical protein FLJ20618 1 6

109112 AW419196 Hs.257924 hypothetical protein FLJ13782 3*2

109124 AK000684 Hs.183887 hypothetical protein FLJ22104 17

109139 AJ132592 Hs.59757 zinc finger protein 281 26

109166 AA219691 Hs.73625 RAB6 interacting, kinesin-like (rabkinesin 6) 19

109198 BE566742 Hs.58169 highly expressed in cancer, rich in leucine heptad repeats 20

109213 NM_016603 Hs.82035 potential nuclear protein C5ORF5; GAP-like protein 53

109220 AW958181 Hs.189998 ESTs 57

109233 AU077281 Hs.170285 nucleoporin 214kD (CAIN) 53

109270 N99673 Hs.3585 ESTs, Weakly similar to AF126743 1 DNAJ domain-containing protein MCJ [H.sapiens] 14

109273 AA375752 Hs.82719 Homo sapiens mRNA; cDNA DKFZp586F1822 (from clone DKFZp586F1822) 29

109313 AF153201 Hs.86276 C2H2 (Kruppel-type) zinc finger protein 13

109341 AA213506 Hs.115099 EST 29

109391 AL096858 Hs.184245 KIAA0929 protein Msx2 interacting nuclear target (MINT) homolog 1.5

109420 H83603 Hs.40408 homeo box C9 22

109426 N30531 Hs.42215 protein phosphatase 1, regulatory subunit 6 30

109429 AI160029 Hs.61438 ESTs ^

109445 AA232103 Hs.189915 ESTs It

109450 AB032969 Hs.173042 KIAA1143 protein 37

109468 NM_015310 Hs.6763 KIAA0942 protein 30

109478 AW074143 Hs.87134 ESTs 20

109570 L40027 Hs.118890 glycogen synthase kinase 3 alpha 2.1

109662 F02614 Hs.27319 ESTs 14

109825 R71264 Hs.16798 ESTs Jvj

110039 H11938 Hs.21907 histone acetyltransferase 20

110056 AA503041 Hs.279009 matrix Gla protein S'J

110085 AA603840 Hs.29956 KIAA0460 protein 17

110110 T07353 Hs.7948 ESTs £9

!E2 EISLmS^SJ? ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 1.7

110154 NM_014521 Hs.17667 SH3-domain binding protein 4 42
110240 AI668594 Hs.176588 ESTs, Weakly similar to CP4Y_HUMAN CYTOCHROME P450 4A11 PRECURSOR [H.sapiens] 42
110242 N41744 Hs.19978 CGW0 protein 1 P j

110259 H28428 Hs.32406 ESTs, Weakly similar to 138022 hypothetical protein [H.sapiens] Z2

110312 BE256986 Hs.11896 hypothetical protein FLJ12089 ^
110330
110501 
110504 
110525 
110568 



Hs.16621 

Hs.210859 
Hs.37430 



110705 
110742 
110761 
110762 
110765 
110769 
110799 
110805 
110813 
110820 
110840 
110844 
110854 
110856 
110885 
110897 
110915 
110918 
110958 
110963 
110981 
110984 
111125 
111132 
111164 
111172 
111174 
111179 
111184 
111184 
111189 
111216 
111221 
111223 
111239 
111285 
111299 
111312 
111318 
111337 
111352 
111370 
111384 
111389 
111391 
111392 
111452 
111486 
111549 
111585 
111627 
111870 
111937 
111944 
111987 
112134 
112244 
112388 
112456 
112464 
112506 



AI288666 
H55748 
H55915 
H57330 

AK001160 Hs.5999 
T97586 Hs.18090 
AB007902 Hs.32168 
AW190338 Hs.28029 
AL138077 Hs.16157 
BE044245 Hs.30011 
AK000322 Hs.18457 
BE000831 Hs.23837 
AI089660 Hs.323401 
T25829 Hs.24048 
AA767373 Hs.35669 
R33261 Hs.6614 
N31598 Hs.12727 
AI740792 Hs.167531 
BE612992 Hs.27931 
AA992380 

BE384447 Hs.16034 
AL117430 Hs.6880-
BE092285 Hs.29724
H04360 Hs.24283
NM_005664 Hs.24587
AK002180 Hs.11449 
AK001980 Hs.24284 
AW613287 Hs.80120 
N63823 Hs.269115 
AB037807 Hs.83293 
N46180 Hs.122489 
R67419 Hs.21851 
AL050166 Hs.26295 
AK000136 Hs.10760 
AI815486 Hs.243901 
AI815486 Hs.243901 
N67603 Hs.272130 
AW139408 Hs.152940 
AB037782 Hs.15119 
AA852773 Hs.334838 
N90956 Hs.17230 
AA778711 Hs.4310 
AB033091 Hs.74313 
Hs.34504 
Hs.334728 



DKFZP434I116 protein

gb:yq94a01.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE:203400 3'

hypothetical protein FLJ11016

EST

hypothetical protein FLJ10298
ESTs

KIAA0442 protein

hypothetical protein MGC11256

hypothetical protein RJ127G7

hypothetical protein MGC2963

hypothetical protein FLJ20315

Homo sapiens cDNA FLJ11812 fis, clone HEMBA1006364

dpy-30-like protein

FK506 binding protein precursor

ESTs, Moderately similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION
ESTs, Weakly similar to A43932 mucin 2 precursor, intestinal [H.sapiens]
hypothetical protein FLJ21610
methylcrotonoyl-Coenzyme A carboxylase 2 (beta) 



AI523913 
T99755 

AA837396 Hs.263925 

H58589 Hs.35156 



AI47865B 
N94606 



Hs.94631 
Hs.288969 



AK000987 Hs.169111 
NM_003896 Hs.225939
W46342 Hs.325081 



R02354 
AI051194 
W90638 
R10720 
R52656 



Hs.15999 
Hs.227978 
Hs.20321 
Hs.20670 
Hs.21691 



AB037834 Hs.18685 
BE298665 Hs.14846 
AW083791 Hs.21263 
NM_015310 Hs.6763
R41823 Hs.7413 
AB029000 Hs.70823 
R46071 Hs.301693 
NM_016248 Hs.232076
AW007287 Hs.28538 
AI742756 Hs.26079 



■ 7r ...v«». r wu. I ii hwiwwi guiiiKu luyiuwacuiuim-piAjspnaiB r^ceiyuransiefase 
gb:ot37g06.s1 Soares_testis_NHT Homo sapiens cDNA clone 3' similar to contains element
hypothetical protein MGC13186
DKFZP434D156 protein
hypothetical protein FLJ13187

ESTs, Moderately similar to reduced expression in cancer [H.sapiens]
signal transduction protein (SH3 containing)
DKFZP5640123 protein

ADP-ribosyltransferase (NAD+; poly(ADP-ribose) polymerase)-like 2
UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase 1 (GalNAc-T1)
ESTs, Moderately similar to Z195_HUMAN ZINC FINGER PROTEIN 195 [H.sapiens]

hypothetical protein

Homo sapiens cDNA FLJ13289 fis, clone OVARC1001170
Homo sapiens cDNA FLJ12900 fis, clone NT2RP2004321
Homo sapiens mRNA; cDNA DKFZp586D1122 (from clone DKFZp586D1122)
asporin (LRR class 1)

Homo sapiens cDNA FLJ20738 fis, clone HEP08257
Homo sapiens cDNA FLJ20738 fis, clone HE
ESTs, Weakly similar to S65824 reverse transcriptase homolog [H.sapiens]
ESTs 

KIAA1361 protein 
KIAA1866 protein 
hypothetical protein FLJ22087
eukaryotic translation initiation factor 1A
KIAA1265 protein
ESTs
ESTs

LIS1-interacting protein NUDE1, rat homolog
Homo sapiens cDNA FLJ11027 fis, clone PLACE1004114
brefeldin A-inhibited guanine nucleotide-exchange protein 1
HSCARG protein
oxidation resistance 1

sialyltransferase 9 (CMP-NeuAc:lactosylceramide alpha-2,3-sialyltransferase; GM3 synthase)
Homo sapiens, clone IMAGE:3659680, mRNA, partial cds
ESTs
EST

ESTs, Moderately similar to ZRF1_HUMAN ZUOTIN RELATED FACTOR-1 (M-PHASE
EST
ESTs

Homo sapiens mRNA for KIAA1413 protein, partial cds
Homo sapiens mRNA; cDNA DKFZp564D016 (from clone DKFZp564D016)
suppressor of potassium transport defect 3
KIAA0942 protein
ESTs; calsyntenin-2
KIAA1077 protein

Homo sapiens, clone IMAGE:3638994, mRNA, partial cds
A kinase (PRKA) anchor protein 11
Homo sapiens cDNA: FLJ21086 fis, clone CAS03272
ESTs 

120 



6.2 
6.1 
6.1 
6.3 
1.3 
1.8 
1.6 
7.6 
2.5 
9.3 
5.5 
2.1 
1.5 
6.6 
5.7 
3.4 
1.7 
1.7 
4.7 
2.3 
3.5 
2.2 
2.6 
1.9 
6.7 
2.0 
1.3 



1.8 



3.6 
2.1 
2.3 
3.7 
7.5 
7.1 
6.7 
3.3 
3.6 
1.5 
2.6 
4.6 
7.9 
6.9 
5.0 
3.8 
1.2 
5.1 
2.2 
2.8 
2.2 
2.1 
5.1
8.4 
27 
6.5 
1.4 
1.6 
1.6 
2.4 
10.6 
6.6 
5.1 
2.8 
14.6
9.0 
1.4 
1.4 
3.2
112513 R68425 Hs.13809 hypothetical protein FLJ10548 on
112752 AK001635 Hs.14838 hypothetical protein FLJ10773 A
S S ^ 0M P !ens ^ AfwFUOT WpiDtem (P a^cds Jg

112938 AW970826 Hs.6185 KIAA1557 protein H
112958 R61388 Hs.6724 ESTs 11
112966 Z44718 Hs.102548 glucocorticoid receptor DNA binding factor 1 l A

112978 AK000272 Hs.7099 hypothetical protein FLJ20265 *A

10 55™?S ^ 7155 EST ^^ era tetysirnaarto2115357ATYWprotein(M.mu^^ 56

112996 BE276112 Hs.7165 zinc finger protein 259 ] 5*5

113047 AI571940 Hs.7549 ESTs * ° 

ll^S 1 *!^ 580 Homo sapiens mRNA for KIM1729 protein, partial cds 24 
113089 T40707 Hs.270862 ESTs 7? 

15 IS S 9 S? b ?JfS- S J ^ tasene fetal a P teen C937205) Homo sapiens cDNA clone IMAG£:74G68 3", 1.7 

15 IK K«0 H.11449 S » 

l!S SS K!«Sb ^[f (PeP^rofyl ds/trans isomerase) Nl^teracflng, 4 (parvufin) g 

113499 A1467908 Hs.8882 ESTs H 
20 113547 H59588 Hs.15233 ESTs ™ 

113554 AW503990 Hs.142442 HP1-8P74 3g 

^fl 3887 Hs>188173 Homo sapiens cDNA FU12187 fls t done MAMMA1000831 u 

1lS 3Ei« HS184411 figf^ 4>4 

25 SB SBf K5> 

113783 AL359588 Hs.7041 hypothetical protein DKFZp762B226 ] 7
113791 AI269095 Hs.135578 chitobiase, di-N-acetyl- ,

113308 W44735 Hs.9286 Homo sapiens cDNA: FLJ21278 fis, clone COL01832 H

30 1221 5^ 7480 S 86994 *"» «Pfens cDNA: FU22044 fls, done HBP09141 H
113817 H13325 Hs.332795 hypothetical protein DKFZp761O17121
113826 AW378212 Hs.24809 hypothetical protein FLJ10826

)22£ ISE {I s - 6059 EGF-confainlng fibufin-fika extracsUularniaWx protein 2 U3 

H ™n S?« y-SSi P^ OTe 0»osonie,macropah)26Ssubtmit,non-ATPase,11 27 

40 111™ Hs - 120969 Homo sapiens cDNA FU11562 fe, clone HEMBA1003197 54 

S S S 3 ^ 78 ^ e ^PWtehaj21939 5 imilarto5^acytidinehducedgene2 94 

114060 AB029551 Hs.7910 RING1 and YY1 binding protein «'«

114196 AF017445 Hs.150926 fucose-1-phosphate guanylyltransferase Yk

114226 AB028968 Hs.7989 NMIMSjmtti j| 

45 ffiESKST SS^5SS W,ar * P"W" I* (dona pHZJO). done Mec:10647, mRNA. conpl^ oda 2.3 
114275 AW515443 Hs.306117 KIAA0306 protein i 
114292 AI815395 Hs.184641 fatty acid desaturase 2 „

114309 AA332453 Hs.20824 CGI-85 protein ~ .

<?n «££ ^™?° ?-^ 748 IF* Weakly slmilartoA28996 prdlne-rich protein M14 precursor- mouse IMjnusculus] U 

50 « 1 !UE S? 76 ?\P 305 H ™ s ^«^cDMDKI^B0425(ta^ 1 

!1S S47 Hs.40109* S^g^^*^^^^^^ I 

S3! US S5S5 K h 1S'^ towra ^ 11imis ^^ , ^ GE ^ 1 ^^^ 

IS jSiST !S2£ BSB83** K,MB * I 

*n ^ll 9 £ 5 , i1 S *]^ 3 ^romosorne maintenance deficient (S. cerevisiae) 4 A 

114774 AV656017 Hs.184325 CGI-76 protein If

114798 AA159181 Hs.54900 serologically defined colon cancer antigen 1 11

114860 AL157545 Hs.42179 bromodomain and PHD finger containing 3 A

114895 AA236177 HsJ6591 KIAA0887 protein J ?

114896 BE539101 Hs.5324 hypothetical protein \\

"Si Hs.188717 | b s ^ Sl ^ 0 ^^^ 1.5 

114938 AA242834 Hs.58384 ESTs J2 

2.9 
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114965 A1733881 Hs.72472 BMP-R1B 

115023 AF102546 Hs.63931 dachshund ProsophUa) homolog 

115038 AA252360 Hs.87968 toltfke receptor 9 

115061 AI751438 Hs.41271 Homo sapiens mRNA full length insert cDNA clone EUROIMAGE 1913076

115062 AA253314 Hs.154103 LIM protein (similar to rat protein kina
115117 AI670847 Hs.5324 hypothetical protein

115121 AI634549 Hs.88155 ESTs 

115206 AW183695 Hs.186572 ESTs 

115221 AW365434 Hs.73741 hypothetical protein FLJ10116

115239 BE251328 Hs.73291 hypothetical protein FLJ10881

115242 AI368236 Hs.283732 ESTs, Moderately similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE

115278 AK002163 Hs.301724 hypothetical protein FLJ11301

115285 AW972872 Hs.293736 ESTs 

115291 BE545072 Hs.122579 hypothetical protein FLJ10461

115400 AI215069 Hs.89113 ESTs 

115468 AA314349 Hs.48499 tumor antigen SLP4p 

115471 AK001376 Hs.59346 hypothetical protein FLJ10514

115479 AW301608 Hs.278188 ESTs, Moderately similar to I54374 gene NF2 protein [H.sapiens]

115496 AW247593 Hs.71819 eukaryotic translation initiation factor 4E binding protein 1

115500 Y14443 Hs.88219 zinc finger protein 200 

115553 AJ275986 HsJ1414 transcription factor (SMIF gene) 

115581 AI540842 Hs.61082 ESTs 

115587 BE081342 Hs.283037 HSPC039 protein 

115590 AA399477 Hs.67896 7-60 protein 

115646 N36110 Hs.305971 solute carrier family 2 (facilitated glucose transporter), member 10 

115652 BE093589 Hs.38178 hypothetical protein FLJ23468

115655 AL048269 Hs.288544 Homo sapiens, clone MGC:16063, mRNA, complete cds

115663 AI138785 Hs.40507 ESTs 

115676 AA953006 Hs.88143 ESTs 

115690 AA625132 Hs.44159 hypothetical protein FLJ21615

l!c7?c IFf?" 3 cadherin, EGF LAG seven-pass G-type receptor 3, flamingo (Drosophila) homolog

115715 BE395161 Hs.1390 proteasome (prosome, macropain) subunit, beta type, 2

115734 AI950339 Hs.40782 ESTs 

115811 NM_015434 Hs.48604 DKFZP434B168 protein

115823 AI732742 Hs.87440 ESTs 

115837 AI675217 Hs.42761 ESTs 

115844 AI373062 Hs.332938 hypothetical protein MGC5370 

115866 AW062629 Hs.52081 KIAA0867 protein

115875 N55669 Hs.333823 mitochondrial ribosomal protein L13 

115941 AI867451 Hs.46679 hypothetical protein FLJ20739

115968 AB037753 Hs.62767 KIAA1332 protein

116003 BE275469 Hs.66493 Down syndrome critical region gene 5 

116011 AL359053 Hs.57664 Homo sapiens mRNA full length insert cDNA clone EUROIMAGE 2005735
116108 AA770688 Hs.28777 H2A histone family, member L
116134 BE243834 Hs.50441 CGI-04 protein 

116195 S 9 113 Hs*72402 j[f£^^ s ™ arto ^^ 

116238 AV660717 Hs.47144 DKFZP586N0819 protein

116246 AF265555 Hs.250646 baculoviral IAP repeat-containing 6

116262 AI936442 Hs.59838 hypothetical protein FLJ10808

116298 AI955411 Hs.94109 Homo sapiens cDNA FLJ13634 fis, clone PLACE1011133

116318 AF097645 Hs.58570 deleted in cancer 1; RNA helicase HDB/DICE1

116325 AI472106 Hs.49303 Homo sapiens cDNA FLJ11663 fis, clone HEMBA1004631

116336 AL133033 Hs.4084 KIAA1025 protein 

116339 AK000290 Hs.44033 dipeptidyl peptidase 8 

116350 AA497129 Hs.184771 nuclear factor I/C (CCAAT-binding transcription factor)

116358 AI149586 Hs.38125 interferon-induced protein 75, 52kD

116365 N50174 Hs.46765 ESTs 

116368 N90466 Hs.71109 KIAA1229 protein

116417 AW499664 Hs.12484 Human clone 23826 mRNA sequence

116436 AA161411 Hs.58668 chromosome 21 open reading frame 57

116462 AF218313 Hs.236828 putative helicase RUVBL

116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 

116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4

116575 AA312572 Hs.6241 phosphoinositide-3-kinase, regulatory subunit, polypeptide 1 (p85 alpha)

116637 £{£01043 Hs.92033 integrin-linked kinase-associated serine/threonine phosphatase 2C

116640 X89984 Hs.211563 B-cell CLL/lymphoma 7A
116700 AI860202 Hs.317589 hypothetical protein MGC10765 1d

116705 AW074819 Hs.12313 hypothetical protein FLJ14566 34

116732 AW152225 Hs.165909 ESTs, Weakly similar to 136022 hypothetical protein [H.sapiens] 2 9

116921 AW068115 Hs.821 biglycan tl 

116926 H73608 Hs.290830 ESTs fj 

117034 U72209 Hs.180324 YY1-associated factor 2 i'l

117132 AI393666 Hs.42315 poinding protein 5 2 

117276 N71183 Hs.121606 Homo sapiens cDNA FLJ11971 fis, clone HEMBB1001208 1 5

!^S£ 9 ft™ Sapiens ^ f^ 10590 fe ' NT2RP2004392, weakly similar to MNN4 PROTEIN 2.0 

11/00/ AJQ41793 Hs.42502 ESTs 20 

117382 AH5M75 H* 40173 j^^ 9, H+ ^^^ so ^ al 2^1 

117412 N32536 Hs.42645 solute carrier family 16 (monocarboxylic acid transporters), m 14

117557 AF123050 Hs.44532 diubiquitin , ;

117588 N34895 Hs.44648 ESTs * A 

117745 BE294925 Hs.46680 CGI-12 protein 3 0

117754 AA121673 Hs.59757 zinc finger protein 281 19

117879 N54706 Hs.303025 chromosome 11 open reading frame 24 . 1*8

117881 AF161470 Hs.260622 butyrate-induced transcript 1 57

117904 BE540675 Hs.332938 hypothetical protein MGC5370 * 59

117911 AL137379 Hs.47125 hypothetical protein FLJ13912 J 7

117933 Y10518 Hs.116470 hypothetical protein FLJ20048 1 7

117983 AL110246 Hs.47367 KIAA1785 protein el

118078 N54321 Hs.47790 EST 11 

118301 AA453902 Hs.293264 ESTs 2 6 

118429 AA243332 Hs.74649 cytochrome c oxidase subunit Vic 25 

118472 AL157545 Hs.42179 bromodomain and PHD finger containing, 3 a\

118488 AJ277275 Hs.50102 rapa-2 (rapa gene) - 12

118509 N22617 Hs.43228 Homo sapiens cDNA FLJ11835 fis, clone HEMBA1006595 15

118528 AI949952 Hs.49397 ESTs 74 

118656 AJ458020 Hs.293287 ESTs 2*5 

118670 AA332845 Hs.152618 ESTs, Moderately similar to ZN91_HUMAN ZINC FINGER PROTEIN 91 [H.sapiens] 1 2

118698 AB033113 Hs.50187 KIAA1287 protein 21
ISS . gb:zq75g09.r1 Stratagene hNT neuron (937233) Homo sapiens cDNA clone IMAGE:647488 5* 5^2
ISS *??L Hs.206832 ESTs, Moderately similar to ALU8_HUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 1.4
AI663709 Hs.240722 ESTs, Moderately similar to ALU8_HUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 3.6

118986 AF148713 Hs.125830 bladder cancer overexpressed protein 4 a 

119206 W24781 Hs.293798 KIAA1710 protein t 7

119235 AW453069 Hs.3657 activity-dependent neuroprotective protein 2*2

119235 AW453069 Hs.3657 activity-dependent neuroprotective protein 16

119265 BE539706 Hs.285363 ESTs ■ 4 

119279 N57568 Hs.48028 EST nc< 

119298 NM_001241 Hs.155478 cyclin T2 72

119338 AM17240 Hs.320836 ESTs, Weakly similar to A47582 B-cell growth factor precursor [H.sapiens] 13

119349 T65004 Hs.163561 ESTs 04 

119403 AL117554 Hs.119908 nucleolar protein NOP5/NOP58 67 

119478 AI624342 Hs.170042 ESTs S i 

119486 AI796730 Hs.55513 ESTs 21 

119513 W37933 Empirically selected from AFFX single probeset 1*9 

119601 AK000155 Hs.91684 Homo sapiens mRNA; cDNA DKFZp667H03 (from clone DKFZp667l103) 37 

119602 AW675298 Hs.233694 hypothetical protein FLJ11350 ' on
119676 AA243837 Hs.57787 ESTs 7* 
119682 W61019 Hs.57811 ESTs A 
119774 AB032977 Hs.6298 KIAA1151 protein 71
119780 NM_016625 Hs.191381 hypothetical protein 31
119789 BE393948 Hs.50915 kallikrein 5 (KLK5; KLK-L2; stratum corneum tryptic enzyme) 9^
lln^a £55521 ^H 13 ^ST^ Weakly similar to IEFS_HUMAN TRANSFORMATION-SENSITIVE PROTEIN IEF SSP 3.6 
119818 AA130970 Hs.58382 hypothetical protein FLJ11101 05
JjjgE W81218 Hs.58608 Homo sapiens cDNA FLJ14206 fis, clone NT2RP3003157 2 7
llllf. Hs.119571 collagen, type III, alpha 1 (Ehlers-Danlos syndrome type IV, autosomal dominant) ^6
1 199oo AA703129 Hs.58963 ESTs «y 
120132 W57554 Hs.125019 lymphoid nuclear protein (LAF-4) mRNA To
^SS Hs.91668 Homo sapiens clone PP1498 unknown mRNA 457
120248 AI924294 Hs.173259 uncharacterized bone marrow protein BM033 * 19
120253 AA131376 Hs.326401 fibroblast growth factor 12B <wq 
120269 AW131940 Hs.104030 ESTs f* 
120274 AA177051 gb:nc02a02.s1 NCI_CGAP_Pr3 Homo sapiens cDNA clone IMAGE:194 similar to contains Alu 4.6

120280 AA190577 gb:zp52g02.s1 Stratagene HeLa cell s3 937216 Homo sapiens cDNA clone 3', mRNA sequence 2.0



120296 AW995911 Hs.299883 hypothetical protein FLJ23399 1.8

120297 AA191384 Hs.104072 ESTs, Weakly similar to Z195_HUMAN ZINC FINGER PROTEIN 195 [H.sapiens] 152
120324 AA195517 Hs.191643 ESTs 5.5

120325 AA195651 Hs.104106 ESTs 6.4 

120327 AK000292 Hs.278732 hypothetical protein FLJ20285 16.1

120336 N85765 Hs.181165 eukaryotic translation elongation factor 1 alpha 1 2.9

120342 AW450669 Hs.45068 hypothetical protein DKFZp434l1 43 5.7 

120345 AA210722 Hs.104158 ESTs 4.5

120349 AW969481 Hs.55189 hypothetical protein 16.8 

120352 R06859 Hs.193172 ESTs, Weakly similar to I38022 hypothetical protein [H.sapiens] 5.0

120356 AF000545 Hs.296433 putative purinergic receptor 28.1

120371 AA219305 Hs.104196 EST 12.4 

120382 AA228026 Hs.38774 ESTs 4.0

120383 AL109963 Hs.123122 FSH primary response (LRPR1, rat) homolog 1 9.7

120386 AW969665 Hs.154848 hypothetical protein DKFZp434D0127 32.6

120388 AA232874 Hs.104245 ESTs 3.1 

120389 AW967985 Hs.325572 ESTs, Moderately similar to ALU7_HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 217
120396 AA134006 Hs.79306 eukaryotic translation initiation factor 4E 12.5

120404 AB023230 Hs.96427 KIAA1013 protein 72

120418 AW966893 Hs.26613 Homo sapiens mRNA; cDNA DKFZp586F1323 (from clone DKFZp586F1323) 1 1.4 

120423 AA236453 Hs.18978 Homo sapiens cDNA: FLJ22822 fis, clone KAIA3968 19

120472 AI950087 gb:wq05c02.x1 NCI_CGAP_Kid12 19.4

120473 AA251973 Hs.269988 ESTs 5.4

120484 AA253170 Hs.96473 EST 10.4 

120504 AA256837 gb:zr84d10.s1 Soares_NhHMPu_S1 Homo sapiens cDNA clone IMAGE:682387 3', mRNA sequence. 3.9

120509 BE047718 Hs.96545 ESTs 9.4 

120520 AA258601 Hs.161731 EST 14 

120535 BE350244 Hs.96547 ESTs 2.5

120551 AA279160 Hs.111407 Homo sapiens, clone IMAGED 3029, mRNA, partial cds 5.2 

120570 AA280679 Hs.271445 ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 14.4

120582 BE244830 Hs.284228 ZNF135-like protein 102

120590 AW372799 Hs.125790 leucine-rich repeat-containing 2 2.1

120596 AA282074 Hs.237323 N-acetylglucosamine-phosphate mutase 7.5

120619 AW965339 Hs.111471 ESTs 2.5 

120624 AW407987 Hs.173518 M-phase phosphoprotein homolog 52.0

120639 AA286942 gb:zs56K)5.s1 NCI_CGAP_GCB1 Homo sapiens cDNA clone IMAGE:701505 3' similar to contains Alu 2.4

120648 AA287095 Hs.140309 Homo sapiens, clone IMAGE:3677194, mRNA, partial cds 5.0

120653 AW063659 Hs.191649 ESTs 22

120668 AW969638 Hs.112318 6.2 kd protein 22

120669 BE536739 Hs.109909 ESTs 1.9 
120695 AA976503 gbxxj30a04.s1 NCI_CGAP_GC4 Homo sapiens cDNA clone 3' similar to contains PTR7.M PTR7 46.8
120698 AI821539 Hs.97249 ESTs 2.5 

120713 AW449855 Hs.96557 Homo sapiens cDNA FLJ12727 fis, clone NT2RP2000027 5.9

120718 AA292747 Hs.97296 ESTs 2.9 

120750 AI191410 Hs.96693 ESTs, Moderately similar to 2109260A B cell growth factor [H.sapiens] 7.0

120774 AI608909 Hs.193985 ESTs 7.8 

120807 AA346385 Hs.30002 SH3-containing protein SH3GLB2; KIAA1848 protein 6.8

120809 AA346495 gb:EST52657 Fetal heart II Homo sapiens cDNA 3' end similar to EST containing O family repeat, 4.4

120938 AA386260 Hs.104632 EST 4.4 

120977 AA398155 Hs.97600 ESTs 4.4 

120984 BE262951 Hs.99052 ESTs 5.6

120985 AI219896 Hs.97592 ESTs 1.2
121011 AA398360 Hs.97608 EST 3.1

121026 AM39713 Hs.165295 ESTs 35 

121081 AA398721 Hs.186749 ESTs, Highly similar to I37550 mismatch repair protein MSH2 [H.sapiens] 5.4

121133 AA363307 Hs.97032 ESTs 3.7 

121176 AL121523 Hs.97774 ESTs 1.7 

121223 AI002110 Hs.97169 ESTs, Weakly similar to (U667H1 2^.1 [H.sapiens] 2.9

121320 AA403008 Hs.301927 c6.1A 1.9

121340 AW956981 Hs.97910 Homo sapiens cDNA FLJ13383 fis, clone PLACE1001024 3.5

121408 AA406137 Hs.98019 EST 6.0 

121439 AA410190 Hs.98076 ESTs, Weakly similar to A47582 B-cell growth factor precursor [H.sapiens] 7.4

121450 AA406430 Hs.105362 Homo sapiens, clone MGC:18257, mRNA, complete cds 6.9

121452 AW971063 Hs.292882 ESTs 1.8

121455 H58306 Hs.15165 retinoic acid induced 14 105
121457 W07404 Hs.144502 hypothetical protein FLJ22055 34
121496 AA442224 Hs.97900 ESTs 144 
121505 AA494172 Hs.194417 ESTs \ 3 \ 
121508 AA402515 Hs.97887 ESTs 280 
121513 AA416653 Hs.181510 ESTs 6 2

121549 AA412477 Hs 98142 Isf 69 * 02 * 51 Soares - tesBs - NHT Homo sa P tens cDNA done IMAGE727563 3 1 , mRNA sequence. Z6 

121558 AA412497 gb:*t95g12.s1 Soares_testis_NHT Homo sapiens cDNA clone IMAGE:730150 3' similar to contains 2.8

121577 AA411970 Hs.98096 EST 35
121581 AA416568 gb:zu05c10.s1 Soares_testis_NHT Homo sapiens cDNA clone 3', mRNA sequence 61

121589 AO001528 Hs.89718 spermine synthase 30 
121594 AA626010 Hs.98247 ESTs 2*0 

121622 M416931 Hs.126065 ESTs \l 

121655 AA421537 Hs.178072 Homo sapiens mRNA; cDNA DKFZp434B1023 (from clone DKFZp434B1023) 7.8

121682 AA418160 Hs.86043 Homo sapiens cDNA FLJ13558 fis, clone PLACE1007743 2 0

121690 AV660305 Hs.110286 ESTs 47
121706 U55184 Hs.154145 hypothetical protein FLJ11585 ^7

121714 AA419225 Hs.98269 Homo sapiens cDNA FLJ11953 fis, clone HEMBB1000883 81

121729 AI949597 Hs.98325 ESTs !o 

121731 AA421041 Hs.180744 ESTs Jn

121744 AA398784 Hs.97514 ESTs f[ 

121748 BE536911 Hs.234545 hypothetical protein NUF2R ig* 

121773 AB033022 Hs.158654 KIAA1196 protein 79

121775 AA421773 Hs.161008 ESTs 17

121776 AA292579 Hs.125133 hypothetical protein FLJ22501 66

121786 AI810774 Hs.98376 ESTs 105

121832 AW340797 Hs.98434 ESTs 5 8 

121836 AA328348 Hs.218289 ESTs 3 ' 8 

121839 AA425691 Hs.191606 ESTs, Highly similar to KIAA1048 protein [H.sapiens] 5 0

121842 AF027405 Hs.104865 serine/threonine kinase 23 2 7

121847 AA446628 Hs.2799 cartilage linking protein 1 23 

121871 AW972668 Hs.293044 ESTs ~ 29 

121882 AA426376 Hs.98459 ESTs 5 0 
121911 AA427950 gb:zw50f02.s1 Soares_total_fetus_Nb2HF8_9w Homo sapiens cDNA clone IMAGE:773499 3' 7^2

121915 AA428179 Hs.223405 ESTs, Moderately similar to A46010 X-linked retinopathy protein [H.sapiens] 25

121935 AA428647 Hs.98611 EST 2 3 

121983 AA298760 Hs.180191 hypothetical protein FLJ14904 3 4

121985 AI862570 Hs.299214 Homo sapiens, clone IMAGE:2822295, mRNA, partial cds 114

121995 AA210863 Hs.3532 nemo-like kinase 3 8

121999 AA430211 Hs.98668 EST 6 ' 4

122009 AW292763 Hs.160822 Homo sapiens cDNA: FLJ20863 fis, clone ADKA01804 22

122013 AA431085 Hs.98706 ESTs 65 
122036 W92142 Hs.271963 ESTs, Weakly similar to ALU5_HUMAN ALU SUBFAMILY SC SEQUENCE CONTAMINATION 13 1

122050 AI453076 Hs.166109 ELAV (embryonic lethal, abnormal vision, Drosophila)-like 2 91

122060 AA431738 Hs.98750 EST ««

122114 AW161023 Hs.104921 ESTs 15 

122188 AA398838 gb:zt80d01.r1 Soares_testis_NHT Homo sapiens cDNA clone 5', mRNA sequence iz

122204 AA435936 Hs.98842 EST ^ 5 5 

122246 AA329550 Hs.29417 HCF-binding transcription factor Zhangfei s'l 

122257 AA436819 Hs.98899 ESTs 56

122302 AA441801 Hs.104947 ESTs 5 ' 8

122341 AW601969 Hs.99010 hypothetical protein FLJ22263 similar to nuclear receptor-binding SET-domain protein 1 10

122356 AA443794 Hs.98390 ESTs 7 3 

122369 AA443985 Hs.303222 ESTs 7 2 n 

122371 AA868555 Hs.178222 ESTs 50

122372 AA446008 Hs.336677 EST 7 g

122378 AB032948 Hs.21356 hypothetical protein DKFZp762K2015 2.5

122405 AA446572 Hs.303223 EST 2 o 

122412 AA446869 Hs.119316 ESTs 73 

122415 AA446918 Hs.99088 EST 1 g

122418 AA446966 Hs.99090 ESTs, Moderately similar to similar to KIAA0766 [H.sapiens] 63

122440 AW505139 Hs.9460 Homo sapiens mRNA; cDNA DKFZp547C244 (from clone DKFZp547C244) 2 6

122446 AA447603 Hs.99123 EST 71 

122448 AA447626 Hs.99127 EST 35 

122458 AI266159 Hs.104980 ESTs * 5

122460 AW418788 Hs.99148 ESTs, Weakly similar to S43569 R01H10.6 protein - Caenorhabditis elegans [C.elegans] 97

122464 AA448158 Hs.99152 EST ^ J ^
122490 AA448349 Hs.238151 EST 6.1

122492 AA446417 Hs.104990 ESTs 5.4 

122502 AA204969 Hs.234863 Homo sapiens cDNA FLJ12082 fis, clone HEMBB1002492 1.3

122510 AA449232 Hs.99195 ESTs 11.2

122530 AW959741 Hs.40368 adaptor-related protein complex 1, sigma 2 subunit 10.1

122547 AA779725 Hs.164589 ESTs 2.5

122555 AA194055 Hs.293858 ESTs 1.9 

122570 AA452578 Hs.262907 ESTs 9.5 

122572 AA452601 Hs.99287 EST 110 

122586 AK001910 Hs.99303 Homo sapiens cDNA FLJ11048 fis, clone PLACE1004516 3.4

122587 AB040893 Hs.6968 KIAA1460 protein 2.0 

122598 AI028173 Hs.99329 ESTs 1.7 

122599 AL355841 Hs.99330 hypothetical protein FLJ23588 4.4
122602 AA411925 Hs.301960 ESTs 4.6

122607 AA453518 Hs.98023 ESTs 61.5

122614 AA453630 Hs.99339 EST 10.7 

122616 AA453638 Hs.161873 ESTs 107.3 

122617 AI681535 Hs.148135 serine/threonine kinase 33 121.4 

122618 AA453641 gb:zx48e06.s1 Soares_testis_NHT Homo sapiens cDNA clone 3', mRNA sequence 31.1
122622 AA453987 Hs.144802 ESTs 5.6

122717 AA456859 Hs.178358 ESTs 8.5 

122762 AI376875 Hs.105119 ESTs 10.4 

122829 AW204530 Hs.99500 ESTs 81.8 

122834 AA461492 Hs.99545 Homo sapiens cDNA FLJ10658 fis, clone NT2RP2006052 3.6

122836 AA460581 Hs.290996 ESTs 4.5

122837 AA461509 Hs.293565 ESTs, Weakly similar to putative p150 [H.sapiens] 2.7

122838 AA460584 Hs.334386 ESTs 75.3
122854 AA600235 Hs.9625 NIMA (never in mitosis gene a)-related kinase 6 7.7
122856 AI929374 Hs.75367 Src-like-adapter 5.8

122861 AA335721 Hs.119394 ESTs 1.3

122866 BE539656 Hs.283705 ESTs 4.1 

122868 AF005216 Hs.115541 Janus kinase 2 (a protein tyrosine kinase) 5.3

122870 AW576312 Hs.318722 Homo sapiens cDNA: FLJ21766 fis, clone C0U7179 9.9

122872 AW081394 Hs.97103 ESTs 5.3

122879 AA769410 Hs.128654 ESTs 13.9

122907 AA470074 Hs.169896 ESTs 11.5 

122916 AA470140 Hs.229170 EST 1.7 

122981 AA478951 Hs.105629 ESTs 5.0 

123013 AW968324 Hs.17384 ESTs 15.4 

123016 AW338067 Hs.323231 Homo sapiens cDNA FLJ11946 fis, clone HEMBB1000709 2.8

123034 AL359571 Hs.44054 ninein (GSK3B interacting protein) 8.7

123072 AI382600 Hs.104308 ESTs, Weakly similar to KIAA1395 protein [H.sapiens] 8.8

123082 AA485360 Hs.105661 ESTs 3.9 

123088 AI343652 Hs.105667 ESTs 3.8 

123110 AA486256 Hs.193510 EST 7.4

123114 BE304942 Hs.265848 myomegalin 2.8

123131 T52027 Hs.271795 ESTs, Weakly similar to I38022 hypothetical protein [H.sapiens] 2.4

123132 AI061582 Hs.324179 Homo sapiens cDNA FU12371 fis, clone MAMMA1002434 15.6 
123136 AW451999 Hs.194024 ESTs 5.1 

123149 AI734179 Hs.105676 ESTs 23.8

123152 AW601773 Hs.270259 ESTs 5.2
123258 AA490929 Hs.105274 ESTs, Weakly similar to RMS1_HUMAN REGULATOR OF MITOTIC SPINDLE ASSEMBLY 1 [H.sapiens] 9.3
123315 AA496369 gb:zv37d10.s1 Soares ovary tumor NbHOT Homo sapiens cDNA clone IMAGE:755827 3' similar to 4.1

123369 AA504757 Hs.105738 ESTs 6.9 

123394 AA731404 Hs.105510 ESTs 3.6

123433 AW450922 Hs.112478 ESTs 3.7 

123466 AA599042 Hs.112503 EST 7.4 

123470 AW303285 Hs.303632 Human DNA sequence from clone RP11-110H4 on chromosome 5 Contains a pseudogene similar to 3.5

123471 AB021644 Hs.197219 zinc finger protein 14 (KOX 6) 5.2
123475 BE439553 Hs.250528 Homo sapiens, clone IMAGE:4098694, mRNA, partial cds 1.7

123482 N95059 Hs.55098 ESTs 1.6 

123486 BE019072 Hs.334802 Homo sapiens cDNA FLJ14680 fis, clone NT2RP2004242, weakly similar to 14

123508 AW380388 Hs.155546 KIAA1080 protein; Golgi-associated, gamma-adaptin ear containing, ARF-binding protein 2 2.2

123615 AA609170 gbaM2a12.s1 Soares_testis_NHT Homo sapiens cDNA clone 3', mRNA sequence 7.8

123619 AA602964 gb:no97o02.s1 NCI_CGAP_Pr2 Homo sapiens cDNA clone, mRNA sequence 2.8

123658 AA609364 gb:zu71d09.s1 Soares_testis_NHT Homo sapiens cDNA clone IMAGE:743441 3' similar to contains Alu, 1.7

123674 AI269609 Hs.105187 kinesin protein 9 gene 5.7
123735 NM_013241 Hs.95231 FH1/FH2 domain-containing protein 10.0

123738 AA609891 Hs.112777 EST 52

123753 AA609955 Hs.234961 Huntingtin interacting protein E 30.6

123804 AA620464 Hs.261915 EST, Weakly similar to S65657 alpha-1C-adrenergic receptor splice form 2 [H.sapiens] 2.1

123811 AA620586 gbae60g05.s1 Stratagene lung carcinoma 937218 Homo sapiens cDNA clone IMAGE:951320 3' 2.7
123951 AB012922 Hs.173043 metastasis-associated 1-like 1
123983 AJ272267 Hs.146178 choline dehydrogenase 
124001 L42542 Hs.75447 ralA binding protein 1 
124006 AM47155 Hs.270016 ESTs 
124070 AI950314 Hs.154762 HIV-1 rev binding protein 2

124074 H05635 Hs.294030 topoisomerase-related function protein 4-2
124178 BE463721 Hs.97101 putative G protein-coupled receptor
124203 AA372796 Hs.269339 ESTs, Weakly similar to AF161356 1 HSPC093 [H.sapiens]
124352 AA640891 Hs.102406 ESTs
124375 D87454 Hs.192966 KIAA0265 protein

124385 AI267847 gbaq49a10jc1 Stanley Frontal NB pool 2 Homo sapiens cDNA clone similar to contains

124390 AA317338 Hs.7535 COBW-like protein 

124391 AF155099 Hs.279780 NY-REN-18 antigen 

124417 N34059 gb:yv28h09.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone 3' similar to contains Alu 3.3

124428 H13540 Hs.82202 ribosomal protein L17 2.9
124440 AA532519 Hs.129043 Human DNA sequence from clone 989H11 on chromosome 22q13.1-13.2. Contains part of a
124466 R10084 Hs.113319 kinesin heavy chain member 2

124482 N53935 gb:yv59d09.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone 3', mRNA sequence

124498 H79433 Hs.268997 ESTs
124515 AA669097 Hs.109370 ESTs

124608 N71076 Hs.102800 ESTs, Weakly similar to neuronal thread protein AD7c-NTP [H.sapiens]
124631 NM_014053 Hs.270594 FLVCR protein

124634 AJ765123 Hs.143671 Homo sapiens cDNA FLJ13533 fis, clone PLACE1006371
124637 AA160474 Hs.75798 hypothetical protein
124642 AW968856 Hs.278569 sorting nexin 17
124649 N92593 Hs.313054 ESTs 
124656 AW297702 Hs.102915 ESTs 
124661 R48170 Hs.78436 EphB1 

124683 AA381661 Hs.119878 ESTs, Weakly similar to M3K9_HUMAN MITOGEN-ACTIVATED PROTEIN KINASE KINASE
124712 R09166 Hs.191148 ESTs
124735 R22952 Hs.268685 ESTs 

124761 AA374756 Hs.93560 Homo sapiens mRNA for KIAA1771 protein, partial cds
124768 AW368528 Hs.100855 ESTs
124775 R41772 Hs.100876 ESTs
124777 R41933 Hs.140237 ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE
124788 R43543 Hs.100912 Homo sapiens cDNA: FLJ22726 fis, clone HSI15005
124809 AL355722 Hs.106875 Homo sapiens EST from clone 35214, full insert

124811 R46068 Hs.288912 hypothetical protein FLJ22604

124812 R47948 Hs.188732 ESTs
124822 AA418160 Hs.86043 Homo sapiens cDNA FLJ13556 fis, clone PLACE1007743

124825 AA501669 Hs.336693 ESTs 
124833 AW975868 Hs.294100 ESTs 
124857 R63652 Hs.137190 ESTs 
124860 R65763 Hs.101477 EST 
124863 AI382555 Hs.127950 bromodomain-containing 1

124876 AF135422 Hs.27059 GDP-mannose pyrophosphorylase A
124878 BE397530 Hs.288057 hypothetical protein FLJ22242

124902 H37941 Hs.101883 ESTs 

124903 AW296713 Hs.221441 ESTs
124930 AI076343 Hs.173939 ESTs, Weakly similar to ALUB_HUMAN !!!! ALU CLASS B WARNING ENTRY !!!! [H.sapiens]

124942 R99978 Hs.268892 ESTs, Moderately similar to B34087 hypothetical protein [H.sapiens]
124958 AI078645 Hs.431 murine leukemia viral (bmi-1) oncogene homolog
124980 T40841 Hs.98881 ESTs

125002 T59338 Hs.269463 ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION
125047 T79815 Hs.279793 ESTs
125051 T79956 Hs.100588 EST
125056 T81310 Hs.100592 ESTs
125101 AM72068 Hs.286236 KIAA1856 protein

125113 T96595 Hs.302270 ESTs, Weakly similar to ALUF_HUMAN !!!! ALU CLASS F WARNING ENTRY !!!! [H.sapiens]
125115 T97341 gb:ye57e05.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE:121856 3' similar to

125125 AI222382 Hs.240767 Human DNA sequence from clone RP1-12G14 on

125147 W38150 Empirically selected from AFFX single probeset 1.7 
125161 W44657 Hs.144232 EST

125249 AA630863 Hs.131375 ESTs, Moderately similar to ALUB_HUMAN !!!! ALU CLASS B WARNING ENTRY !!!! [H.sapiens]
125255 AF098162 Hs.118631 timeless (Drosophila) homolog

125279 AW401809 Hs.4779 KIAA1150 protein J*
125280 AJ123705 Hs.106932 ESTs

125298 AW972542 Hs.289008 Homo sapiens cDNA: FLJ21814 fis, clone HEP01068

125660 AW292171 Hs.23978 scaffold attachment factor B J* 

125827 NM_003403 Hs.97496 YY1 transcription factor y

125891 U29589 Hs.7138 cholinergic receptor, muscarinic 3 ™

126005 AW409701 Hs.1578 baculoviral IAP repeat-containing 5 (survivin) 14.3
126202 AA157632 Hs.272630 vacuolar proton pump delta polypeptide 
126695 AA643322 Hs.172028 a disintegrin and metalloproteinase domain 10 

127050 AW411066 Hs.274351 CGI-89 protein 

127274 AW966158 Hs.58582 Homo sapiens cDNA FLJ12789 fis, clone NT2RP2001947

128355 AW293012 Hs.161623 ESTs *•*

128493 D87466 Hs.240112 KIAA0276 protein y

128493 D87466 Hs.240112 KIAA0276 protein J-J

128522 BE173977 Hs.10098 putative nucleolar RNA helicase '!

128527 AA504583 Hs.101047 transcription factor 3 (E2A immunoglobulin enhancer binding factors E12/E47) i.d

128528 R39234 Hs.251699 ESTs, Weakly similar to IDN4-GGTR14 [H.sapiens] 2.8

128595 U31875 Hs.272499 short-chain alcohol dehydrogenase family member w 

128599 NM_015366 Hs.102336 Rho GTPase activating protein 8 2.3

128604 AI879099 Hs.102397 GIOT-3 for gonadotropin inducible transcription • y
128608 BE267994 Hs.102419 zinc finger protein

128625 AB037841 Hs.102652 hypothetical protein ASH1 "

128629 AL096748 Hs.102708 DKFZP434A043 protein **
128639 AW582962 Hs.102897 CGI-47 protein

128656 AA458542 Hs.10326 coatomer protein complex, subunit epsilon ™
128656 AA458542 Hs.10326 coatomer protein complex, subunit epsilo . . _ I~

128658 BE397354 Hs.324830 diphtheria toxin resistance protein required for «
128670 AA975486 Hs.103441 Homo sapiens, Similar to RIKEN cDNA

128691 W27939 Hs.103834 hypothetical protein MGC5576 7.7 

128696 BE081143 Hs.225977 nuclear receptor coactivator 3 *J 

128700 Y15221 Hs.103982 small inducible cytokine subfamily B (Cys-X-Cys), member 11 1*

128714 T85231 Hs.179661 tubulin, beta 5 J*

128717 AK001564 Hs.104222 hypothetical protein FLJ10702 "

128733 BE147740 Hs.104558 ESTs, Moderately similar to I38022 hypothetical protein [H.sapiens] 2.7

128737 AF292100 Hs.104613 RP42 homolog »

128742 AA307211 Hs.251531 proteasome (prosome, macropain) subunit, alpha type, 4 y

128746 AW70163 Hs.323342 actin related protein 2/3 complex, subunit 4 (20 kD) y

128747 AB027249 Hs.104741 PDZ-binding kinase; T-cell originated protein kinase y

128772 BE302796 Hs.105097 thymidine kinase 1, soluble *
128781 N71826 Hs.105465 small nuclear ribonucleoprotein polypeptide F

128797 NM_002975 Hs.105927 stem cell growth factor; lymphocyte secreted C-type lectin 13.3

128806 AW630942 Hs.106061 RD RNA-binding protein y
128814 AW248431 Hs.256526 nuclear prelamin A recognition factor 

128830 BE281170 Hs.106357 valosirwx)ntaining protein MPUflani4 *J 

128835 AK001731 Hs.106390 Homo sapiens mRNA; cDNA DKFZp586H0924 (from clone DKFZp586H0924) 1.6 

128854 BE159181 Hs.168232 hypothetical protein FLJ13855 «

128854 BE159181 Hs.168232 hypothetical protein FLJ13855 y

128868 AA419008 Hs.106730 chromosome 22 open reading frame 3 y 
128868 AA419008 Hs.106730 chromosome 22 open reading frame 3 

128871 AF189723 Hs.106778 ATPase,Ca++ transporting, type 2C, member 1 J* 

128891 F34856 Hs.292457 Homo sapiens, clone MGC:16362, mRNA, complete cds 13.3

128906 R57988 Hs.10706 epithelial protein lost in neoplasm beta y

128920 AA622037 Hs.166468 programmed cell death 5 lrtnlo/4< ]i

128925 R67419 Hs.21851 Homo sapiens cDNA FLJ12900 fis, clone NT2RP2004321 1.9

128946 Y13153 Hs.107318 kynurenine 3-monooxygenase (kynurenine 3-hydroxylase) 7.2

128949 AA009647 Hs.8850 a disintegrin and metalloproteinase dom 2.4

128958 AW139032 Hs.107376 hypothetical protein DKFZp434N035 J£

128959 AI580127 Hs.107381 hypothetical protein FLJ11200 ™f

128965 AW150697 Hs.107418 ESTs ] \ 

128970 AI375672 Hs.165028 ESTs ]f 

128975 BE560779 Hs.284233 NICE-5 protein ] 4 - u

128979 AW271217 Hs.281434 Homo sapiens cDNA FLJ14028 fis, clone HEMBA1003838 1.6

128995 AI816224 Hs.107747 DKFZP566C243 protein J*

129019 AI950087 gb:wq05c02.x1 NCI_CGAP_Kid12 Homo sapiens cDNA clone 3', mRNA sequence 2.9
129021 AL044675 Hs.173081 KIAA0530 protein 38

129021 AL044675 Hs.173081 KIAA0530 protein £s

129032 R80088 Hs.108104 ubiquitin-conjugating enzyme E2L 3 34 

129076 AW296806 Hs.326234 ESTs, Highly similar to T46422 hypothetical protein DKFZp434M2023.1 [H.sapiens] 5 0

129078 AI351010 Hs.102267 lysosomal jZ

129088 AA744610 Hs.194431 palladin 7 7 \

129095 L12350 Hs.108623 thrombospondin 2 2.7

129096 AA463189 Hs.288906 WW Domain-Containing Gene 209 

129097 BE243933 Hs.108642 zinc finger protein 22 (KOX 15) 3 rj 
129099 AF146074 Hs.108660 ATP-binding cassette, subfamily C (CFTR/MRP), member 5 5*8 
129136 W93048 Hs.250723 hypothetical protein MGC2747 5 9
129149 AA356620 Hs.108947 KIAA0050 gene product 63
129172 AW162916 Hs.241576 hypothetical protein PR02577 1*8 
129192 AA286914 Hs.183299 ESTs 2 'l 

129194 M150797 Hs.109276 latexin protein 3^

129198 N57532 Hs.109315 KIAA1415 protein 58

129207 AI934365 Hs.109439 osteoglycin (osteoinductive factor, mimecan) 80

129228 U40714 Hs.239307 tyrosyl-tRNA synthetase 29

129229 AF013758 Hs.109643 polyadenylate binding protein-interacting protein 1 32
129254 AA252468 Hs.1098 DKFZp434J1813 protein " 26

129255 AI961727 Hs.109804 H1 histone family, member X 7^3

129288 W26392 Hs.110080 ESTs, Weakly similar to S13495 pregnancy zone protein [H.sapiens] 9 6

129296 AI051967 Hs.110122 ESTs 12

129323 AA287239 Hs.5518 Homo sapiens cDNA FLJ11311 fis, clone PLACE1010102 5*1

129340 H75334 Hs.11050 F-box only protein 9 4^

129347 BE614192 Hs.279869 melanoma-associated antigen recognised by cytotoxic T lymphocytes j£

129362 U30246 Hs.110736 solute carrier family 12 (sodium/potassium/chloride transporters), me 6 7

129366 BE220806 Hs.184697 Homo sapiens clone 23785 mRNA sequence 8*6

129370 AI686379 Hs.110796 SAR1 protein -1*4

129372 NM_016039 Hs.110803 CGI-99 protein 20

129403 AF149785 Hs.111126 pituitary tumor-transforming 1 interacting protein 7*4

129404 AI267700 Hs.317584 ESTs 50 
129404 AI267700 Hs.317584 ESTs 25 
129423 AA204686 Hs.234149 hypothetical protein FLJ20647 102

129449 AI096988 Hs.111554 ADP-ribosylation factor-like 7 80

129453 AW974265 Hs.111632 Lsm3 protein 32

129482 AA188185 Hs.289043 spindlin 67

129482 AA188185 Hs.289043 spindlin 3 ' 6

129513 AW843633 Hs.306163 hypothetical protein AL110115 71

129515 AF255303 Hs.112227 membrane-associated nucleic acid binding protein 9 5

129527 AA769221 Hs.270847 delta-tubulin 3 2

129559 W01296 Hs.11360 hypothetical protein FLJ14784 75

129560 AA317841 Hs.7845 hypothetical protein MGC2752 &8
129570 AI923097 Hs.11441 chromosome 1 open reading frame 8 2.0

129575 F08282 Hs.278428 progestin induced protein 16

129587 H14718 Hs.11506 Human clone 23589 mRNA sequence 6*8

129588 BE408300 Hs.301862 postmeiotic segregation increased 2-like 9 14 
129591 N57423 Hs.179898 HSPC055 protein 7^3 
18594 AW403724 Hs.36989 coagulation factor VII (serum prothrombin conversion accelerator) 9 0

129596 AF035537 Hs.115521 REV3 (yeast homolog)-like, catalytic subunit of DNA polymerase zeta 16

129628 U38945 Hs.1174 cyclin-dependent kinase inhibitor 2A (melanoma, p16, inhibits CDK4) 22

129628 U38945 Hs.1174 cyclin-dependent kinase inhibitor 2A (me 14

129629 AK000398 Hs.11747 hypothetical protein FLJ20391 38
129649 AD000092 Hs.16488 calreticulin 3 3

129675 NM_015556 Hs.172180 KIAA0440 protein 13 4

129680 U03749 gb:Human chromogranin A (CHGA) gene, promoter an 141

129689 AW746482 Hs.77873 B7 homolog 3 26

129702 AI304966 Hs.12035 ESTs, Weakly similar to I38022 hypothetical protein [H.sapiens] 74

129720 AA156214 Hs.12152 APMCF1 protein 20

129721 NM_001415 Hs.211539 eukaryotic translation initiation factor 2, subunit 3 (gamma, 52kD) 1*7

129726 H15474 Hs.132898 fatty acid desaturase 1 83

129778 AK001676 Hs.12457 hypothetical protein FLJ10814 18

129779 AA394090 Hs.12460 Homo sapiens clone 23870 mRNA sequence *u
129800 AF052112 Hs.12540 lysosomal 17

129806 AB023148 Hs.173373 KIAA0931 protein 1*2

129815 BE565817 Hs.26498 hypothetical protein FLJ21657 31

129840 NM_006590 Hs.12820 SnRNP assembly defective 1 homolog ^8
129861 AL049999 Hs.85963 DKFZP564M182 protein 2.2
129864 AI393237 Hs.129914 runt-related transcription factor 1 (acute myeloid leukemia 1; aml1 oncogene) 1 7
129869 AI222069 Hs.13015 hypothetical protein similar to mouse DnaJH 27
129922 AF042379 Hs.13386 gamma-tubulin complex protein 2 45
129945 BE514376 Hs.165998 PAI-1 mRNA-binding protein 1*8
129953 AA412195 Hs.13740 ESTs £5
129972 AW753185 Hs.180628 dynamin 1-like 1 a
129983 U09848 Hs.132390 zinc finger protein 36 (KOX 18) 1*3
129969 AB015856 Hs.247433 activating transcription factor 6 4 0
130010 AA301116 Hs.142838 nucleolar phosphoprotein Nopp34 16
130081 AA287325 Hs.14713 ESTs 4^
130082 S73265 Hs.1473 gastrin-releasing peptide 18
130097 AL046962 Hs.14845 forkhead box O3A 28
130100 AL135561 Hs.14891 hypothetical protein FLJ21047 2 3
130111 X53002 Hs.149846 integrin, beta 5 2.3
130112 AA916785 Hs.180610 splicing factor proline/glutamine rich (polypyrimidine tract-binding protein-associated) 10
130112 AA916785 Hs.180610 splicing factor proline/glutamine rich ( 2.1
130128 L76937 Hs.150477 Werner syndrome 18
130135 AA311426 Hs.21635 tubulin, gamma 1 &1
130211 NM_003358 Hs.23703 ESTs, Moderately similar to CEGT_HUMAN CERAMIDE GLUCOSYLTRANSFERASE [H.sapiens] 16
130212 D80001 Hs.152629 KIAA0179 protein 13
130236 R85367 Hs.51957 splicing factor, arginine/serine-rich 2, interacting protein 2.0
130241 AL035588 Hs.153203 MyoD family inhibitor 3.2
130242 X79201 Hs.153221 synovial sarcoma, translocated to X chromosome 5.4
130249 D81963 Hs.322852 GAS2-related on chromosome 22 4.8
130263 NM_002497 Hs.153704 NIMA (never in mitosis gene a)-related kinase 2 1.4
130287 AA479005 Hs.154036 tumor suppressing subtransferable candidate 3 2.6
130310 AB011121 Hs.154248 amyotrophic lateral sclerosis 2 (juvenile) chromosome region, candidate 3 6.3
130353 Z19084 Hs.172210 MUF1 protein 6.2
130356 AF127577 Hs.155017 nuclear receptor interacting protein 1 24
130357 AJ224442 Hs.155020 putative methyltransferase 34
130359 NM_013449 Hs.277401 bromodomain adjacent to zinc finger domain, 2A 8.5
130367 AI135301 Hs.8768 hypothetical protein FLJ10849 1 4
130372 AI077464 Hs.5011 RNA binding motif protein 9 33
130393 N89487 Hs.155291 KIAA0005 gene product 18
130399 AW374106 Hs.155356 hypothetical protein MGC2840 similar to a putative glucosyltransferase 34
130407 BE385099 Hs.334727 hypothetical protein MGC3017 2 3
130409 NM_001197 Hs.155419 BCL2-interacting killer (apoptosis-inducing) 2 7
130419 AF037448 Hs.155489 NS1-associated protein 1 18
130441 U63630 Hs.155637 protein kinase, DNA-activated, catalytic polypeptide 2.3
130446 BE513202 Hs.15589 PPAR binding protein 3.9
130455 D90041 Hs.155956 N-acetyltransferase 1 (arylamine N-acetyltransferase) 33.6
130455 D90041 Hs.155956 N-acetyltransferase 1 (arylamine N-acety 4.6
130471 AL121438 Hs.183706 adducin 1 (alpha) 27
130485 BE245851 Hs.180779 H2B histone family, member B 5.0
130487 U49844 Hs.77613 ataxia telangiectasia and Rad3 related 43
130498 L38951 Hs.180446 karyopherin (importin) beta 1 16
130503 BE208491 Hs.295112 KIAA0618 gene product 16.1
130511 L32137 Hs.1584 cartilage oligomeric matrix protein (pseudoachondroplasia, epiphyseal dysplasia 1, multiple) 6.1
130511 L32137 Hs.1584 cartilage oligomeric matrix protein (pse 53
130526 AW876523 Hs.15929 hypothetical protein FLJ12910 21
130542 U64675 Hs.179825 RAN binding protein 2-like 1 78
130544 AA321238 Hs.4310 eukaryotic translation initiation factor 1A 1.5
130553 AF062649 Hs.252587 pituitary tumor-transforming 1 14.4
130556 AI907018 Hs.15977 Empirically selected from AFFX single probeset 47
130567 AA383092 Hs.1608 replication protein A3 (14kD) 79
130568 AA232119 Hs.16085 putative G-protein coupled receptor 33
130574 AF083208 Hs.16178 apoptosis antagonizing transcription factor \ i
130586 AB007891 Hs.16349 KIAA0431 protein 56
130598 AU142210 Hs.16493 hypothetical protein DKFZp762N2316; KIAA1803 protein 14
130601 AA609738 Hs.16525 ESTs 1(5
130614 AI354355 Hs.16697 down-regulator of transcription 1, TBP-binding (negative cofactor 2) 1.3
130617 M90516 Hs.1674 glutamine-fructose-6-phosphate transaminase 1 12.1
130617 M90516 Hs.1674 glutamine-fructose-6-phosphate transamin 2.4
130618 AA383439 Hs.16758 Spir-1 protein 159
130667 BE246961 Hs.17639 Homo sapiens ubiquitin protein ligase (UBE3B) mRNA, partial cds 13 9
130674 AL048842 Hs.194019 attractin 1.5
130675 AA442233 Hs.17731 hypothetical protein FLJ12892 5.4

130692 AA652501 Hs.13561 hypothetical protein MGC4692 5.0 

130693 R68537 Hs.17962 ESTs 2.0 
130712 AJ271881 Hs.279762 bromodomain-containing 7 1.8 

130714 AI348274 Hs.18212 DNA segment on chromosome X (unique) 9879 expressed sequence 2.0
130730 AB007920 Hs.18586 KIAA0451 gene product 3.7
130744 H59696 Hs.18747 POP7 (processing of precursor, S. cerevisiae) homolog 3.1
130751 AF052105 Hs.18879 chromosome 12 open reading frame 1.4
130757 AL036067 Hs.18925 protein x 0001 5.7
130768 AF258627 Hs.211562 ATP-binding cassette, sub-family A (ABC1), member 1 5.1
130789 AK000355 Hs.8899 sirtuin (silent mating type information regulation 2, S.cerevisiae, homolog) 5 52
130815 AB018298 Hs.19822 SEC24 (S. cerevisiae) related gene family, member D 1.5

130836 J05068 Hs.2012 transcobalamin I (vitamin B12 binding protein, R binder family) 15.7 

130841 AL157468 Hs.325825 Homo sapiens cDNA FLJ20848 fis, clone ADKA01732 2.8
130843 AA447492 Hs.20183 ESTs, Weakly similar to AF164793_1 protein x 013 [H.sapiens] 1.5
130844 U76248 Hs.20191 seven in absentia (Drosophila) homolog 2 3.4
130855 AJ243706 Hs.143323 putative DNA/chromatin binding motif 1.7
130861 NM_016578 Hs.20509 HBV pX associated protein-8 1.9
130879 NM_003416 Hs.2076 zinc finger protein 7 (KOX 4, clone HF.16) 1.4
130880 BE514434 Hs.20830 kinesin-like 2 2.1
130892 AL120837 Hs.20993 high-glucose-regulated protein 8 2.4
130898 AB033078 Hs.186613 sphingosine-1-phosphate lyase 1 1.7
130911 BE409769 Hs.21189 DnaJ (Hsp40) homolog, subfamily A, member 2 1.8
130919 N79110 Hs.21276 collagen, type IV, alpha 3 (Goodpasture antigen) binding protein 2.3
130944 BE382657 Hs.21486 signal transducer and activator of transcription 1, 91 kD 5.4

130971 N39842 Hs.301444 KIAA1673 2.2 

130992 BE398091 Hs.74316 desmoplakin (DPI, DPII) 1.8
130993 T97401 Hs.21929 ESTs 1.6
131005 AV658308 Hs.2210 thyroid hormone receptor interactor 3 1.6
131028 AI879165 Hs.2227 CCAAT/enhancer binding protein (C/EBP), gamma 1.2
131042 AI826288 Hs.171637 hypothetical protein MGC2628 1.6
131046 AA321649 Hs.2248 small inducible cytokine subfamily B (Cys-X-Cys), member 10 7.4
131046 AA321649 Hs.2248 small inducible cytokine subfamily B (Cy 3.0
131047 H23230 Hs.22481 ESTs, Moderately similar to A46010 X-linked retinopathy protein [H.sapiens] 1.7
131060 AA194422 Hs.22564 myosin VI 5.1
131060 AA194422 Hs.22564 myosin VI 2.5
131070 N53344 Hs.22607 ESTs 7.1
131076 AA749230 Hs.26433 dolichyl-phosphate (UDP-N-acetylglucosamine) N-acetylglucosaminephosphotransferase 1 ( 2.0
131076 AA749230 Hs.26433 dolichyl-phosphate (UDP-N-acetylglucosam 1.9
131099 AL133353 Hs.226581 COX15 (yeast) homolog, cytochrome c oxidase assembly protein 7.0
131174 NM_006540 Hs.29131 nuclear receptor coactivator 2 1.9
131185 BE280074 Hs.23960 cyclin B1 5.8

131206 AW138839 Hs.24210 ESTs 2.0 

131213 AA885699 Hs.24332 CGI-26 protein 7.0 

131225 H62087 Hs.31659 thyroid hormone receptor-associated protein, 95-kD subunit 7.5
131231 N47468 Hs.59757 zinc finger protein 281 2.9
131233 D89053 Hs.268012 fatty-acid-Coenzyme A ligase, long-chain 3 3.5
131243 AW383256 Hs.24752 spectrin SH3 domain binding protein 1 2.8
131245 AL080080 Hs.24766 thioredoxin domain-containing 2.8
131247 AL043100 Hs.326190 fatty acid amide hydrolase 5.6
131281 AA251716 Hs.25227 ESTs 5.7
131283 X80038 Hs.339713 Homo sapiens clone F19374 APO E-C2 gene cluster U
131305 AV656017 Hs.184325 CGI-76 protein 5.0
131320 AA505691 Hs.145696 splicing factor (CC1.3) 1.8
131339 AF058696 Hs.25812 Nijmegen breakage syndrome 1 (nibrin) 2.6
131339 AF058696 Hs.25812 Nijmegen breakage syndrome 1 (nibrin) 2.6
131375 AW293165 Hs.143134 ESTs 5.4
131390 BE269388 Hs.182698 mitochondrial ribosomal protein L20 5.3
131410 BE259110 Hs.279836 HSPC166 protein 2.2
131412 NM_012247 Hs.124027 SELENOPHOSPHATE SYNTHETASE ; Human selenium donor protein 2.0
131429 AL046302 Hs.26750 hypothetical protein FLJ21908 1.4
131458 BE297567 Hs.27047 hypothetical protein FLJ20392 1.7
131475 AA992841 Hs.27263 KIAA1458 protein 2.0
131501 AV661958 Hs.8207 GK001 protein 2.6
131501 AV661958 Hs.8207 GK001 protein 1.6
131511 AA732153 Hs.27865 Homo sapiens cDNA: FLJ21333 fis, clone COL02535 2.0
131528 AU076408 Hs.28309 UDP-glucose dehydrogenase 1.6
131532 BE268278 Hs.28393 hypothetical protein MGC2592 7.4
131543 AW966881 Hs.41639 programmed cell death 2 2.2
131544 AL355715 Hs.28555 programmed cell death 9 (PDCD9) 2.1
131562 NM_003512 Hs.28777 H2A histone family, member L 1.7
131564 T93500 Hs.28792 Homo sapiens cDNA FLJ11041 fis, clone PLACE1004405 5.1
131564 T93500 Hs.28792 Homo sapiens cDNA FLJ11041 fis, clone PL 1.8
131569 AL389951 Hs.271623 nucleoporin 50kD 5.0
131618 BE393822 Hs.29645 Homo sapiens mRNA; cDNA DKFZp761C029 (from clone DKFZp761C029); partial cds 1.8
131622 R78195 Hs.29692 Homo sapiens cDNA FLJ11436 fis, clone HEMBA1001213 1.3
131623 AB037791 Hs.29716 hypothetical protein FLJ10980 2.2
131623 AB037791 Hs.29716 hypothetical protein FLJ10980 1.9
131643 AW410601 Hs.30026 HSPC182 protein 2.9
131653 AW960597 Hs.30164 ESTs 1.3
131656 AI218918 Hs.30209 KIAA0854 protein 2.8
131669 X52486 Hs.3041 uracil-DNA glycosylase 2 2.6
131692 BE559681 Hs.30736 KIAA0124 protein 5.6
131714 AA642831 Hs.31016 putative DNA binding protein 2.9
131722 D13757 Hs.311 phosphoribosyl pyrophosphate amidotransferase 3.4
131737 AK001641 Hs.31323 inhibitor of kappa light polypeptide gene enhancer in B-cells, kinase complex-associated protein 3.8
131760 X76732 Hs.3164 nucleobindin 2 2.9
131760 X76732 Hs.3164 nucleobindin 2 2.8
131763 AI878932 Hs.317 topoisomerase (DNA) I 3.4
131772 AA382590 Hs.170980 KIAA0948 protein 25.5
131774 BE267158 Hs.169474 DKFZP586J0119 protein 5.5
131787 D87077 Hs.196275 KIAA0240 protein 2.4
131793 AW966127 Hs.32246 Homo sapiens cDNA FLJ14656 fis, clone NT2RP2002439 7.9
131795 BE501849 Hs.32317 high-mobility group 20B 1.4
131798 X86098 Hs.301449 adenovirus 5 E1A binding protein 4.1
131817 U20536 Hs.3280 caspase 6, apoptosis-related cysteine protease 4.2
131824 U28838 Hs.32935 TATA box binding protein (TBP)-associated factor, RNA polymerase III, GTF3B subunit 2 3.5
131850 AI251317 Hs.33184 ESTs 5.1
131878 AA083764 Hs.6101 hypothetical protein MGC3178 5.8
131885 BE502341 Hs.3402 ESTs 13.7
131885 BE502341 Hs.3402 ESTs 2.4
131887 W17064 Hs.332848 SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily e, member 1 3.2
131900 AA099014 Hs.231029 Homo sapiens, clone MGC:15961, mRNA, complete cds 8.7
131900 AA099014 Hs.231029 Homo sapiens, clone MGC:15961, mRNA, com 2.0
131904 AF078866 Hs.284296 Homo sapiens cDNA: FLJ22993 fis, clone KAT11914 5.5
131905 AA179298 Hs.3439 stomatin-like 2 11.3
131913 AW207440 Hs.185973 1.7
131916 AA025976 Hs.34569 ESTs 5.2
131925 AF151048 Hs.183180 anaphase promoting complex subunit 11 (yeast APC11 homolog) 2.7
131929 BE541211 Hs.34804 Homo sapiens cDNA FLJ11472 fis, clone HEMBA1001711 5.3
131941 BE252983 Hs.35086 2.3
131950 AA355113 Hs.35380 x 001 protein 1.5
131962 AK000046 Hs.267448 hypothetical protein FLJ20039 2.3
131965 W79283 Hs.35962 ESTs 1.4
131971 BE567100 Hs.154938 hypothetical protein MDS025 3.5
131977 U90441 Hs.3622 procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline 4-hydroxylase), alpha polypeptide II 6.5
131985 AA503020 Hs.36563 hypothetical protein FLJ22418 2.4
131991 AF053306 Hs.36708 budding uninhibited by benzimidazoles 1 (yeast homolog), beta 2.1
132019 H56995 Hs.37372 Homo sapiens DNA binding peptide mRNA, partial cds 3.2
132031 AF193844 Hs.3758 COP9 complex subunit 7a 5.8
132062 BE266155 Hs.3832 clathrin-associated protein AP47 1.5
132084 NM_002267 Hs.3886 karyopherin alpha 3 (importin alpha 4) 3.7
132103 BE171921 Hs.3991 ESTs 1.4
132105 AV646076 Hs.39959 ESTs 5.8
132116 AW960474 Hs.40289 ESTs 1.7
132176 AA857025 Hs.8878 kinesin-like 1 3.3
132180 NM_004460 Hs.418 fibroblast activation protein, alpha 14.7
132192 AA206153 Hs.4209 mitochondrial ribosomal protein L37 5.5
132194 R42432 Hs.4212 ESTs 4.4
132203 NM_004782 Hs.194714 synaptosomal-associated protein, 29kD 2.2
132207 BE206939 Hs.42287 E2F transcription factor 6 22
132235 AV658411 Hs.42656 KIAA1681 protein 7.8
132240 AB018324 Hs.42676 KIAA0781 protein 1.5
132252 Hs.141269 Homo sapiens cDNA: FLJ21550 fis, clone COL06258 1.3
132266 AA301228 Hs.43299 hypothetical protein FLJ12890 c 7
132273 AA227710 Hs.43658 DKFZP586L151 protein J'l 
132276 AI653507 Hs.285711 hypothetical protein FLJ13089 21
« £22 t™ 71 solutecarrierfamily2^^^ { A 

132294 AB023191 Hs.44131 KIAA0974 protein 1n n
132298 NM_015986 Hs.7120 cytokine receptor-like molecule 9 Jq
132299 AW405882 Hs.44205 cortistatin 1%
132325 N37065 Hs.44856 hypothetical protein FLJ12116 2 0
132348 AW067708 Hs.170311 heterogeneous nuclear ribonucleoprotein D-like 65
132370 AW572805 Hs.46645 ESTs 3*q

<»rc1UDP^a(tose:N-acetylgalacfosamInw t 5 
132376 AI279892 Hs.46801 sorting nexin 14 19 c 
132384 AA312135 Hs.46967 HSPC034 protein ]ti 
132393 AL135094 Hs.47334 hypothetical protein FLJ14495 fg
132450 AA100012 Hs.48827 hypothetical protein FLJ12085 \%
132452 AW973521 Hs.247324 mitochondrial ribosomal protein S14 61
132456 AB011084 Hs.48924 KIAA0512 gene product ALEX2 17
132465 AW169847 Hs.49169 KIAA1634 protein oc
132470 AI224456 Hs.4934 H.sapiens polyA site DNA 50
132484 X16660 Hs.119007 RAB4, member RAS oncogene family 1 4
132518 AW885606 Hs.5064 ESTs 61
132528 T78736 Hs.50758 SMC4 (structural maintenance of chromosomes 4, yeast)-like 1 3*3
132530 AA306105 Hs.50785 SEC22, vesicle trafficking protein (S. cerevisiae)-like 1 20
132532 AA454132 Hs.5080 mitochondrial ribosomal protein L16 2*9
132534 BE388673 Hs.5086 hypothetical protein MGC10433 22
132543 BE568452 Hs.5101 protein regulator of cytokinesis 1 73
132571 AW674699 Hs.5169 suppressor of G2 allele of SKP1, S. cerevisiae, homolog of 17
132574 AW631437 Hs.5184 TH1 drosophila homolog 7
132596 AK001484 Hs.5298 CGI-45 protein 22
132611 AA345547 Hs.53263 hypothetical protein FLJ13287 09
132612 H12751 Hs.5327 PRO1914 protein ' tt
132616 BE262677 Hs.283558 hypothetical protein PRO1855 140

132638 AI796870 Hs.54277 DNA segment on chromosome X (unique) 9928 expressed sequence 114 
132648 U51127 Hs.54434 hypothetical protein MGC1715 9
132668 AB018319 Hs.5460 KIAA0776 protein 26
132692 AW191962 Hs^49239 collagen, type VIII, alpha 2 on
132715 F11875 Hs.5534 Homo sapiens cDNA FLJ12961 fis, clone NT2RP2005645 1 5

]HS5 ^S?? 50085,554 s ta™^ n <^e antigen A2 (60^^ io 

132724 AI142265 Hs.55498 geranylgeranyl diphosphate synthase 1 94
132731 AI189075 Hs.301872 hypothetical protein MGC4840 i2 4
132744 AA010233 Hs.55921 glutamyl-prolyl-tRNA synthetase 146
132760 AA125985 Hs.56145 thymosin, beta, identified in neuroblastoma cells 27
132771 Y10275 Hs.56407 phosphoserine phosphatase 3 0
132773 AA459713 Hs.295901 KIAA0493 protein 03
132784 AI142133 Hs.56845 GDP dissociation inhibitor 2 18
132798 AI026701 Hs.5716 KIAA0310 gene product 37
132807 U07418 Hs.57301 mutL (E. coli) homolog 1 (colon cancer, nonpolyposis type 2) t'a
132810 AB007944 Hs.5737 KIAA0475 gene product 59
132815 AI815189 Hs.57475 sex comb on midleg homolog 1 64
132817 N27852 Hs.57553 tousled-like kinase 2 36
132821 AJ251595 Hs.169610 CD44 antigen (homing function and Indian blood group system) 18
132833 U78525 Hs.57783 eukaryotic translation initiation factor 3, subunit 9 (eta, 116kD) 14 6
132842 NM_016154 Hs.279771 Homo sapiens clone PP1596 unknown mRNA \q
132844 F12200 Hs.5811 chromosome 21 open reading frame 59 2*5
132851 U09716 Hs.287912 lectin, mannose-binding, 1 14
132863 BE268048 Hs.236494 RAB10, member RAS oncogene family 4 2
132869 AW963217 Hs.203961 ESTs, Moderately similar to AF116721_89 PRO2168 [H.sapiens] or
132873 AW007683 Hs.58598 KIAA1266 protein 2 0
132875 NM_004850 Hs.58617 Rho-associated, coiled-coil containing protein kinase 2 16

132891 BE267143 Hs.59271 U2(RNU2) small nuclear RNA auxiliary factor 1 (non-standard symbol) < a 

132897 AW503667 Hs.59545 ring finger protein 15 A
132902 AI936442 Hs.59838 hypothetical protein FLJ10808 61
132912 AW732760 Hs.167578 Homo sapiens cDNA FLJ11095 fis, clone PLACE1005374 71
132913 W78714 Hs.60257 Homo sapiens cDNA FLJ13598 fis, clone PLACE1009921 2fl
132940 T79136 Hs.127243 Homo sapiens mRNA for KIAA1724 protein, partial cds si
132941 AI817165 Hs.6120 hypothetical protein FLJ13222 ^3




WO 02/059377 



PCT/US02/02242 



132942 AA554458 Hs.197751 KIAA0666 protein 1.8
132952 AI658580 Hs.61426 Homo sapiens mesenchymal stem cell protein DSC96 mRNA, partial cds 2.2
132962 AA576635 Hs.6153 CGI-48 protein 4.9
132972 AA034365 Hs.288924 Homo sapiens cDNA FLJ11392 fis, clone HEMBA1000575 2.7
132973 AA035446 Hs.323277 ESTs 5.3
132977 AA093322 Hs.301404 RNA binding motif protein 3 3.2
132980 AA040696 Hs.62016 ESTs 1.3
132994 AA112748 Hs.279905 clone HQ0310 PRO0310p1 3.0
133012 AA847843 Hs.62711 Homo sapiens, clone IMAGE:3351295, mRNA 10.3
133015 AJ002744 Hs.246315 UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase 7 (GalNAc-T7) 2 1
133016 AI439688 Hs.6289 hypothetical protein FLJ20886 {3
133053 AI065016 Hs.6390 Homo sapiens clone FLB3344 PRO0845 mRNA, complete cds
133062 AW500374 Hs.64056 PRO0149 protein
133089 BE247441 Hs.6430 protein with polyglutamine repeat; calcium (ca2+) homeostasis endoplasmic reticulum protein
133091 AK00162A Hs.64691 KIAA0483 protein
133110 AA808177 Hs.65228 ESTs
133134 AF198620 Hs.65648 RNA binding motif protein 8A
133145 H94227 Hs.6592 Homo sapiens, clone IMAGE:2961368, mRNA, partial cds
133152 Z11695 Hs.324473 mitogen-activated protein kinase 1
133174 AA431620 Hs.324178 hypothetical protein MGC2745
133175 AW955632 Hs.66668 ESTs, Weakly similar to S19560 proline-rich protein MP4 - mouse [M.musculus]
133177 X97795 Hs.66716 RAD54 (S.cerevisiae)-like
133197 AI275243 Hs.180201 hypothetical protein FLJ20671
133208 AI801777 Hs.6774 ESTs
133226 AW954569 Hs.296287 Homo sapiens, Similar to bromodomain-containing 4, clone IMAGE:3542455, mRNA, partial cds
133228 AI492924 Hs.6831 golgi phosphoprotein 1
133240 AK001489 Hs.242894 ADP-ribosylation factor-like 1
133254 AI567421 Hs.273330 Homo sapiens, clone IMAGE:3544662, mRNA, partial cds
133266 AI160873 Hs.69233 zinc finger protein
133268 AW956781 Hs.293937 ESTs, Weakly similar to FXD2_HUMAN FORKHEAD BOX PROTEIN D2 [H.sapiens]
133285 M76477 Hs.289082 GM2 ganglioside activator protein
133291 BE297855 Hs.69855 NRAS-related gene
133314 AA102670 Hs.70725 gamma-aminobutyric acid (GABA) A receptor, pi
133321 T79526 Hs.179516 integral type I protein
133327 AL390127 Hs.7104 Kruppel-like factor 13
133347 BE257758 Hs.71475 acid cluster protein 33
133360 AI016521 Hs.71816 v-akt murine thymoma viral oncogene homolog 1
133366 AA292811 Hs.72050 non-metastatic cells 5, protein expressed in (nucleoside-diphosphate kinase)
133367 AF231919 Hs.18759 KIAA0539 gene product
133370 AF245505 Hs.72157 DKFZP564I1922 protein
133383 BE313555 Hs.7252 KIAA1224 protein
133390 AI950382 Hs.72660 phosphatidylserine receptor
133391 AW103364 Hs.727 inhibin, beta A (activin A, activin AB alpha polypeptide)
133394 AA305127 Hs.237225 hypothetical protein HT023
133437 AL031591 Hs.7370 phosphotidylinositol transfer protein, beta
133452 NM_002759 Hs.274382 protein kinase, interferon-inducible double stranded RNA dependent
133453 AI659306 Hs.73826 protein tyrosine phosphatase, non-receptor type 4 (megakaryocyte)
133500 AW964804 Hs.74280 hypothetical protein FLJ22237
133529 W45623 Hs.74571 ADP-ribosylation factor 1
133540 AL037159 Hs.74619 proteasome (prosome, macropain) 26S subunit, non-ATPase, 2
133543 AU077073 Hs.108327 damage-specific DNA binding protein 1 (127kD)
133578 AU077050 Hs.75066 translin
133579 X75346 Hs.75074 mitogen-activated protein kinase-activated protein kinase 2
133582 BE391579 Hs.75087 Fas-activated serine/threonine kinase
133594 AW160781 Hs.172589 nuclear phosphoprotein similar to S. cerevisiae PWP1
133595 AA393273 Hs.75133 transcription factor 6-like 1 (mitochondrial transcription factor 1-like)
133599 NM_002885 Hs.75151 RAP1, GTPase activating protein 1
133621 NM_004893 Hs.75258 H2A histone family, member Y
133627 NM_002047 Hs.75280 glycyl-tRNA synthetase
133631 NM_000401 Hs.75334 exostoses (multiple) 2
133649 U25849 Hs.75393 acid phosphatase 1, soluble
133690 AV661185 Hs.75574 mitochondrial ribosomal protein L19
133720 L27841 Hs.75737 pericentriolar material 1
133722 AW969976 Hs.279009 matrix Gla protein
133751 AW402048 Hs.334787 Homo sapiens, Similar to likely ortholog of yeast ARV1, clone IMAGE:3506392, mRNA
133757 T52946 Hs.196209 RAE1 (RNA export 1, S.pombe) homolog
133760 BE271766 Hs.181357 laminin receptor 1 (67kD, ribosomal protein SA)
133765 M52194 Hs.75929 cadherin 11, type 2, OB-cadherin (osteoblast) 1.5

133780 AA557660 Hs.76152 decorin 3.5 

133784 BE622743 Hs.301064 arfaptin 1 6.8
133791 M34338 Hs.76244 spermidine synthase 16
133797 AL133921 Hs.76272 retinoblastoma-binding protein 2 1.4
133622 050525 Hs.699 peptidylprolyl isomerase B (cyclophilin B) 8.0
133842 AW797468 Hs.285013 putative human HLA class II associated protein I 135
133845 AA147026 Hs.76704 ESTs 22
133850 W29092 Hs.7678 cellular retinoic acid-binding protein 1 1.8
133859 U86782 Hs.178761 26S proteasome-associated pad1 homolog 2.0
133865 AB011155 Hs.170290 discs, large (Drosophila) homolog 5 2.8
133867 AW340125 Hs.76989 KIAA0097 gene product 6.7
133868 AB012193 Hs.183874 cullin 4A 2.5
133881 U30872 Hs.77204 centromere protein F (350/400kD, mitosin) 3.0
133922 U30825 Hs.77608 splicing factor, arginine/serine-rich 9 1.4
133924 D86326 Hs.325948 vesicle docking protein p115 5.4
133929 NM_006306 Hs.211602 SMC1 (structural maintenance of chromosomes 1, yeast)-like 1 4.9
133936 L17128 Hs.77719 gamma-glutamyl carboxylase 3.7
133941 BE244332 Hs.77770 adaptor-related protein complex 3, mu 2 subunit 12.1
133959 X81789 Hs.77897 splicing factor 3a, subunit 3, 60kD 9.7
133976 AI908165 Hs.169946 GATA-binding protein 3 (T-cell receptor gene activator) 3.1
133989 AL040328 Hs.78202 SWI/SNF related, matrix associated, actin dependent regulator of chromatin 1.3
133997 AI824113 Hs.78281 regulator of G-protein signalling 12 9.7
134010 AB016092 Hs.197114 RNA binding protein; AT-rich element binding factor 2.4
134015 D31764 Hs.278569 sorting nexin 17 2.5
134070 NM_003590 Hs.78946 cullin 3 1.3

134110 U41060 Hs.79136 UV-1 protein, estrogen regulated 4.2 

134129 NM_014742 Hs.79305 KIAA0255 gene product 2.2
134134 H86504 Hs.173328 protein phosphatase 2, regulatory subunit B (B56), epsilon isoform 5.0
134200 BE559598 Hs.197803 KIAA0160 protein 3.2
134206 AF107463 Hs.79968 splicing factor 30, survival of motor neuron-related 2.5
134208 NM_000288 Hs.79993 peroxisomal biogenesis factor 7 2.1
134219 NM_000402 Hs.80206 glucose-6-phosphate dehydrogenase 9.1
134234 BE300078 Hs.80449 Homo sapiens, clone IMAGE:3535294, mRNA, partial cds 2.8
134275 AI878910 Hs.3688 cisplatin resistance-associated overexpressed protein 1.8
134292 AI906291 Hs.81234 immunoglobulin superfamily, member 3 2.0
134301 AW502505 Hs.81360 Homo sapiens cDNA: FLJ21927 fis, clone HEP04178, highly similar to HSU90909 2.5
134305 U61397 Hs.81424 ubiquitin-like 1 (sentrin) 2.8
134324 AB029023 Hs.179946 KIAA1100 protein 10.4
134326 AW903838 Hs.81800 chondroitin sulfate proteoglycan 2 (versican) 1.9
134329 N92036 Hs.81848 RAD21 (S. pombe) homolog 2.6
134337 NM_004922 Hs.81964 SEC24 (S. cerevisiae) related gene family, member C 2.3
134348 AW291946 Hs.82065 interleukin 6 signal transducer (gp130, oncostatin M receptor) 13.0

45 134376 X0656Q 449 Hs!82396 2^!i!^^ 15 

134379 AW362124 Hs.323193 hypothetical protein MGC3222 8.1
134384 AI589941 Hs.8254 Homo sapiens, Similar to tumor differentially expressed 1, clone IMAGE:3639252, mRNA, partial cds 2.6
134391 AA417383 Hs.82582 integrin, beta-like 1 (with EGF-like repeat domains) 4.1
134395 AA456539 Hs.8262 lysosomal 1.7
134403 AA334551 Hs.82767 sperm specific antigen 2 2.6
134405 AW067903 Hs.82772 collagen, type XI, alpha 1 1.3
134411 BE272095 Hs.167791 reticulocalbin 1, EF-hand calcium binding domain 3.2
134415 AI750762 Hs.82911 protein tyrosine phosphatase type IVA, member 2 1.9
134421 AU077196 Hs.82985 collagen, type V, alpha 2 10.3
134424 Z44190 Hs.83023 peroxisomal biogenesis factor 11B 2.4
134446 M1 12036 Hs.83419 KIAA0252 protein 1.2
134447 M58603 Hs.83428 nuclear factor of kappa light polypeptide gene enhancer in B-cells 1 (p105) 1.6
134470 X54942 Hs.83758 CDC28 protein kinase 2 2.1
134480 NM_005000 Hs.83916 Empirically selected from AFFX single probeset 5.3
134485 X82153 Hs.83942 cathepsin K (pycnodysostosis) 2.5
134498 AW246273 Hs.84131 threonyl-tRNA synthetase 2.1
134513 AA425473 Hs.84429 KIAA0971 protein 3.8
134516 AK001571 Hs.273357 hypothetical protein FLJ10709 2.4
134520 BE091005 Hs.74861 activated RNA polymerase II transcription cofactor 4 6.7
134529 AW411479 Hs.848 FK506-binding protein 4 (59kD) 23

134577 BE244323 Hs.85951 exportin, tRNA (nuclear export receptor for tRNAs) 5.5 

134582 AA927177 Hs.86041 CGG triplet repeat binding protein 1 5.8 
134612 AW068223 Hs.171581 ubiquitin C-terminal hydrolase UCH37
134624 AF035119 Hs.8700 deleted in liver cancer 1
134632 X78520 Hs.174139 chloride channel 3
134654 AK001741 Hs.8739 hypothetical protein FLJ10879
134664 AA256106 Hs.87507 ESTs
134666 BE391929 Hs.8752 transmembrane protein 4
134687 U62317 Hs.88251 arylsulfatase A
134692 NM_003474 Hs.8850 a disintegrin and metalloproteinase domain 12 (meltrin alpha)
134705 BE161887 Hs.88799 anaphase-promoting complex subunit 10
134714 Y14768 Hs.890
134719 AA852985 Hs.89232 chromobox homolog 5 (Drosophila HP1 alpha)
134722 AF129536 Hs.284226 F-box only protein 6
134724 AF045239 Hs.321576 ring finger protein 22
134746 X07871 Hs.89476 CD2 antigen (p50), sheep red blood cell receptor
134751 AW630803 Hs.89497 lamin B1
134790 BE002798 Hs.287850 integral membrane protein 1
134806 AD001528 Hs.89718 spermine synthase
134834 AW451370 Hs.8991 adaptor-related protein complex 1, gamma 2 subunit
134850 AI701162 Hs.90207 hypothetical protein MGC11138
134853 BE268326 Hs.90280 5-aminoimidazole-4-carboxamide ribonucleotide formyltransferase/IMP cyclohydrolase
134859 D26488 Hs.90315 KIAA0007 protein
134880 AI879195 Hs.90606 15 kDa selenoprotein
134910 AA532963 Hs.9100 Homo sapiens cDNA FLJ13100 fis, clone NT2RP3002255
134925 AW885909 Hs.6975 PRO1073 protein
134955 AW401361 Hs.91773 protein phosphatase 2 (formerly 2A), catalytic subunit, alpha isoform
134971 AI097346 Hs.286049 phosphoserine aminotransferase
134975 R50333 Hs.92186 Leman coiled-coil protein
135011 AB037835 Hs.92991 KIAA1414 protein
135022 NM_000408 Hs.93201 glycerol-3-phosphate dehydrogenase 2 (mitochondrial)
135032 AW301984 Hs.173685 hypothetical protein FLJ12619
135077 AW503733 Hs.9414 KIAA1488 protein
135083 AB036063 Hs.94262 p53-inducible ribonucleotide reductase small subunit 2 homolog
135095 AF027219 Hs.9443 zinc finger protein 202
135096 AA081258 Hs.132390 zinc finger protein 36 (KOX 18)
135153 AI093155 Hs.95420 JM27 protein
135181 BE250865 Hs.279529 px19-like protein
135199 AA477514 Hs.96247 translin-associated factor X
135207 N26427 Hs.9634 ESTs, Highly similar to C10_HUMAN PUTATIVE C10 PROTEIN [H.sapiens]
135214 T78802 Hs.96560 hypothetical protein FLJ11656
135243 BE463721 Hs.97101 putative G protein-coupled receptor
135245 AI028767 Hs.262603 ESTs
135257 AW291023 Hs.97255 ESTs, Weakly similar to A46010 X-linked retinopathy protein [H.sapiens]
135263 AI088775 Hs.55498 geranylgeranyl diphosphate synthase 1
135274 AA448460 Hs.112017 GE36 gene
135294 AA150320 Hs.9800 protein kinase Njmu-R1
135295 AI090838 Hs.98006 ESTs
135307 AI743770 Hs.98368 ESTs, Weakly similar to KIAA0822 protein [H.sapiens]
135321 AI652069 Hs.98614 ribosome binding protein 1 (dog 180kD homolog)
135354 AA456454 Hs.183418 cell division cycle 2-like 1 (PITSLRE proteins)
135361 AA373452 Hs.167700 Homo sapiens cDNA FLJ10174 fis, clone HEMBA1003959
135389 U05237 Hs.99872 fetal Alzheimer antigen
135400 X78592 Hs.99915 androgen receptor (dihydrotestosterone receptor; testicular feminization; spinal and bulbar

134975 R50333 Hs.92186 Leman coiled-coil protein
135011 AB037835 Hs.92991 KIAA1414 protein
135022 NM_000408 Hs.93201 glycerol-3-phosphate dehydrogenase 2 (mi
135032 AW301984 Hs.173685 hypothetical protein FLJ12619
135077 AW503733 Hs.9414 KIAA1488 protein
135083 AB036063 Hs.94262 p53-inducible ribonucleotide reductase s
135095 AF027219 Hs.9443 zinc finger protein 202
135096 AA081258 Hs.132390 zinc finger protein 36 (KOX 18)
135153 AI093155 Hs.95420 JM27 protein
135181 BE250865 Hs.279529 px19-like protein
135199 AA477514 Hs.96247 translin-associated factor X
135207 N26427 Hs.9634 ESTs, Highly similar to C10_HUMAN PUTATI
135214 T78802 Hs.96560 hypothetical protein FLJ11656
135243 BE463721 Hs.97101 putative G protein-coupled receptor
135245 AI028767 Hs.262603 ESTs
135257 AW291023 Hs.97255 ESTs, Weakly similar to A46010 X-linked
135263 AI088775 Hs.55498 geranylgeranyl diphosphate synthase 1
135274 AA448460 Hs.112017 GE36 gene
135294 AA150320 Hs.9800 protein kinase Njmu-R1
135295 AI090838 Hs.98006 ESTs
135307 AI743770 Hs.98368 ESTs, Weakly similar to KIAA0822 protein
135321 AI652069 Hs.98614 ribosome binding protein 1 (dog 180kD ho
135354 AA456454 Hs.183418 cell division cycle 2-like 1 (PITSLRE pr
135361 AA373452 Hs.167700 Homo sapiens cDNA FLJ10174 fis, clone HE
135389 U05237 Hs.99872 fetal Alzheimer antigen
135400 X78592 Hs.99915 androgen receptor (dihydrotestosterone r
302256 AA857131 Hs.171595 HIV TAT specific factor 1
302276 AW057736 Hs.323910 HER2 receptor tyrosine kinase (c-erb-b2

303135 AW592789 Hs.279474 HSPC070 protein 

303686 AK000714 Hs.109441 MSTP033 protein 

310085 R43191 Hs.101248 Homo sapiens clone IMAGE:32553, mRNA seq 

315518 AA808229 Hs.167771 ESTs 

317781 NM_007057Hs.42650 ZW10 interactor 

320836 AI268997 Hs.197289 rab3 GTPasesactivating protein, ncrwata 

321114 AA902256 Hs,78979 Golgi apparatus protein 1 

322221 N24235 Hs.179662 nucleosome assembly protein 1-like 1 

322474 AF1 18083 Hs.29494 PR01912 protein 

322556 BE041451 Hs.177507 hypothetical protein 

323541 AF292100 Hs.104613 RP42homolog 

407827 BE278431 Hs.40323 BUB3 (budding uninhibited by benzimidazo 

408196 AL034548 Hs.43627 SRY (sex determining region Y)-box 22 

408813 AI580090 Hs.48295 RNA heltease family 

409176 R73727 Hs.101617 ESTs, Weakly similar to T32527 hypotheti 

413670 AB000115 Hs.75470 hypoflietical protein, expressed in osteo 

414108 AI267592 Hs.75761 SFRS protein kinase 1 

414846 AW304454 Hs.77495 UBXdomairKontaining 1 

416980 AA381133 Hs.80684 hlgrwnobility group (nonhistone chromoso 

417378 R57256 Hs.82037 TATA box binding protein (TBP)-associate 

418283 S79895 Hs.83942 calhepsin K (pycnodysostosis) 

418467 NMJW5910HS.85273 retinoblastoma-binding protein 6 

420269 U72937 Hs.96264 alpha thalassemia/mental retardation syn 

420802 U22376 Hs.1334 v-myb avian myeloblastosis viral oncogen 

421225 AA463798 Hs.102696 MCT-1 proteh 

421642 AF172066 Hs.106346 retinoic acid repressible protein 

421828 AW891965 Hs.279789 histone deacetylase 3 

421983 AI252640 Hs.110364 peptidylprolyl isomerase C (cydophilin 

422052 AA302744 Hs.t04518 ESTs 

422055 NMJ)14320Hs.1 11029 putative heme-blnding protein 

423750 AF165883 Hs.298229 prefokfin2 

424001 W67883 Hs.137476 paternally expressed 10 (PEG10; WAA105 

425182 AF041259 Hs.155040 zinc finger protein 217 

425284 AF155568 Hs.155489 NS1 -associated protein 1 

426372 BE304680 Hs.169531 OEAO/H (Asp-Gtu-Ala-Asp/His) box polypep 

428049 AW183765 Hs.182238 GW128 protein 

428477 AW500533 Hs,11482 splicing factor, arginine/serine-rich 11 

437562 AB001636 Hs.5683 DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 

438449 AK001333 Hs.6216 Homo sapiens hepatocellular cardnoma-as 

441560 F13386 Hs.7888 Homo sapiens done 23736 mRNA sequence 

445580 AF167572 Hs.12912 skbl (S. pombe) homolog 

446999 M151520 Hs.334822 hypothetical protein MGC4485 

447111 AI017574 Hs.17409 cysteine-rich protein 1 (intestinal} 

447778 BE620592 Hs.71190 ESTs, Weakly similar to S1 6506 hypothefl 

448873 NMJJ03677HS.22393 density-regulated protein 

449687 W68520 Hs.331328 Intermediate Sament protein syncoilin 

450701 H39960 Hs.288467 Homo saptens cONA FL11 2280 fis, clone MA 

450703 AA011202 Hs.184771 nudear factor l/C (CCAAT-binding transc 

452461 N78223 Hs.108106 transcription factor 

452511 BE408178 Hs.285165 Homo sapiens cONA FU20845 fis, clone AD 

453157 AF077036 Hs.31989 DKFZP586G1722 protein 

453658 BE541906 Hs.87819 Homo sapiens, clone MG&2492, mRNA, comp 

100833 AF135168 Hs.108802 r^thytrr^eknide-sensrGve factor 

102481 U50360 gb:Human calcium, calmodulin-dependent p
102827 BE244588 Hs.6456 chaperonin containing TCP1, subunit 2 (b 7.9
103549 BE270465 Hs.78793 protein kinase C, zeta 2.0
104331 AB040450 Hs.279862 cdk inhibitor p21 binding protein 5.3
110018 AW579842 Hs.104557 hypothetical protein FLJ10697 2.0
115008 AK001827 Hs.87889 helicase-moi 5.7
119075 M10905 Hs.287820 fibronectin 1 U
119615 AL034423 Hs.75875 ubiquitin-conjugating enzyme E2 variant 2.9
125006 BE065136 Hs.145696 splicing factor (CC1.3) 1.7
127609 X80031 Hs.530 collagen, type IV, alpha 3 (Goodpasture 2.4
129209 R62676 Hs.17820 Rho-associated, coiled-coil containing p 5.2
129917 M30773 Hs.278540 protein phosphatase 3 (formerly 2B), reg 4.5
130182 BE267033 Hs.192853 ubiquitin-conjugating enzyme E2G 2 (homo 11.0
130365 W56119 Hs.155103 eukaryotic translation initiation factor 3.3
131135 NM_016569 Hs.267182 TBX3-iso protein 1.3
131853 AI681917 Hs.3321 ESTs, Highly similar to IRX1_HUMAN IROQU 3.2
131881 AW361018 Hs.3383 upstream regulatory element binding prot 14.3
132726 N52298 Hs.55608 hypothetical protein MGC955 3.0
135193 X95525 Hs.96103 TATA box binding protein (TBP)-associate 2.7
409487 H19886 gb:yn57a05.r1 Soares adult brain N2b5H 2.3
416040 AW819158 Hs.289044 Homo sapiens cDNA FLJ12048 fis, clone HE 7.4

TABLE 4A

Table 4A shows the accession numbers for those pkeys lacking UnigeneIDs for Table 4. For
each probeset, we have listed the gene cluster number from which the oligonucleotides were
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and
mRNAs. These sequences were clustered based on sequence similarity using Clustering and
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for
sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers
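The row layout described by this legend (a Pkey, then a CAT number, then one or more Genbank accessions) can be read mechanically. The following is a minimal sketch, not part of the original disclosure: the names `ClusterRow`, `parse_cluster_row` and `index_by_pkey` are hypothetical, and the two sample rows are transcribed from the table with OCR damage repaired by assumption (for example, a trailing "J" in a CAT number read as "_1").

```python
from typing import Dict, Iterable, NamedTuple, Tuple


class ClusterRow(NamedTuple):
    """One row of a Pkey / CAT number / Accessions table (hypothetical type)."""
    pkey: int
    cat_number: str
    accessions: Tuple[str, ...]


def parse_cluster_row(line: str) -> ClusterRow:
    """Split a whitespace-separated row into its three logical columns."""
    fields = line.split()
    if len(fields) < 3:
        raise ValueError(f"expected Pkey, CAT number and accessions: {line!r}")
    # First field is the probeset key, second the cluster id,
    # everything after that is the list of member accessions.
    return ClusterRow(int(fields[0]), fields[1], tuple(fields[2:]))


def index_by_pkey(lines: Iterable[str]) -> Dict[int, ClusterRow]:
    """Build a lookup from Pkey to its parsed row."""
    return {row.pkey: row for row in map(parse_cluster_row, lines)}


# Sample rows as they are assumed to read once OCR errors are repaired.
sample_rows = [
    "102481 31281_-28 U50360",
    "409487 1134778_1 H19886 AW402806 T10231",
]
table = index_by_pkey(sample_rows)
```

With such an index, `table[409487].accessions` gives the accessions recorded for that probeset's cluster. Continuation lines of long accession lists would need to be joined to their row before parsing; that step is left out of this sketch.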

Pkey CAT number Accessions

123615 30686_-15 AA609170
123619 371681_1 AA602964 AA609200
101445 1650_-5 M21259
124385 656394_1 AI267847 N27351
124417 1642364_1 N34059 N46979
124482 1657509_1 N53935 N53950
102481 31281_-28 U50360
103349 11052_-2 X89059
110856 19346_14 AA992380 N33063 N21418 H79958 R21911 H79957
103797 109699_1 AA080912 AA075318 AA083403 AA076594 AA078992 AA084926 AA081881 AA113913 AA113892
AA083821 AA134801 AA082953 AA070343 AA062835 AA075419 AA063293 AA071252 AA078900 AA062836 AW974305
120280 160212_1 AA190577 AA181657
113248 328626_1 T63857 AW971220 AA493469 T63699
120472 44573_2 AI950087 N70208 R97040 N36809 AI308119 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671
AA908598 AA251875 AI820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517
AI219788 AA884444 N92578 F13493 AA927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282 AA488964
AA283144 AI890387 AI950344 AI741346 AI689062 AA282915 AW102898 AI872193 AI763273 AW173586 AW150329 AI653832
AI762688 AA988777 AA488892 AI356394 AW103813 AI539642 AA642789 AA856975 AW505512 AI961530 AW629970
BE612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 AW961101 AA251669 AA251874
AI819225 AW205862 AI683338 AI858509 AW276905 AI633006 AA972584 AA908741 AW072629 AW513996 AA293273
AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 AW613456 AI373032 AI564269
F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 F04005
129019 44573_2 AI950087 N70208 R97040 N36809 AI308119 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671
AA908598 AA251875 AI820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517
AI219788 AA884444 N92578 F13493 AA927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282 AA488964
AA283144 AI890367 AI950344 AI741346 AI669062 AA282915 AW102898 AI872193 AI763273 AW173586 AW150329 AI653832
AI762688 AA988777 AA488892 AI356394 AW103813 AI539642 AA642789 AA856975 AW505512 AI961530 AW629970
BE612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 AW961101 AA251669 AA251874
AI819225 AW205862 AI683338 AI858509 AW276905 AI633006 AA972584 AA908741 AW072629 AW513996 AA293273
AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 AW613456 AI373032 AI564269
F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 F04005
120695 9683_3 AA976503 AI917802 AA953664 AA404613 AA428771 BE280542 AW194691 AI927301 AI740458 AI796100 AI935603 AW052210
AA970201 AI633384 AA425910 AI017004 AI241295 AA402816 AA291468
122188 275673_1 AA398838 AA435847
121581 283769_1 AA416568 AA442889 AA417233 AA442223
122618 305217_1 AA453641 AA454061
109026 150431_1 AA157811 AA836869
123658 genbank_AA609364 AA609364
123811 genbank_AA620586 AA620586
125115 genbank_T97341 T97341
125147 NOT_FOUND_entrez_W38150 W38150
118737 382979_1 AA199686 N73861
120274 genbank_AA177051 AA177051
113196 genbank_T57317 T57317
120504 genbank_AA256837 AA256837
120639 genbank_AA286942 AA286942
120809 genbank_AA346495 AA346495
113702 genbank_T97307 T97307
129680 23162_1 U03749 NM_001275 J03483 J03915 AI214509 AW245744 AL046455 AA318960 AI741505 AA843875 AI829382 AI560122
AI858999 D55958 AI684005 D53170 AA854091 AI025609 D53119 D54729 D55504 D55377 D55313 AW512244 AA846441
AW043696 AI969102 AA405741 AI091983 AA788784 AA706586 AA854361 AW470949 AA843095 AA772028 AI148432
AI038109 AA782478 AA910064 AI220384 AA781296 AA843881 AA854064 AA843125 AA843419 AA319036 AA319054
AI273831 W32275 AI584185 C05724 AA789023 AI686818 D54392 AI022485 AA431410 AA854232 W39212 W15214 AA894441
AI803081 AI167381 AW245389 AA319430 AA335156 AI042646 AA327030 AA725170 T27943 AA889304 AA976699 AI687001
AI621107 AI865540 AA772107 G06286 AA319661 AA405992
101045 entrez_J05614 J05614
117247 genbank_N21032 N21032
110501 genbank_H55748 H55748
103392 entrez_X94563 X94563
105032 genbank_AA127818 AA127818
119513 NOT_FOUND_entrez_W37933 W37933
105445 genbank_AA252395 AA252395
121514 genbank_AA412112 AA412112
121558 genbank_AA412497 AA412497
121911 genbank_AA427950 AA427950
123315 714071_1 AA496369 AA496646
114911 genbank_AA236672 AA236672
409487 1134778_1 H19886 AW402806 T10231

TABLE 5: Figure 5 from BRCA 001 US
Table 5 shows genes upregulated in tumor tissue compared to normal breast tissue.

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of tumor to normal breast tissue
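Where a row of this table survives on a single line, its five columns can be separated as in the sketch below. This is an illustration under stated assumptions rather than anything specified by the source: the names `ExpressionRow`, `parse_row` and `at_least` are hypothetical, the pattern assumes that a UnigeneID (when present) has the form `Hs.<digits>` and that the final numeric token is R1, and the sample rows are transcribed with OCR damage repaired by assumption.

```python
import re
from typing import List, NamedTuple, Optional


class ExpressionRow(NamedTuple):
    """One Pkey / ExAccn / UnigeneID / Unigene Title / R1 row (hypothetical type)."""
    pkey: int
    ex_accn: str
    unigene_id: Optional[str]  # None for pkeys lacking a UnigeneID
    title: str
    r1: float                  # ratio of tumor to normal breast tissue


# Assumed layout: integer Pkey, accession, optional "Hs.<n>", free-text title,
# and a trailing number (R1). The title is matched lazily so that the last
# numeric token on the line is taken as R1.
_ROW = re.compile(
    r"^(?P<pkey>\d+)\s+(?P<accn>\S+)\s+"
    r"(?:(?P<unigene>Hs\.\d+)\s+)?"
    r"(?P<title>.+?)\s+(?P<r1>\d+(?:\.\d+)?)$"
)


def parse_row(line: str) -> ExpressionRow:
    match = _ROW.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized row: {line!r}")
    return ExpressionRow(
        pkey=int(match["pkey"]),
        ex_accn=match["accn"],
        unigene_id=match["unigene"],
        title=match["title"],
        r1=float(match["r1"]),
    )


def at_least(rows: List[ExpressionRow], threshold: float) -> List[ExpressionRow]:
    """Keep rows whose tumor-to-normal ratio meets the threshold."""
    return [row for row in rows if row.r1 >= threshold]


sample = [
    "133391 AW103364 Hs.727 inhibin, beta A (activin A, activin AB a 25.5",
    "129680 U03749 gb:Human chromogranin A (CHGA) gene, pro 14.1",
    "133865 AB011155 Hs.170290 discs, large (Drosophila) homolog 5 5",
]
parsed = [parse_row(line) for line in sample]
strongly_up = at_least(parsed, 10.0)
```

The third sample row shows why the last token is treated as R1: the title itself ends in a number ("homolog 5"), and only the final "5" is the ratio. Rows whose ratio was damaged in extraction (for example a letter in place of a digit) are rejected by `parse_row` rather than guessed.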



Pkey ExAccn UnigeneID UnigeneTitle

100114 X02308 Hs.82962
100147 D13666 Hs.138348
100154 H60720 Hs.81892
100335 AW247529 Hs.6793
100666 L05424 Hs.169610
100667 L05424 Hs.169610
100668 L05424 Hs.169610
100678 AW502935 Hs.740
100988 AK000405 Hs.76480
101031 J05070 Hs.151738
101045 J05614
101332 J04088 Hs.156346
101352 AI494299 Hs.16297
101580 NM_012151 Hs.83363
101592 AF064853 Hs.91299
101767 M81057 Hs.180884
101806 AA586894 Hs.112408
101810 NM_000318 Hs.180612
101983 AI904232 Hs.75323
102107 BE258602 Hs.182366
102165 BE313280 Hs.159627
102198 AW950852 Hs.74598
102217 AA829978 Hs.301613
102220 U24389 Hs.65436
102302 AA306342 Hs.69171
102348 U37519 Hs.87539
102374 U33635 Hs.90572
102455 U48705 Hs.75562
102568 W81489 Hs.223025
102618 AU037672 Hs.81071
102687 NM_007019 Hs.93002
102689 U96132 Hs.171280
102704 AU077058 Hs.54089
102705 T97490 Hs.50002
102801 BE252241 Hs.38041
102827 BE244588 Hs.6456
103060 NM_005940 Hs.155324
103080 AU077231 Hs.82932
103178 AA205475 Hs.275865
103206 X72755 Hs.77367
103238 AI369285 Hs.75189
103547 AI376722 Hs.180062
103549 BE270465 Hs.78793
103886 AK001278 Hs.105737
104325 BE379766 Hs.150675
104827 AW052006 Hs.8551
104846 AI250789 Hs.32478
104854 AA041276 Hs.154729
104867 AA278898 Hs.225979



osteoblast specific factor 2 (fasciclin
KIAA0101 gene product



CD44 antigen (homing function and Indian
CD44 antigen (homing function and Indian
CD44 antigen (homing function and Indian
PTK2 protein tyrosine kinase 2
ubiquitin-like 4

matrix metalloproteinase 9 (gelatinase B
gb:Human proliferating cell nuclear anti
topoisomerase (DNA) II alpha (170kD)
COX17 (yeast) homolog, cytochrome c oxid
coagulation factor VIII-associated (intr
guanine nucleotide binding protein (
carboxypeptidase B1 (tissue)
S100 calcium-binding protein A7 (psorias
peroxisomal membrane protein 3 (35kD, Ze
prohibitin

heat shock protein 75
death associated protein 3
polymerase (DNA directed), delta 2, regu
JTV1 gene



protein kinase C-like 2
aldehyde dehydrogenase 3 family, member
PTK7 protein tyrosine kinase 7
discoidin domain receptor family, member
RAB31, member RAS oncogene family
extracellular matrix protein 1
ubiquitin carrier protein E2-C
hydroxyacyl-Coenzyme A dehydrogenase, ty
BRCA1 associated RING domain 1
small inducible cytokine subfamily A (Cy
pyridoxal (pyridoxine, vitamin B6) kinas
chaperonin containing TCP1, subunit 2 (b
matrix metalloproteinase 11 (MMP11; stro
cyclin D1 (PRAD1: parathyroid adenomatos
ribosomal protein S18
monokine induced by gamma interferon



proteasome (prosome, macropain) subunit,
protein kinase C, zeta
hypothetical protein FLJ10416 similar to
polymerase (RNA) II (DNA directed) polyp
PRP4/STK/WD splicing factor
ESTs

3-phosphoinositide dependent protein kin
hypothetical protein similar to small G



R1 

Z9 
7.5 
9.2 
2.7 
5.7 
9 

7.6 

53.2 

11.4 

8.2 

5 

3.4 

6.3 

5.7 

5.6 

14.4 

8.9 

3.2 

8.4 

1.4 

4.6 

43 

6.7 

4.3 

2.7 

2 

6.2 
6.9 
5.3 
5.8 
4.3 
6 

1.9 

2.3 

6.4 

5.6 

45 

3.1 

95 

8.8 

5.6 

9.7 

7.9 

63 

6.3 

105 

5.6 

12.3 

2
104909
104916 
104919 
104974 
104978 
105012 
105039 
105079 
105088 
105393 
105547 
105564 
105658 
105746 
105858 
105930 
106094 
106350 
106359 
106610 
106624 
106713 
106829 
106846 
106873 
106973 
107029 
107197 
107859 
107901 
107922 
107974 
108274 
108647 



109011 
109068 
109273 
109468 
110240 
110330 
110501 
110742 
110762 
110856 
110958 
111125 
111179 
111239 
111285 
111392 
111937 
112244 
112995 
113777 
113791 
113811 
113834 
113868 
113870 
113923 
114275 
114895 
114965 
115061 



AW015318 Hs.23165 
AW408164 Hs.249184 
AW958157 Hs.155489 
AA026880 Hs.25252 
Y12059 Hs.278675 
AI199268 Hs.19322 
AF098158 Hs.9329 
AA907305 Hs.36475 
AA151342 Hs.12677 
H58589 Hs.35156 
AF167570 Hs.256583 
AA262640 Hs.27445 
BE616694 Hs.288042 
AA985190 Hs.246875 
AW151952 Hs.46679 
AF151066 Hs.281428 
AF016371 Hs.9880 
AA533491 Hs.23317 
AK001404 Hs.194698 
AW390282 Hs.31130 
AA458882 Hs.79732 
NM_003595 Hs.26350
BE614802 Hs.184352 
AW959893 Hs.27099 
AB037744 Hs.34892 
N49809 Hs.11197 
BE156256 Hs.11923 
AF264750 Hs.288971 
W15477 Hs.64639 
AW732573 Hs.47584
L42612 Hs.335952 
BE153855 Hs.61460 
AW956103 Hs.61712 
AF129535 Hs.272027 
BE546947 Hs.44276 
AB029000 Hs.70823 
AK001431 Hs.5105 
AA156542 Hs.72127 
AA164293 Hs.72545 
AA375752 Hs.82719 
NM_015310 Hs.6763
AI668594 Hs.176588 
AI288666 Hs.16621 
H55748 

AW190338 Hs.28029 
BE044245 Hs.30011 
AA992380 

NM_005864 Hs.24587
N63823 Hs.269115
AK000136 Hs.10760
N90956 Hs.17230
AA778711 Hs.4310 
W46342 Hs.325081 
BE298665 Hs.14846 
AB029000 Hs.70823 
AA737033 Hs.7155
BE266947 Hs.10590 
AI269096 Hs.135578 
BE207460 Hs.6994 
T26483 Hs.6059 
W57902 Hs.90744 
AL079314 Hs.16537 
AW953484 Hs.3849 
AW515443 Hs.306117 
AA236177 Hs.76591 
AI733881 Hs.72472 
AI751438 Hs.41271 



ESTs

transcription factor 19 (SC1)
NS1-associated protein 1
prolactin receptor
bromodomain-containing 4

17.7
5

1.7
1.4
1.4

Homo sapiens, Similar to RIKEN cDNA 2010 72
chromosome 20 open reading frame 1 3.3
ESTs 2.5
CGM47 protein 9.5
Homo sapiens cDNA FLJ11027 fis, clone PL 22
interleukin enhancer binding factor 3, 9 5.4
unknown 9.3
hypothetical protein FLJ14299 1.4
hypothetical protein FLJ20059 9.4
hypothetical protein FLJ20739 1.5
hypothetical protein 2.9
peptidyl prolyl isomerase H (cyclophilin 5.2
hypothetical protein FLJ14681 6.8
cyclin B2 5.7
transmembrane 7 superfamily member 2 6.3
fibulin 1 7.9
tyrosylprotein sulfotransferase 2 7.7
hypothetical protein FLJ12549 4.5
hypothetical protein FLJ23293 similar to 16.2
KIAA1323 protein 2.2
Homo sapiens, clone IMAGE:3343149, mRNA, 16.8

hypothetical protein
myeloid/lymphoid or mixed-lineage leukem

6.6
1.8
6.1
8.4
2.5
2.2
6.7
7.1
9.8
72
4

1.4

2.9

potassium voltage-gated channel, delayed
keratin 6B

Ig superfamily receptor LNIR
pyruvate dehydrogenase kinase, isoenzyme
F-box only protein 5
homeo box C10
KIAA1077 protein
hypothetical protein FLJ10569
ESTs
ESTs

Homo sapiens mRNA; cDNA DKFZp586F1822 (f 2.9
KIAA0942 protein 3.2
ESTs, Weakly similar to CP4Y_HUMAN CYTOC 4.2
DKFZP434I116 protein 62
gb:yq94a01.s1 Soares fetal liver spleen 6.1
hypothetical protein MGC11256 7.6
hypothetical protein MGC2963 9.3
gb:ot37g06.s1 Soares_testis_NHT Homo sap 2.3
signal transduction protein (SH3 contain 6.7
ESTs, Moderately similar to Z195_HUMAN Z 3.6
asporin (LRR class 1) 7.1
hypothetical protein FLJ22087 7.9
eukaryotic translation initiation factor 6.9
Homo sapiens, clone IMAGE:3659680, mRNA, 8.4
Homo sapiens mRNA; cDNA DKFZp564D016 (fr 10.6
KIAA1077 protein 14.6
ESTs, Moderately similar to 2115357A TYK 5.6
zinc finger protein 313 13.4
chitobiase, di-N-acetyl- 1.3
Homo sapiens cDNA: FLJ22044 fis, clone H 3.1
EGF-containing fibulin-like extracellula 11.3
proteasome (prosome, macropain) 26S subu 2.7
hypothetical protein, similar to (U06944 6.1
hypothetical protein FLJ22041 similar to 1.9
KIAA0306 protein 15.8
KIAA0887 protein 7.1
BMP-R1B 2.3
Homo sapiens mRNA full length insert cDN 11.8

115278
115291 
115652 
115693 
115941 
115968 
116011 
116417 
116470 
116637 
117132 
117881 
118528 
119075 
119265 
119349 
119403 
119789 
120206 
120253 
120297 
120325 
120327 
120349 
120356 
120371 
120383 
120386 
120389 
120396 
120418 
120472 
120484 
120570 
120582 
120596 
120624 
120695 
120713 
120750 
120774
120807 
120809 
120984 
121081 
121408 
121505 
121508 
121513 
121549 
121558 
121655 
121744 
121748 
121773 
121832 
121839 
121882 
121911 
121999 
122013 
122036 
122356 
122371 
122372 
122460 
122490 



AK002163 
BE545072 
BE093589 
AF231023 
AI867451 
AB037753 
AL359053 
AW499664 
AI272141 
AK001043 



AF161470 
AI949952
M10905 
BE539706 
T65004 
AL117554 
BE393948 
H26735 
AA131376
AA191384
AA195651 
AK000292 
AW969481 
AF000545 
AA219305 
AL109963 
AW969665 
AW967985 
AA134006 
AW966893 
AI950087 
AA253170 
AA280679 
BE244830 
AA282074 
AW407987
AA976503 
AW449855 
AI191410
AI608909 
AA346385 
AA346495 
BE262951 
AA398721 
AA406137 
AA494172 
AA402515 
AA416653
AA412477 
AA412497 
AA421537 
AA398784 
BE536911 
AB033022 
AW340797 
AA425691 
AA426376 
AA427950
AA430211 
AA431085 
W92142 
AA443794 
AA868555 
AA446008 
AW418788 
AA448349 



1.5 

6.2 

10.6 

6.8 

5.5 

9.8 

2.4 

7.4 

2.1 

2.7 

5.2 

5.7 

7.4 

5.7 

1.4 

8.4 

6.7 

9.2 



Hs.301724 hypothetical protein FLJ11301
Hs.122579 hypothetical protein FLJ10461
Hs.38178 hypothetical protein FLJ23468
Hs.55173 cadherin, EGF LAG seven-pass G-type rece
Hs.46679 hypothetical protein FLJ20739
Hs.62767 KIAA1332 protein
Hs.57664 Homo sapiens mRNA full length insert cDN
Hs.12484 Human clone 23826 mRNA sequence
Hs.83484 SRY (sex determining region Y)-box 4
Hs.92033 integrin-linked kinase-associated serine
Hs.42315 plodding protein
Hs.260622 butyrate-induced transcript 1
Hs.49397 ESTs
Hs.287820 fibronectin 1
Hs.285363 ESTs
Hs.163561 ESTs

Hs.119908 nucleolar protein NOP5/NOP58
Hs.50915 kallikrein 5 (KLK5; KLK-L2; stratum corn
Hs.91668 Homo sapiens clone PP1498 unknown mRNA 45.7
Hs.326401 fibroblast growth factor 12B 38.9
Hs.104072 ESTs, Weakly similar to Z195_HUMAN ZINC 15.2
Hs.104106 ESTs

Hs.278732 hypothetical protein FLJ20285
Hs.55189 hypothetical protein
Hs.296433 putative purinergic receptor
Hs.104196 EST

Hs.123122 FSH primary response (LRPR1, rat) homolo
Hs.154848 hypothetical protein DKFZp434D0127
Hs.325572 ESTs, Moderately similar to ALU7_HUMAN A 21.7
Hs.79306 eukaryotic translation initiation factor 12.5
Hs.26613 Homo sapiens mRNA; cDNA DKFZp586F1323 (f 11.4
gb:wq05c02.x1 NCI_CGAP_Kid12 Homo sapien 19.4
EST 10.4
ESTs, Weakly similar to ALU1_HUMAN ALU S 14.4
ZNF13Mike protein 10.2
N-acetylglucosamine-phosphate mutase 7.5
M-phase phosphoprotein homolog 52
gb:oq30a04.s1 NCI_CGAP_GC4 Homo sapiens 46.8
Homo sapiens cDNA FLJ12727 fis, clone NT 5.9
ESTs, Moderately similar to 2109260A B c 7
ESTs 7.8
SH3-containing protein SH3GLB2; KIAA1848 6.8
gb:EST52657 Fetal heart II Homo sapiens 4.4
ESTs

ESTs, Highly similar to I37550 mismatch
EST
ESTs
ESTs
ESTs
EST

gb:zt95g12.s1 Soares_testis_NHT Homo sap 2.8
Homo sapiens mRNA; cDNA DKFZp434B1023 (f 7.8



6.4 

16.1 

163 

28.1 

12.4 

9.7 

32.6 



Hs.96473 
Hs.271445 
Hs.284228 
Hs.237323 
Hs.173518 

Hs.96557 
Hs.96693 
Hs.193985 
Hs.30002 

Hs.99052 
Hs.186749 
Hs.98019 
Hs.194417 
Hs.97887
Hs.181510 
Hs.98142 

Hs.178072 
Hs.97514 
Hs.234545 
Hs.158654 
Hs.98434 
Hs.191606 
Hs.98459 

Hs.98668 
Hs.98706 
Hs.271963 
Hs.98390 
Hs.178222 
HsJ36677 
Hs.99148 
Hs.238151 



5.6 
5.4 
6 

13.1 
28 
6.2 
7.4 



7.1 
19.5 
7.9 
5.8 
5 
5 



ESTs

hypothetical protein NUF2R
KIAA1196 protein
ESTs

ESTs, Highly similar to KIAA1048 protein
ESTs

gb:zw50f02^1 Soares_total_fetus_Nb2HF8_ 7.2
EST 6.4
ESTs 6.5
ESTs, Weakly similar to ALU5_HUMAN ALU S 13.1
ESTs 7.3
ESTs 5
EST 7.6
ESTs, Weakly similar to S43569 R01H10.6 97
EST 6.1

122492 AA448417 Hs.104990
122510 AA449232 Hs.99195
122530 AW959741 Hs.40368
122572 AA452601 Hs.99287
122607 AA453518 Hs.98023
122614 AA453630 Hs.99339
122616 AA453638 Hs.161873
122618 AA453641
122622 AA453987 Hs.144802
122717 AA456859 Hs.178358
122829 AW204530 Hs.99500
122838 AA460584 Hs.334386
122856 AI929374 Hs.75367
122868 AF005216 Hs.115541
122907 AA470074 Hs.169896
123016 AW338067 Hs.323231
123034 AL359571 Hs.44054
123136 AW451999 Hs.194024
123152 AW601773 Hs.270259
123394 AA731404 Hs.105510
123466 AA599042 Hs.112503
123486 BE019072 Hs.334802
123615 AA609170
123735 NM_013241 Hs.95231
123753 AA609955 Hs.234951
124006 AI147155 Hs.270016
124385 AI267847
124440 AA532519 Hs.129043
124656 AW297702 Hs.102915
124683 AA381661 Hs.119878
124735 R22952 Hs.268685
124761 AA374756 Hs.93560
124768 AW368528 Hs.100855
124788 R43543 Hs.100912
124811 R46068
124812 R47948
124822 AA418160 Hs.86043
124860 R65763 Hs.101477
124903 AW296713 Hs.221441
124930 AI076343 Hs.173939
124942 R99978
125051 T79956
125056 T81310
125101 AU72068
125115 T97341
125280 AI123705 Hs.106932
127274 AW966158 Hs.58582
128528 R39234 Hs.251699
128670 AA975486 Hs.103441
128691 W27939 Hs.103834
128772 BE302796 Hs.105097
128781 N71826 Hs.105465
128797 NM_002975 Hs.105927
128868 AA419008 Hs.106730
128891 F34856 Hs.292457
128946 Y13153 Hs.107318
128975 BE560779 Hs.284233
128995 AI816224 Hs.107747
129019 AI950087
129076 AW296806 Hs.326234
129088 AA744610 Hs.194431
129096 AA463189 Hs.288906
129198 N57532 Hs.109315
129347 BE614192 Hs.279869
129362 U30246 Hs.110736
129372 NM_016039 Hs.110803
129404 AI267700 Hs.317584



Hs.288912 
Hs.188732 



Hs.268892 
Hs.100588 
Hs.100592 
Hs.286236 



ESTs 5.4
ESTs 112
adaptor-related protein complex 1, sigma 10.1
EST 11
ESTs 615
EST 10.7
ESTs 107.3
gb:zx48e06.s1 Soares_testis_NHT Homo sap 31.1
ESTs 5.6
ESTs 85
ESTs 815
ESTs 755
Src-like-adapter 55
Janus kinase 2 (a protein tyrosine kinas 5.3
ESTs 115
Homo sapiens cDNA FLJ11946 fis, clone HE 2.8
ninein (GSK3B interacting protein) 8.7
ESTs 5.1
ESTs 5.2
ESTs 3.6
EST 7.4
Homo sapiens cDNA FLJ14680 fis, clone NT 2.4
gb:af12a12^1 Soares_testis_NHT Homo sap 75
FH1/FH2 domain-containing protein 10
huntingtin interacting protein E 30.6
ESTs 8.1
gb:aq49a10.x1 Stanley Frontal NB pool 2 57.1
Human DNA sequence from clone 989H11 on 75
ESTs 85
ESTs, Weakly similar to M3K9_HUMAN MITOG 7.9
ESTs 115
Homo sapiens mRNA for KIAA1771 protein, 9
ESTs 8.1
Homo sapiens cDNA: FLJ22726 fis, clone H 5.1
hypothetical protein FLJ22604 142
ESTs 7.9
Homo sapiens cDNA FLJ13558 fis, clone PL 6.6
EST 23.9
ESTs 32.4
ESTs, Weakly similar to ALUB_HUMAN HII 22.8
ESTs, Moderately similar to B34087 hypot 6.1
EST 135.3
ESTs 5.4
KIAA1856 protein 55
gb:ye57e05.s1 Soares fetal liver spleen 9.6
ESTs 8
Homo sapiens cDNA FLJ12789 fis, clone NT 12.8
ESTs, Weakly similar to IDN4-6GTR14 [H.s 25
Homo sapiens, Similar to RIKEN cDNA 1700 7.1
hypothetical protein MGC5576 7.7
thymidine kinase 1, soluble 55
small nuclear ribonucleoprotein polypept 53.9
stem cell growth factor; lymphocyte secr 135
chromosome 22 open reading frame 3 3
Homo sapiens, clone MGC:16362, mRNA, com 135
kynurenine 3-monooxygenase (kynurenine 3 72
NICE-5 protein 14
DKFZP566C243 protein 1.9
gb:wq05c02.x1 NCI_CGAP_Kid12 Homo sapien 2.9
ESTs, Highly similar to T46422 hypotheti 5
palladin 17.1
WW Domain-Containing Gene 205
KIAA1415 protein 5.6
melanoma-associated antigen recognised b 7.6
solute carrier family 12 (sodium/potassi 6.7
CGI-99 protein 2
ESTs 5

129482 AA188185 Hs.289043 spindlin 6.7
129559 W01296 Hs.11360 hypothetical protein FLJ14784 7.5
129587 H14718 Hs.11506 Human clone 23589 mRNA sequence 6.8
129629 AK000398 Hs.11747 hypothetical protein FLJ20391 3.8
129649 AD000092 Hs.16488 calreticulin 3.3
129680 U03749 gb:Human chromogranin A (CHGA) gene, pro 14.1
129689 AW748482 Hs.77873 B7 homolog 3 Z6
129702 AI304966 Hs.12035 ESTs, Weakly similar to I38022 hypotheti 7.4
129720 AA156214 Hs.12152 APMCF1 protein 2
130010 AA301116 Hs.142838 nucleolar phosphoprotein Nopp34 1.6
130097 AL046962 Hs.14845 forkhead box O3A 2.8
130135 AA311426 Hs.21635 tubulin, gamma 1 6.1
130211 NM_003358 Hs.23703 ESTs, Moderately similar to CEGT_HUMAN C 1.6
130242 X79201 Hs.153221 synovial sarcoma, translocated to X chro 5.4
130359 NM_013449 Hs.277401 bromodomain adjacent to zinc finger doma 8.5
130365 W56119 Hs.155103 eukaryotic translation initiation factor 11
130448 BE513202 Hs.15589 PPAR binding protein 3.9
130455 D90041 Hs.155956 N-acetyltransferase 1 (arylamine N-acety 33.6
130471 AL121438 Hs.183706 adducin 1 (alpha) 2.7
130503 BE208491 Hs.295112 KIAA0618 gene product 16.1
130511 L32137 Hs.1584 cartilage oligomeric matrix protein (pse 6.1
130542 U64675 Hs.179825 RAN binding protein 2-like 1 7.6
130553 AF062649 Hs.252587 pituitary tumor-transforming 1 14.4
130556 AI907018 Hs.15977 Empirically selected from AFFX single pr 4.7
130567 AA383092 Hs.1608 replication protein A3 (14kD) 7.9
130574 AF083208 Hs.16178 apoptosis antagonizing transcription fac 1.2
130617 M90516 Hs.1674 glutamine-fructose-6-phosphate transamin 12.1
130667 BE246961 Hs.17639 Homo sapiens ubiquitin protein ligase (U 13.9
130693 R68537 Hs.17962 ESTs 2
130744 H59696 Hs.18747 POP7 (processing of precursor, S. cerevi 3.1
130757 AL036067 Hs.18925 protein x 0001 5.7
130880 BE514434 Hs.20830 kinesin-like 2 2.1
130944 BE382657 Hs.21486 signal transducer and activator of trans 5.4
131046 AA321649 Hs.2248 small inducible cytokine subfamily B (Cy 7.4
131060 AA194422 Hs.22564 myosin VI 5.1
131099 AL133353 Hs^26581 COX15 (yeast) homolog, cytochrome c oxid 7
131135 NM_016569 Hs.267182 TBX3-iso protein 3.3
131185 BE280074 Hs.23960 cyclin B1 5.8
131225 H62087 Hs.31659 thyroid hormone receptor-associated prot 7.5
131245 AL080080 Hs.24766 thioredoxin domain-containing 2.8
131283 X80038 Hs.339713 Homo sapiens clone F19374 APOE-C2 gene 1.3
131569 AL389951 Hs.271623 nucleoporin 50kD 5
131643 AW410601 Hs.30026 HSPC182 protein 2.9
131714 AA642831 Hs.31016 putative DNA binding protein 2.9
131722 D13757 Hs.311 phosphoribosyl pyrophosphate amidotransf 3.4
131760 X76732 Hs.3164 nucleobindin 2 2.9
131793 AW966127 Hs.32246 Homo sapiens cDNA FLJ14656 fis, clone NT 7.9
131885 BE502341 Hs.3402 ESTs 13.7
131900 AA099014 Hs.231029 Homo sapiens, clone MGC:15961, mRNA, com 8.7
131905 AA179298 Hs.3439 stomatin-like 2 113
131941 BE252983 Hs.35086 ubiquitin specific protease 1 2.3
131971 BE567100 Hs.154938 hypothetical protein MDS025 3.5
132180 NM_004460 Hs.418 fibroblast activation protein, alpha 14.7
132203 NM_004782 Hs.194714 synaptosomal-associated protein, 29kD 7.8
132273 AA227710 Hs.43658 DKFZP586L151 protein 10
132288 N36110 Hs.305971 solute carrier family 2 (facilitated glu 9.2
132294 AB023191 Hs.44131 KIAA0974 protein 2
132348 AW067708 Hs.170311 heterogeneous nuclear ribonucleoprotein 12.5
132370 AW572805 Hs.46645 ESTs 28.3
132384 AA312135 Hs.46967 HSPC034 protein 6.1
132450 AA100012 Hs.48827 hypothetical protein FLJ12085 8.6
132465 AW169847 Hs.49169 KIAA1634 protein 6.1
132532 AA454132 Hs.5080 mitochondrial ribosomal protein L16 7.1
132574 AW631437 Hs.5184 TH1 drosophila homolog 14
132638 AI796870 Hs.54277 DNA segment on chromosome X (unique) 992 12.4
132718 NM_004600 Hs.554 Sjogren syndrome antigen A2 (60kD, ribon 3.7
132726 N52298 Hs.55608 hypothetical protein MGC955 14 2
132731 AI189075 Hs.301872 hypothetical protein MGC4840 5.9
132744 AA010233 Hs.55921 glutamyl-prolyl-tRNA synthetase 6.4
132773 AA459713 Hs.295901 KIAA0493 protein 14.6
132798 AI026701 Hs.5716 KIAA0310 gene product 2.5
132810 AB007944 Hs.5737 KIAA0475 gene product 4.2
132833 U78525 Hs.57783 eukaryotic translation initiation factor 6.1
132842 NM_016154 Hs.279771 Homo sapiens clone PP1596 unknown mRNA 7.1
132851 U09716 Hs.287912 lectin, mannose-binding, 1 6.1
132891 BE267143 Hs.59271 U2(RNU2) small nuclear RNA auxiliary fac 2.7
132941 AI817165 Hs.6120 hypothetical protein FLJ13222 2.1
132972 AA034365 Hs.288924 Homo sapiens cDNA FLJ11392 fis, clone HE 3.5
132980 AA040696 Hs.62016 ESTs 1.3
132994 AA112748 Hs.279905 clone HQ0310 PRO0310p1 17.1
133016 AI439688 Hs.6289 hypothetical protein FLJ20886 4.4
133177 X97795 Hs.66718 RAD54 (S. cerevisiae)-like 4.4
133208 AI801777 Hs.6774 ESTs 5.5
133254 AI567421 Hs.273330 Homo sapiens, clone IMAGE:3544662, mRNA, 1.3
133266 AI160873 Hs.69233 zinc finger protein 16.1
133268 AW956781 Hs.293937 ESTs, Weakly similar to FXD2_HUMAN FORKH 12.2
133285 M76477 Hs.289082 GM2 ganglioside activator protein 10.4
133390 AI950382 Hs.72660 phosphatidylserine receptor 5.7
133391 AW103364 Hs.727 inhibin, beta A (activin A, activin AB a 25.5
133540 AL037159 Hs.74619 proteasome (prosome, macropain) 26S subu 1.7
133594 AW160781 Hs.172589 nuclear phosphoprotein similar to S. cer 2.6
133621 NM_004893 Hs.75258 H2A histone family, member Y 13.5
133720 L27841 Hs.75737 pericentriolar material 1 6.7
133760 BE271766 Hs.181357 laminin receptor 1 (67kD, ribosomal prot 5.4
133784 BE622743 Hs.301064 arfaptin 1 12.1
133791 M34338 Hs.76244 spermidine synthase 9.7
133797 AI133921 Hs.76272 retinoblastoma-binding protein 2 1.3
133822 D50525 Hs.699 peptidylprolyl isomerase B (cyclophilin 9.7
133850 W29092 Hs.7678 cellular retinoic acid-binding protein 1 4.2
133865 AB011155 Hs.170290 discs, large (Drosophila) homolog 5 5
133881 U30872 Hs.77204 centromere protein F (350/400kD, mitosin 9.1
133924 D86326 Hs.325948 vesicle docking protein p115 1.8
133959 X81789 Hs.77897 splicing factor 3a, subunit 3, 60kD 10.4
133989 AU040328 Hs.78202 SWI/SNF related, matrix associated, acti 2.6
133997 AI824113 Hs.78281 regulator of G-protein signalling 12 13
134234 BE300078 Hs.80449 Homo sapiens, clone IMAGE:3535294, mRNA, 10.3
134348 AW291946 Hs.82065 interleukin 6 signal transducer (gp130, 6.7
134376 X06560 Hs.82396 2',5'-oligoadenylate synthetase 1 (40-46 5.5
134379 AW362124 Hs.323193 hypothetical protein MGC3222 5.8
134405 AW067903 Hs.82772 collagen, type XI, alpha 1 72.9
134421 AU077196 Hs.82985 collagen, type V, alpha 2 6.7
134480 NM_005000 Hs.83916 Empirically selected from AFFX single pr 6.2
134516 AK001571 Hs.273357 hypothetical protein FLJ10709 1.4
134529 AW411479 Hs.848 FK506-binding protein 4 (59kD) 2.8
134751 AW630803 Hs.89497 lamin B1 6.1
134790 BE002798 Hs.287850 integral membrane protein 1 12
134806 AD001528 Hs.89718 spermine synthase 2.6
134850 AI701162 Hs.90207 hypothetical protein MGC11138 9.1
134859 D26488 Hs.90315 KIAA0007 protein 13.3
134971 AI097346 Hs.286049 phosphoserine aminotransferase 2
135181 BE250665 Hs.279529 px19-like protein 14.9
135207 N26427 Hs.9634 ESTs, Highly similar to C10_HUMAN PUTATI 1.7
135245 AI028767 Hs.262603 ESTs 12.2
135257 AW291023 Hs.97255 ESTs, Weakly similar to A46010 X-linked 7.6
135307 AI743770 Hs.98368 ESTs, Weakly similar to KIAA0822 protein 5.8
135321 AI652069 Hs.98614 ribosome binding protein 1 (dog 180kD ho 12.3
135354 AA456454 Hs.183418 cell division cycle 2-like 1 (PITSLRE pr 5.7
135400 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 13.9
302276 AW057736 Hs.323910 HER2 receptor tyrosine kinase (c 5.3
317781 NM_007057 Hs.42650 ZW10 interactor 2.8
321114 AA902256 Hs.78979 Golgi apparatus protein 1 5.5
322556 BE041451 Hs.177507 hypothetical protein 2.9
420802 U22376 Hs.1334 v-myb avian myeloblastosis viral oncogen 2.3
424001 W67883 Hs.137476 paternally expressed 10 {PEG10; KIAA105 7 
425182 AF041259 Hs.155040 zinc finger protein 217 2.3
446999 AA151520 Hs.334822 hypothetical protein MGC4485 7.5
450701 H39960 Hs.288467 Homo sapiens cDNA FLJ12280 fis, clone MA 5.6
452461 N78223 Hs.108106 transcription factor 4.7
453157 AF077036 Hs.31989 DKFZP586G1722 protein 12.1
TABLE 5A

Table 5A shows the accession numbers for those pkeys lacking unigeneID's for Table 5. For
each probeset, we have listed the gene cluster number from which the oligonucleotides were
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and
mRNAs. These sequences were clustered based on sequence similarity using Clustering and
Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers for
sequences comprising each cluster are listed in the "Accession" column.



Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey CAT number Accessions



123615 30686_-15 AA609170
124385 656394_1 AI267847 N27351
110856 19346_14 AA992380 N33063 N21418 H79958 R21911 H79957
120472 44573_2 AI950087 N70208 R97040 N36809 AI308119 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671
AA908598 AA251875 AI820501 AI820532 W87691 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517
AI219788 AA884444 N92578 F13493 AA927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282
AA488964 AA283144 AI890387 AI950344 AI741346 AI689062 AA282915 AW102898 AI672193 AI763273 AW173586
AW150329 AI653832 AI762688 AA988777 AA488892 AI356394 AW103813 AI539642 AA642789 AA856975 AW505512
AI961530 AW629970 BE612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 AW961101
AA251669 AA251874 AI819225 AW205862 AI683338 AI858509 AW276905 AI633006 AA972584 AA908741 AW072629
AW513996 AA293273 AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701
AW613456 AI373032 AI564269 F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226
F04005
129019 44573_2 AI950087 N70208 R97040 N36809 AI308119 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671
AA908598 AA251875 AI820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517
AI219788 AA884444 N92578 F13493 AA927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282
AA488964 AA283144 AI890387 AI950344 AI741346 AI689062 AA282915 AW102898 AI872193 AI763273 AW173586
AW150329 AI653832 AI762688 AA988777 AA488892 AI356394 AW103813 AI539642 AA642789 AA856975 AW505512
AI961530 AW629970 BE612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 AW961101
AA251669 AA251874 AI819225 AW205862 AI683338 AI858509 AW276905 AI633006 AA972584 AA908741 AW072629
AW513996 AA293273 AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701
AW613456 AI373032 AI564269 F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226
F04005
120695 9683_1 AA976503 AI917802 AA953664 AA404613 AA428771 BE280542 AW194691 AI927301 AI740458
AI796100 AI935603 AW052210 AA970201 AI633384 AA425910 AI017004 AI241295 AA402816 AA291468
122618 305217_1 AA453641 AA454061
125115 genbank_T97341 T97341
120809 genbank_AA346495 AA346495
129680 23162_1 U03749 NM_001275 J03483 J03915 AI214509 AW245744 AL046455 AA318960 AI741505 AA843875 AI829382 AI560122
AI858999 D55958 AI684005 D53170 AA854091 AI025609 D53119 D54729 D55504 D55377 D55313 AW512244 AA846441
AW043898 AI969102 AA405741 AI091983 AA788784 AA706586 AA854361 AW470949 AA843095 AA772028 AI148432
AI038109 AA782478 AA910064 AI220384 AA781296 AA843881 AA854064 AA843125 AA843419 AA319036 AA319054
AI273831 W32275 AI584185 C05724 AA789023 AI686818 D54392 AI022485 AA431410 AA854232 W39212 W15214
AA894441 AI803081 AI167381 AW245389 AA319430 AA335156 AI042646 AA327030 AA725170 T27943 AA889304
AA976699 AI687001 AI621107 AI865540 AA772107 C06286 AA319661 AA405992
101045 entrez_J05614 J05614
110501 genbank_H55748 H55748
121558 genbank_AA412497 AA412497
121911 genbank_AA427950 AA427950
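
The Pkey-to-cluster mapping of Table 5A can be illustrated with a minimal Python sketch that loads rows of the form "Pkey, CAT number, accessions" into a dictionary and looks up one probeset. The function name `parse_cluster_rows` and the plain-text row layout are illustrative assumptions, not part of the original disclosure; the two sample rows are transcribed from the table, with the cluster suffix read as `_1`, which is itself an assumption about the scanned text.

```python
from typing import Dict, List, Tuple


def parse_cluster_rows(text: str) -> Dict[str, Tuple[str, List[str]]]:
    """Map each Pkey to (CAT number, list of Genbank accessions).

    Assumes one row per line, whitespace separated: Pkey, CAT number,
    then one or more accessions (a hypothetical plain-text layout).
    """
    table: Dict[str, Tuple[str, List[str]]] = {}
    for line in text.splitlines():
        fields = line.split()
        if len(fields) < 3:
            continue  # skip blank or incomplete lines
        pkey, cat_number, accessions = fields[0], fields[1], fields[2:]
        table[pkey] = (cat_number, accessions)
    return table


# Sample rows transcribed from Table 5A (suffix "_1" is an assumed reading).
sample = """
124385 656394_1 AI267847 N27351
122618 305217_1 AA453641 AA454061
"""
clusters = parse_cluster_rows(sample)
print(clusters["122618"])
```

A dictionary keyed by Pkey keeps the lookup direct; rows whose accession list wraps over several lines would need to be joined before parsing.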
TABLE 6: Figure 6 from BRCA 001 US



Table 6 shows genes upregulated in tumor tissue compared to normal breast tissue. 

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of tumor to normal breast tissue

Pkey ExAccn UnigeneID UnigeneTitle R1

100147 D13666 Hs.136348 osteoblast specific factor 2 (fasciclin 7.5
100678 AW502935 Hs.740 PTK2 protein tyrosine kinase 2 53.2
101806 AA586894 Hs.112408 S100 calcium-binding protein A7 (psorias 8.9
102455 U48705 Hs.75562 discoidin domain receptor family, member 6.9
103206 X72755 Hs.77367 monokine induced by gamma interferon 8.8
105743 BE246502 Hs.9598 sema domain, immunoglobulin domain (Ig), 2.6
105746 AW151952 Hs.46679 hypothetical protein FLJ20739 1.5
106373 AW503807 Hs.21907 histone acetyltransferase 1.8
110240 AI668594 Hs.176588 ESTs, Weakly similar to CP4Y_HUMAN CYTOC 45
119260 AK001724 Hs.102950 coat protein gamma-cop 3.2
120206 H26735 Hs.91668 Homo sapiens clone PP1498 unknown mRNA 45.7
120253 AA131376 Hs.326401 fibroblast growth factor 12B 38.9
120297 AA191384 Hs.104072 ESTs, Weakly similar to Z195_HUMAN ZINC 15.2
120624 AW407987 Hs.173518 M-phase phosphoprotein homolog 52
120695 AA976503 gb:oq30a04.s1 NCI_CGAP_GC4 Homo sapiens 46.8
120807 AA346385 Hs.30002 SH3-containing protein SH3GLB2; KIAA1848 6.8
121508 M402515 HsS7B87 ESTs 28
122607 AA453518 Hs.98023 ESTs 61.5
122616 AA453638 Hs.161873 ESTs 107.3
122618 AA453641 gb:zx48e06.s1 Soares_testis_NHT Homo sap 31.1
122829 AW204530 Hs.99500 ESTs 81.8
122838 AA460584 Hs.334386 ESTs 75.3
123753 M609955 Hs.234961 Huntingtin interacting protein E 30.6
124385 AI267847 gb:aq49a10.x1 Stanley Frontal NB pool 2 57.1
124860 R65763 Hs.101477 EST 23.9
124930 AI076343 Hs.173939 ESTs, Weakly similar to ALUB_HUMAN Ml 22.8
125051 T79956 Hs.100588 EST 135.3
128781 N71826 Hs.105465 small nuclear ribonucleoprotein polypept 53.9
129096 AA463189 Hs.288906 WW Domain-Containing Gene 205
129347 BE614192 Hs.279869 melanoma-associated antigen recognised b 7.6
129689 AW748482 Hs.77873 B7 homolog 3 2.6
130503 BE208491 Hs.295112 KIAA0618 gene product 16.1
130511 L32137 Hs.1584 cartilage oligomeric matrix protein (pse 6.1
131046 AA321649 Hs^248 small inducible cytokine subfamily B (Cy 7.4
131643 AW410601 Hs.30026 HSPC182 protein 2.9
131925 AF151048 Hs.183180 anaphase promoting complex subunit 11 (y 2.7
132180 NM_004460 Hs.418 fibroblast activation protein, alpha 14.7
132370 AW572805 Hs.46645 ESTs 283
132994 AA112748 Hs.279905 clone HQ0310 PRO0310p1 17.1
133016 AI439688 Hs.6289 hypothetical protein FLJ20386 4.4
133266 AI160873 Hs.69233 zinc finger protein 16.1
133391 AW103364 Hs.727 inhibin, beta A (activin A, activin AB a 255
134169 AI690916 Hs.178137 transducer of ERBB2, 1 \2
134219 NM_000402 Hs.80206 glucose-6-phosphate dehydrogenase 15
134405 AW067903 Hs.82772 collagen, type XI, alpha 1 723
134529 AW411479 Hs.848 FK506-binding protein 4 (59kD) 2.8
134975 R50333 Hs.92186 Leman coiled-coil protein 2.6
135181 BE250865 Hs.279529 px19-like protein 145
322556 BE041451 Hs.177507 hypothetical protein 19
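
The R1 column is defined as the ratio of tumor to normal breast tissue expression. As a hedged illustration only, the Python sketch below computes such a ratio from two expression values and selects probesets whose R1 exceeds a cutoff. The function names, the input intensities (150.0 and 50.0) and the cutoff of 2.0 are hypothetical and are not taken from the disclosure; the three R1 values in `sample_r1` are copied from Table 6.

```python
from typing import Dict, List


def tumor_to_normal_ratio(tumor: float, normal: float) -> float:
    """Return the tumor/normal expression ratio (an R1-style value)."""
    if normal <= 0:
        raise ValueError("normal expression must be positive")
    return tumor / normal


def upregulated(r1_by_pkey: Dict[str, float], cutoff: float = 2.0) -> List[str]:
    """Return the Pkeys whose ratio is at least `cutoff`, sorted by Pkey.

    The default cutoff is a hypothetical choice for illustration.
    """
    return sorted(pkey for pkey, r1 in r1_by_pkey.items() if r1 >= cutoff)


# Hypothetical intensities, used only to show the arithmetic.
example_ratio = tumor_to_normal_ratio(150.0, 50.0)

# R1 values copied from Table 6 for three probesets.
sample_r1 = {"100678": 53.2, "105746": 1.5, "119260": 3.2}
selected = upregulated(sample_r1)
print(example_ratio, selected)
```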
TABLE 6A



Table 6A shows the accession numbers for those pkeys lacking unigeneID's for Table 6. For
each probeset, we have listed the gene cluster number from which the oligonucleotides were
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and
mRNAs. These sequences were clustered based on sequence similarity using Clustering and
Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers for
sequences comprising each cluster are listed in the "Accession" column.



Pkey CAT number Accessions
124385 656394_1 AI267847 N27351
120695 9683_1 AA976503 AI917802 AA953664 AA404613 AA428771 BE280542 AW194691 AI927301 AI740458 AI796100 AI935603
AW052210 AA970201 AI633384 AA425910 AI017004 AI241295 AA402816 AA291468
122618 305217_1 AA453641 AA454061

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers
TABLE 7: Figure 7 from BRCA 001-1 US

Table 7 shows genes upregulated in tumor tissue compared to normal breast tissue. Open
reading frames in the sequences have been characterized as having a signal sequence (SS), a
transmembrane domain (TM) or other.
Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of tumor to normal breast tissue
ORF struct info: Structural characterization of open reading frame for the sequence of the gene



Pkey ExAccn UnigeneID UnigeneTitle R1 ORF struct info

100113 NM_001269 Hs.84746 chromosome condensation 1 2.3 TM
100114 X02308 Hs.82962 thymidylate synthetase 2.9 other
100131 D12485 Hs.11951 ectonucleotide pyrophosphatase/phosphodi 1.9 other
100146 BE185499 Hs.2471 KIAA0020 gene product 1.9 TM
100147 D13666 Hs.136348 osteoblast specific factor 2 (fasciclin 7.6 other
100154 H60720 Hs.81892 KIAA0101 gene product 9.2 other
100163 W44671 Hs.124 gene predicted from cDNA with a complete 1.6 other
100220 AW015534 Hs.217493 annexin A2 2 other
100265 D38521 Hs.112396 KIAA0077 protein 1.5 other
100271 BE160081 Hs.256290 S100 calcium-binding protein A11 (calgiz 13.5 other
100275 BE242802 Hs.154797 KIAA0090 protein 5.1 other
100323 D50920 Hs.23106 KIAA0130 gene product 1.9 TM
100335 AW247529 Hs.6793 platelet-activating factor acetylhydrola 2.7 other
100364 NM_004341 Hs.154868 carbamoyl-phosphate synthetase 2, aspart 2 other
100372 NM_014791 Hs.184339 KIAA0175 gene product 2.6 other
100393 D84145 Hs.39913 novel RGD-containing protein 3.2 other
100400 AW954324 Hs.75790 phosphatidylinositol glycan, class C 1.5 other
100418 D86978 Hs.84790 KIAA0225 protein 2 other
100482 M65028 Hs.81361 heterogeneous nuclear ribonucleoprotein 2.9 other
100518 NM_004415 Hs.74316 desmoplakin (DPI, DPII) 1.9 other
100666 L05424 Hs.169610 CD44 antigen (homing function and Indian 5.7 other
100667 L05424 Hs.169610 CD44 antigen (homing function and Indian 9 ?
100668 L05424 Hs.169610 CD44 antigen (homing function and Indian 7.7 other
100678 AW502935 Hs.740 PTK2 protein tyrosine kinase 2 53.2 other
100783 AF078847 Hs.191356 general transcription factor IIH, polype 6 other
100892 BE245294 Hs.180789 S164 protein 1.7 ?
100945 AF002225 Hs.180686 ubiquitin protein ligase E3A (human papi 1.5 other
100969 AA157634 Hs.79172 solute carrier family 25 (mitochondrial 6.3 other
100988 AK000405 Hs.76480 ubiquitin-like 4 11.4 ?
100999 H38765 Hs.80706 diaphorase (NADH/NADPH) (cytochrome 1.6 other
101031 J05070 Hs.151738 matrix metalloproteinase 9 (gelatinase B 8.4 other
101045 J05614 gb:Human proliferating cell nuclear anti 5 ?
101077 N99692 Hs.75227 Empirically selected from AFFX single pr 2.6 other
101093 L06419 Hs.75093 procollagen-lysine, 2-oxoglutarate 5-dio 1.4 ?
101186 AA020956 Hs.179881 core-binding factor, beta subunit 2 TM
101216 AA284166 Hs.84113 cyclin-dependent kinase inhibitor 3 (CDK 1.8 other
101228 AA333387 Hs.82916 chaperonin containing TCP1, subunit 6A ( 1.7 TM
101247 AA132666 Hs.78802 glycogen synthase kinase 3 beta 1.9 other
101249 L18964 Hs.1904 protein kinase C, iota 1.5 other
101332 J04088 Hs.156346 topoisomerase (DNA) II alpha (170kD) 5.3 other
101352 AW94299 Hs.16297 COX17 (yeast) homolog, cytochrome c oxid 4.2 other
101396 BE267931 proliferating cell nuclear antigen 1.9 TM
101445 M21259 gb:Human Alu repeats in the region 5' to 1.6 TM
101470 NM_000546 Hs.1846 tumor protein p53 (Li-Fraumeni syndrome) 2.5 other
101478 NM_002890 Hs.758 RAS p21 protein activator (GTPase activa 5.5 other
101483 M24486 Hs.76768 procollagen-proline, 2-oxoglutarate 4-di Z1 other
101540 J04977 Hs.84981 X-ray repair complementing defective rep 1.6 other
101573 AW248421 Hs.250758 proteasome (prosome, macropain) 26S subu 5.7 other
101580 NM_012151 Hs.83363 coagulation factor VIII-associated (intr 1.8 other
101592 AF064853 Hs.91299 guanine nucleotide binding protein (G pr 5.6 ?
101621 BE391804 Hs.62661 guanylate binding protein 1, interferon- 2.4 other
101702 AW504085 Hs.179574 protein phosphatase 2 (formerly 2A), reg 1.3 other
101734 M74099 Hs.147049 cut (Drosophila)-like 1 (CCAAT displacem 2.1 ?
101759 M80244 Hs.184601 solute carrier family 7 (cationic amino 5 TM
101767 M81057 Hs.180884 carboxypeptidase B1 (tissue) 14.4 SS,
101782 AA306495 Hs.1869 phosphoglucomutase 1 5.2 other
101805 AW409747 Hs.75612 stress-induced-phosphoprotein 1 (Hsp70/H 8.6 other
101806 AA586894 Hs.112408 S100 calcium-binding protein A7 (psorias 8.9 SS,TM
101810 NM_000318 Hs.180612 peroxisomal membrane protein 3 (35kD, Ze 22 TM
101879 AA176374 Hs.243886 nuclear autoantigenic sperm protein (his 1.6 other
101911 AA441787 Hs.119689 glycoprotein hormones, alpha polypeptide 31.3 ?
101920 AF182645 Hs.8024 IK cytokine, down-regulator of HLA II 1.8 other
101973 U41514 Hs.80120 UDP-N-acetyl-alpha-D-galactosamine:polyp 2.4 other
102009 BE245149 Hs.82643 protein tyrosine kinase 9 1.3 other
102036 BE250127 Hs.82906 CDC20 (cell division cycle 20, S. cerevi 2 ?
102083 T35901 Hs.75117 interleukin enhancer binding factor 2, 4 1.6 other
102107 BE258602 Hs.182366 heat shock protein 75 1.4 other
102123 NM_001809 Hs.1594 centromere protein A (17kD) 1.8 other
102165 BE313280 Hs.159627 death associated protein 3 4.6 ?
102198 AW950852 Hs.74598 polymerase (DNA directed), delta 2, regu 4.4 ?
102217 AA829978 Hs.301613 JTV1 gene 6.7 other
102220 U24389 Hs.65436 lysosomal 4.4 TM
102234 AW163390 Hs.278554 heterochromatin-like protein 1 1.9 TM
102260 AL039104 Hs.159557 karyopherin alpha 2 (RAG cohort 1, impor 4.4 other
102302 AA306342 Hs.69171 protein kinase C-like 2 2.7 ?
102330 BE298063 Hs.77254 chromobox homolog 1 (Drosophila HP1 beta 1.5 other
102339 BE378432 Hs.95577 cyclin-dependent kinase 4 2.3 TM
102348 U37519 Hs.87539 aldehyde dehydrogenase 3 family, member 2 TM
102349 AU077055 Hs.289107 baculoviral IAP repeat-containing 2 3.2 other
102369 U39840 Hs.299867 hepatocyte nuclear factor 3, alpha 2 other
102374 U33635 Hs.90572 PTK7 protein tyrosine kinase 7 6.2 other
102391 AA296874 Hs.77494 deoxyguanosine kinase 1.5 TM
102455 U48705 Hs.75562 discoidin domain receptor family, member 7 other
102465 NM_001359 Hs.81548 2,4-dienoyl CoA reductase 1, mitochondri 1.8 SS,
102488 U50939 Hs.61828 amyloid beta precursor protein-binding p 1.5 ?
102489 AL080116 Hs.74420 origin recognition complex, subunit 3 (y 3.3 other
102494 AI188137 Hs.75193 COP9 homolog 2.1 other
102501 AF217197 Hs.74562 siah binding protein 1; FBP interacting 3.2 other
102522 BE250944 Hs.183556 solute carrier family 1 (neutral amino a 2.8 ?
102532 AF040253 Hs.70186 suppressor of Ty (S. cerevisiae) 5 homolo 5.7 ?
102564 U59423 Hs.79067 MAD (mothers against decapentaplegic, Dr 2.3 other
102568 W81489 Hs.223025 RAB31, member RAS oncogene family 5.3 other
102580 U60808 Hs.152981 CDP-diacylglycerol synthase (phosphatida 2.1 other
102581 AU077228 Hs.77256 enhancer of zeste (Drosophila) homolog 2 1.6 ?
102582 U61232 Hs.32675 tubulin-specific chaperone e 2A other
102617 AW161453 Hs.198767 COP9 (constitutive photomorphogenic, Ara 1.8 other
102618 AL037672 Hs.81071 extracellular matrix protein 1 5.8 other
102627 AL021918 Hs.158174 zinc finger protein 184 (Kruppel-like) U other
102663 NM_002270 Hs.168075 karyopherin (importin) beta 2 1.8 TM
102676 BE262989 Hs.12045 putative protein 2.3 other
102687 NM_007019 Hs.93002 ubiquitin carrier protein E2-C 4.4 ?
102689 U96132 Hs.171280 hydroxyacyl-Coenzyme A dehydrogenase, ty 6 ?
102696 BE540274 Hs.239 forkhead box M1 4.2 other
102704 AU077058 Hs.54089 BRCA1 associated RING domain 1 1.9 other
102705 T97490 Hs.50002 small inducible cytokine subfamily A (Cy 2.3 SS,TM
102750 AB014460 Hs.66196 nth (E. coli endonuclease III)-like 1 12 TM
102801 BE252241 Hs.38041 pyridoxal (pyridoxine, vitamin B6) kinas 6.5 other
102812 U90549 Hs.236774 high-mobility group (nonhistone chromoso 1.6 other
102827 BE244588 Hs.6456 chaperonin containing TCP1, subunit 2 (b 5.6 TM
102844 AV653790 Hs.324275 WW domain-containing protein 1 U TM
102868 X02419 Hs.77274 plasminogen activator, urokinase 4.4 other
102925 BE440142 Hs.2943 signal recognition particle 19kD 1.9 other
102935 BE561850 Hs.80506 small nuclear ribonucleoprotein polypept 2.4 ?
102968 AU076611 Hs.154672 methylene tetrahydrofolate dehydrogenase 2.7 other
102983 BE387202 Hs.118638 non-metastatic cells 1, protein (NM23A) 3.1 other
102985 U95742 Hs.2707 G1 to S phase transition 1 5.2 ?
103023 AW500470 Hs.117950 multifunctional polypeptide similar to S 1.6 other
103038 AA926950 Hs.334883 CDC28 protein kinase 1 2.5 TM
103060 NM_005940 Hs.155324 matrix metalloproteinase 11 (MMP11; stro 45 other
103080 AU077231 Hs.82932 cyclin D1 (PRAD1: parathyroid adenomatos 3.1 other
103089 D31152 Hs.179729 collagen, type X, alpha 1 (Schmid metaph 2.4 other
103177 BE244377 Hs.48876 farnesyl-diphosphate farnesyltransferase 3.5 other
103178 AA205475 Hs.275865 ribosomal protein S18 9.9 ?
103179 NM_001777 Hs.82685 CD47 antigen (Rh-related antigen, integr 1.3 other
103181 X69636 Hs.334731 Homo sapiens, clone IMAGE:3448306, mRNA, 2 other
103185 NM_006825 Hs.74368 transmembrane protein (63kD), endoplasmi 1.6 other
103191 AA401039 Hs^903 protein phosphatase 4 (formerly X), cata 2.5 other
103193 NM_004766 Hs.75724 coatomer protein complex, subunit beta 2 2.2 TM
103194 NM_004939 Hs.78580 DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 6.3 TM
103206 X72755 Hs.77367 monokine induced by gamma interferon 8.8 TM
103223 BE275607 Hs.1708 chaperonin containing TCP1, subunit 3 (g 3 other
103232 X75962 Hs.129780 tumor necrosis factor receptor superfami 1.8 other
103238 AI369285 Hs.75189 death-associated protein 5.6 TM
103297 NM_001545 Hs.9078 immature colon carcinoma transcript 1 1.9 ?
103330 AI803447 Hs.77496 small nuclear ribonucleoprotein polypept 2.5 other
103349 X89059 gb:H.sapiens mRNA for unknown protein ex 1.6 other
103376 AL036166 Hs.323378 coated vesicle membrane protein 1.8 other
103391 X94453 Hs.114366 pyrroline-5-carboxylate synthetase (glut 2.3 other
103392 X94563 gb:H.sapiens dbi/acbp gene exon 1 & 2. 4 TM
103430 BE564090 Hs.20716 translocase of inner mitochondrial membr 1.3 other
103491 AF264750 Hs.288971 myeloid/lymphoid or mixed-lineage leukem 5.7 ?
103505 AL031224 Hs.33102 transcription factor AP-2 beta (activati 5.1 other
103547 AI376722 Hs.180062 proteasome (prosome, macropain) subunit 9.7 ?
103588 NM_006218 Hs.85701 phosphoinositide-3-kinase, catalytic, al 2 other
103613 NM_000346 Hs.2316 SRY (sex determining region Y)-box 9 (ca 13 ?
103621 BE379766 Hs.150675 polymerase (RNA) II (DNA directed) polyp 2 other
103622 AA609685 Hs.278672 membrane component chromosome 11, surfa 22 TM
103727 AI878883 Hs.296381 growth factor receptor-bound protein 2 1.3 other
103754 AJ015709 Hs.172089 Homo sapiens mRNA; cDNA DKFZp586I2022 (f 1.3 other
103780 AA094752 Hs.169992 hypothetical 43.2 Kd protein 7.6 ?
103795 H26531 Hs.7367 Homo sapiens BTB domain protein (BDPL) m 1.3 SS,TM
103797 AA080912 gb:zn04d03.r1 Stratagene hNT neuron (937 1.6 other
103813 AI042582 Hs.181271 CGI-120 protein 1.6 other
103855 W02363 Hs.302267 hypothetical protein FLJ10330 1.6 other
103886 AK001278 Hs.105737 hypothetical protein FLJ10416 similar to 6.6 TM
104052 NM_002407 Hs.97644 mammaglobin 2 2.9 other
104079 AA251242 Hs.103238 ESTs 1.4 other
104174 AA478984 Hs.6451 PRO0659 protein 3.0 TM
104227 AB002343 Hs.98938 protocadherin alpha 9 1.0 other
104275 AI751970 Hs.101067 GCN5 (general control of amino-acid synt D.H other
104325 BE379766 Hs.150675 polymerase (RNA) II (DNA directed) polyp C A other
104370 AA324597 Hs.21851 Homo sapiens cDNA FLJ12900 fis, clone NT 1.0 other
104423 R83113 Hs.1432 protein kinase C substrate 80K-H O.Z other
104482 AB037762 Hs.44268 myelin gene expression factor 2 \2 other
104667 AI239923 Hs.30098 ESTs 1.4 other
104757 AI694413 Hs.332649 olfactory receptor, family 2, subfamily 2.4 other
104804 AI858702 Hs.31803 ESTs, Weakly similar to N-WASP [H.sapien 1.4 other
104806 AB023175 Hs.22982 KIAA0958 protein 2.4 other
104827 AW052006 Hs.8551 PRP4/STK/WD splicing factor 10.9 other
104846 AI250789 Hs.32478 ESTs 5.7 other
104854 AA041276 Hs.154729 3-phosphoinositide dependent protein kin 12.3 ?
104867 AA278898 Hs.225979 hypothetical protein similar to small G 2.1 other
104871 T78044 Hs.28893 Homo sapiens mRNA; cDNA DKFZp564O2364 (f 1.4 other
104896 AW015318 Hs.23165 ESTs 17.7 other
104909 AW408164 Hs.249184 transcription factor 19 (SC1) 5.1 TM
104916 AW958157 Hs.155489 NS1-associated protein 1 1.8 other
104919 AA026880 Hs.25252 prolactin receptor 1.5 other
104930 ARM3467 Hs.32893 neurexophilin 2 13 other
104973 NM_015310 Hs.6763 KIAA0942 protein 5.1 other
104974 Y12059
104975 AL136877
104978 AI199268
104979 AA937934
104994 AI499930
105009 BE379584
105012 AF098158
105028 AJ050715
105041 AB037716
105045 BE242899
105079 AA151342
105087 AA147884
105088 H58589
105095 Z78407
105110 BE387350
105126 AW975433
105127 AA045648
105141 AA164687
105158 AW976357
105169 BE245294
105186 AA191512
105254 AA071276
105281 AA263143
105288 N99673
105302 AA700122
105331 AW270037
105359 NM_016015
105366 BE264645
105373 AW887701
105374 BE242803
105387 AW592146
105393 AF167570
105399 BE386877
105400 AF198620
105445 AA252395
105507 BE268348
105529 AA113449
105530 AB023179
105547 AA262640
105564 BE616694
105596 AA579535
105597 AF054284
105608 AI808201
105610 AA280072
105617 AK000892
105620 AW302245
105658 AA985190
105697 AW499988
105708 R26944
105743 BE246502
105746 AW151952
105759 AI123118
105771 AI267720
105820 AA741336
105826 AA478756
105856 AI262106
105858 AF151066
105875 AK001708
105930 AF016371
106000 AW194426
106011 AW081202
106017 AA477956
106073 AL157441
106078 AA130158
106094 AA533491
106140 AB006624
106271 AA251393



Hs.278675
Hs.50758
Hs.19322
Hs.321062
Hs.334885
Hs.34789
Hs.9329
Hs.2331
Hs.26204
Hs.129951
Hs.12677
Hs.9812
Hs.35156
Hs.27023
Hs.33122
HS.362BS
Hs.301957
Hs.177576
Hs.234545
Hs.180789
Hs.28005
Hs.19469
Hs.24596
Hs.3585
Hs.3355
Hs.179507
Hs.27445
bromodomain-containing 4
SMC4 (structural maintenance of chromoso
Homo sapiens, Similar to RIKEN cDNA 2010
ESTs
mitochondrial GTP binding protein
dolichyl-diphosphooligosaccharide-protei
chromosome 20 open reading frame 1
E2F transcription factor 5, p130-binding
KIAA1295 protein
speckle-type POZ protein
CGI-47 protein
Homo sapiens cDNA FLJ14388 fis, clone HE
Homo sapiens cDNA FLJ11027 fis, clone PL
vesicle transport-related protein
KIAA1160 protein
ESTs
nudix (nucleoside diphosphate linked moi
mannosyl (alpha-l^lycoprotein beta-
hypothetical protein NUF2R
S164 protein
Homo sapiens cDNA FLJ11309 fis, clone PL
KIAA0859 protein
RAD51-interacting protein
ESTs, Weakly similar to AF126743 1 DNAJ
sentrin-specific protease
KIAA0779 protein
CGI-68 protein
hypothetical protein FLJ21918
hypothetical protein FLJ20628
hypothetical protein FLJ10326
membrane protein CH1
interleukin enhancer binding factor 3, 9
Npw38-binding protein NpwBP
RNA binding motif protein 8A
gb2s12g10^1 NCLCGAPJ3CB1 Homo sapiens
CCR4-NOT transcription complex, subunit
hypothetical protein FLJ20364
KIAA0962 protein
unknown
hypothetical protein FLJ14299
hypothetical protein FLJ20452
splicing factor 3b, subunit 1, 155kD
hypothetical protein FLJ12475
fetal Alzheimer antigen
glucocorticoid modulatory element bindin
casein kinase 1, gamma 2
hypothetical protein FLJ20059
zinc finger protein 278
Homo sapiens mRNA; cDNA DKFZp564M0264 (f
sema domain, immunoglobulin domain (Ig),
hypothetical protein FLJ20739
chemokine-like factor, alternatively spl
synovial sarcoma, translocated to X chro
transcriptional unit N143
E3 ubiquitin ligase SMURF2
ESTs
hypothetical protein
hypothetical protein FLJ10346
peptidyl prolyl isomerase H (cyclophilin
ESTs
Homo sapiens, clone IMAGE:2989556, mRNA,
ESTs
downstream neighbor of SON
ESTs, Moderately similar to ALU8_HUMAN A
hypothetical protein FLJ14681
KIAA0286 protein
Homo sapiens, Similar to RIKEN cDNA 5430
KIAA1321 protein
high-mobility group (nonhistone chromoso
ESTs, Weakly similar to A54849 collagen
cyclin B2
transmembrane 7 superfamily member 2
KIAA1483 protein
Homo sapiens cDNA: FLJ21487 fis, clone C
zinc finger protein 278
Homo sapiens cDNA: FLJ23038 fis, clone L
ESTs
Homo sapiens cDNA FLJ10071 fis, clone HE
fibulin 1
tyrosylprotein sulfotransferase 2
Homo sapiens mRNA; cDNA DKFZp564O0122 (f
ESTs, Moderately similar to S65657 alpha
hypothetical protein FLJ12549
TIA1 cytotoxic granule-associated RNA-bi
ESTs
Bcl-2-related ovarian killer protein-lik
hypothetical protein FLJ23293 similar to
centrin, EF-hand protein, 3 (CDC31 yeast
KIAA1323 protein
hypothetical protein
Homo sapiens, clone IMAGE:3343149, mRNA,
GL002 protein
transcription factor BMAL2
serum/glucocorticoid regulated kinase
hypothetical protein DKFZp434L1435 simil
hypothetical protein
ESTs
hypothetical protein FLJ20727
myeloid/lymphoid or mixed-lineage leukem
ectonucleotide pyrophosphatase/phosphodi
GTPase activating protein
hypothetical protein FLJ20505
GK001 protein
Down syndrome critical region gene 2
ESTs
KIAA1288 protein
ESTs

leukemia-associated phosphoprotein p18 (
ESTs, Moderately similar to I38759 zinc

translocase of outer mitochondrial membr
ESTs, Moderately similar to ALU7_HUMAN A
hypothetical protein MGC4606
Homo sapiens mRNA; cDNA DKFZp586I0324 (f
zinc finger protein 193
TATA element modulatory factor 1
kinesin family member 4A

ESTs
ESTs, Moderately similar to ALU7_HUMAN A
ESTs, Weakly similar to ALU1_HUMAN ALU S
potassium voltage-gated channel, delayed
keratin 6B
Ig superfamily receptor LNIR
pyruvate dehydrogenase kinase, isoenzyme
SWI/SNF related, matrix associated, acti
ESTs
F-box only protein 5
ESTs
ESTs
Homo sapiens cDNA FLJ12758 fis, clone NT
ESTs
ESTs
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108647 BE546947 Hs.44276 homeo box C10 

108695 AB029000 Hs.7Q823 K1M1077 protein 

108740 AI089575 Hs.9071 progesterone membrane binding protein 

108828 AK001693 Hs.273344 DKFZP564O0463 protein 

5 108859 AL121500 Hs.178904 ESTs 

108872 H06720 Hs.111680 endosuifine alpha 

108891 A1801235 Hs.48480 ESTs 

108894 AK001431 Hs.5105 hypothetical protein FU 10569 

108955 AA149754 Hs.195155 Homo sapiens amino add transport system 

10 108382 AA151708 Ms. 17 1350 homeo box (expressed In ES cells) 1 

108987 AA1 52178 Hs.23467 hypothetical protein FU10633 

109002 A8028987 Hs.72134 K1AA1064 protein 

109011 AA156542 Hs.72127 ESTs 

109026 AA157811 gb2o35d07.s1 Stratagene colon (937204) 

15 109068 AA164293 Hs.72545 ESTs 

109101 AW608930 Hs.52184 hypothetical protein RJ2061 8 

109112 AW419196 Hs.257924 hypothetical protein FU13782 

109124 AKD00684 Hs.183887 hypothetical protein FU22104 

109139 AJ132592 Hs.59757 zinc finger protein 281 

20 109166 AA219691 Hs.73625 RAB6 interacting, kinesin-like (rabkines 

109198 BE566742 Hs.58169 highly expressed in cancer, rich in leuc 

109213 NNL016603 Hs.82035 potential nuclear protein C50RF5;GAP-D 

109220 AW958181 Hs.189998 ESTs 

109233 AU077281 Hs.170285 nucleoporin 214kD (CAIN) 

25 109270 N99673 Hs.3585 ESTs, Weakly simflar to AF126743 1 DNAJ 

109273 AA375752 Hs.82719 Homo sapiens mRNA; cDNA DKFZp586F1822 (f 

109313 AF1532Q1 Hs.86276 C2H2 (KruppeUype) zinc finger protein 

109341 AA213506 Hs.115099 EST 

109391 AL096858 Hs.184245 KIAA0929 protein Msx2 Interacting nudea 

30 109420 H83603 Hs.40408 homeo box C9 

109426 N30531 Hs.42215 protein phosphatase 1 , regulatory subuni 

109429 AI160029 Hs.61438 ESTs 

109445 AA232103 Hs.189915 ESTs 

109450 AB032969 Hs.173042 KIAA1143 protein 

109468 NM_015310 Hs.6763 KIAA0942 protein
109478 AW074143 Hs.87134 ESTs
109570 L40027 Hs.118890 glycogen synthase kinase 3 alpha
109662 F02614 Hs.27319 ESTs
109825 R71264 Hs.16798 ESTs
110039 H11938 Hs.21907 histone acetyltransferase
110056 AA503041 Hs.279009 matrix Gla protein
110085 AA603840 Hs.29956 KIAA0460 protein
110110 T07353 Hs.7948 ESTs
110129 R51853 Hs.226429 ESTs, Weakly similar to ALU1_HUMAN ALU S
110154 NM_014521 Hs.17667 SH3-domain binding protein 4
110240 AI668594 Hs.176588 ESTs, Weakly similar to CP4Y_HUMAN CYTOC
110242 N41744 Hs.19978 CGI-30 protein
110259 H28428 Hs.32406 ESTs, Weakly similar to I38022 hypotheti
110312 BE256986 Hs.11896 hypothetical protein FLJ12089
110501 H55748 gb:yq94a01.s1 Soares fetal liver spleen
110504 H55915 Hs.210859 hypothetical protein FLJ11016
110525 H57330 Hs.37430 EST
110568 AK001160 Hs.5999 hypothetical protein FLJ10298
110699 T97586 Hs.18090 ESTs
110705 AB007902 Hs.32168 KIAA0442 protein
110742 AW190338 Hs.28029 hypothetical protein MGC11256
110761 AL138077 Hs.16157 hypothetical protein FLJ12707
110762 BE044245 Hs.30011 hypothetical protein MGC2963
110765 AK000322 Hs.18457 hypothetical protein FLJ20315
110769 BE000831 Hs.23837 Homo sapiens cDNA FLJ11812 fis, clone HE
110799 AI089660 Hs.323401 dpy-30-like protein
110805 T25829 Hs.24048 FK506 binding protein precursor
110813 AA767373 Hs.35669 ESTs, Moderately similar to ALU1_HUMAN A
110820 R33261 Hs.6614 ESTs, Weakly similar to A43932 mucin 2 p
110840 N31598 Hs.12727 hypothetical protein FLJ21610
110844 AI740792 Hs.167531 methylcrotonoyl-Coenzyme A carboxylase 2
110854 BE612992 Hs.27931 hypothetical protein FLJ10607 similar to
110856 AA992380 gb:ot37g06.s1 Soares_testis_NHT Homo sap
110885 BE384447 Hs.16034 hypothetical protein MGC13186
110897 AL117430 Hs.6880 DKFZP434D155 protein
110915 BE092285 Hs.29724 hypothetical protein FLJ13187
110918 H04360 Hs.24283 ESTs, Moderately similar to reduced expr
110958 NM_005864 Hs.24587 signal transduction protein (SH3 contain
110963 AK002180 Hs.11449 DKFZP564O123 protein
110981 AK001980 Hs.24284 ADP-ribosyltransferase (NAD+; poly(ADP-r
110984 AW613287 Hs.80120 UDP-N-acetyl-alpha-D-galactosamine:polyp
111125 N63823 Hs.269115 ESTs, Moderately similar to Z195_HUMAN Z
111132 AB037807 Hs.83293 hypothetical protein
111164 N46180 Hs.122489 Homo sapiens cDNA FLJ13289 fis, clone OV
111172 R67419 Hs.21851 Homo sapiens cDNA FLJ12900 fis, clone NT
111174 AL050166 Hs.26295 Homo sapiens mRNA; cDNA DKFZp586D1122 (f
111179 AK000136 Hs.10760 asporin (LRR class 1)
111184 AI815486 Hs.243901 Homo sapiens cDNA FLJ20738 fis, clone HE
111189 N67603 Hs.272130 ESTs, Weakly similar to S65824 reverse t
111216 AW139408 Hs.152940 ESTs
111221 AB037782 Hs.15119 KIAA1361 protein
111223 AA852773 Hs.334838 KIAA1866 protein
111239 N90956 Hs.17230 hypothetical protein FLJ22087
111285 AA778711 Hs.4310 eukaryotic translation initiation factor
111299 AB033091 HsJ4313 KIAA1265 protein
111312 AI523913 Hs.34504 ESTs
111318 T99755 Hs.334728 ESTs
111337 AA837396 Hs.263925 LIS1-interacting protein NUDE1, rat homo
111352 H58589 Hs.35156 Homo sapiens cDNA FLJ11027 fis, clone PL
111370 AI478658 Hs.94631 brefeldin A-inhibited guanine nucleotide
111384 N94606 Hs.288969 HSCARG protein
111389 AK000987 Hs.169111 oxidation resistance 1
111452 R02354 Hs.15999 ESTs
111486 AI051194 Hs.227978 EST
111549 W90638 Hs.20321 ESTs, Moderately similar to ZRF1_HUMAN Z
111585 R10720 Hs.20670 EST
111627 R52656 Hs.21691 ESTs
111870 AB037834 Hs.18685 Homo sapiens mRNA for KIAA1413 protein,
111937 BE298665 Hs.14846 Homo sapiens mRNA; cDNA DKFZp564D016 (fr
111944 AW083791 Hs.21263 suppressor of potassium transport defect
111987 NM_015310 Hs.6763 KIAA0942 protein
112134 R41823 Hs.7413 ESTs; calsyntenin-2
112244 AB029000 Hs.70823 KIAA1077 protein
112388 R46071 Hs.301693 Homo sapiens, clone IMAGE:3638994, mRNA,
112456 NM_016248 Hs.232076 A kinase (PRKA) anchor protein 11
112464 AW007287 Hs.28538 Homo sapiens cDNA: FLJ21086 fis, clone C
112506 AI742756 Hs.26079 ESTs
112513 R68425 Hs.13809 hypothetical protein FLJ10648
112752 AK001635 Hs.14838 hypothetical protein FLJ10773
112884 AK000004 Hs.5013 Homo sapiens mRNA for FLJ00004 protein,
112923 T10258 Hs.5037 EST
112936 AW970326 Hs.6185 KIAA1557 protein
112958 R61388 Hs.6724 ESTs
112966 Z44718 Hs.102548 glucocorticoid receptor DNA binding fact
112978 AK000272 Hs.7099 hypothetical protein FLJ20265
112995 AA737033 Hs.7155 ESTs, Moderately similar to 2115357A TYK
112995 BE276112 Hs.7165 zinc finger protein 259
113047 AI571940 Hs.7549 ESTs
113049 AW965190 Hs.7560 Homo sapiens mRNA for KIAA1729 protein,
113069 T40707 Hs.270862 ESTs
113196 T57317 gb:yb51a03.s1 Stratagene fetal spleen (9
113248 T63857 gb:yc16e01.s1 Stratagene lung (937210) H
113254 AK002180 Hs.11449 DKFZP564O123 protein
113277 AW971049 Hs.11774 protein (peptidyl-prolyl cis/trans isome
113429 AA688021 Hs.179808 ESTs
113499 AMS7908 Hs.8882 ESTs
113547 H59588 Hs.15233 ESTs
113647 AA813887 Hs.188173 Homo sapiens cDNA FLJ12187 fis, clone MA
113702 T97307 gb7e53h05^1 Soares fetal liver spleen
TABLE 7A

Table 7A shows the accession numbers for those pkeys lacking UnigeneIDs for Table 7.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey CAT number Accession

102481 31281_-28 U50360
105032 genbank_AA127818 AA127818
409487 1134778_1 H19886 AW402806 T10231
Table 8 shows genes upregulated in tumor tissue compared to normal breast tissue.
Specifically, one column shows the ratio of expression of the indicated gene in breast tumor 
tissue compared to other body tissues, and another column shows the ratio of expression of 
the indicated gene in breast tumor tissue compared to normal breast tissue. 



Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of tumor to normal body tissue
R2: Ratio of tumor to normal breast tissue
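For readers who want to work with Table 8 programmatically, the following is a minimal sketch of how a single row could be split into the fields defined in the legend above. It assumes each row is one whitespace-separated line ending in the R1 and R2 values, and that rows without a UnigeneID carry a "gb:" description in place of the Unigene fields. The names `Table8Row`, `parse_row`, and `filter_by_r2`, and the cutoff passed to the filter, are hypothetical and are not part of the specification.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Table8Row:
    pkey: str                  # Unique Eos probeset identifier number
    ex_accn: str               # Exemplar (Genbank) accession number
    unigene_id: Optional[str]  # Unigene number, or None for "gb:" rows
    title: str                 # Unigene title (possibly truncated)
    r1: float                  # Ratio of tumor to normal body tissue
    r2: float                  # Ratio of tumor to normal breast tissue


def parse_row(line: str) -> Table8Row:
    """Split one row; assumes the last two tokens are R1 and R2."""
    tokens = line.split()
    if len(tokens) < 5:
        raise ValueError(f"row has too few fields: {line!r}")
    pkey, ex_accn = tokens[0], tokens[1]
    r1, r2 = float(tokens[-2]), float(tokens[-1])
    middle = tokens[2:-2]
    # Assumption: Unigene numbers always start with "Hs."
    if middle[0].startswith("Hs."):
        unigene_id, title_tokens = middle[0], middle[1:]
    else:
        unigene_id, title_tokens = None, middle
    return Table8Row(pkey, ex_accn, unigene_id, " ".join(title_tokens), r1, r2)


def filter_by_r2(rows: List[Table8Row], min_r2: float) -> List[Table8Row]:
    """Keep rows whose tumor/normal-breast ratio is at least min_r2
    (the cutoff is chosen by the caller, not taken from the document)."""
    return [row for row in rows if row.r2 >= min_r2]
```

Rows in which a ratio is missing or illegible would raise an error under this sketch and would need to be handled separately.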



Pkey ExAccn UnigeneID Unigene Title

100075 AF152333 Hs.284160 protocadherin gamma subfamily B, 4
100229 AV652249 Hs.180107 polymerase (DNA directed), beta
100262 D38500 Hs.278468 postmeiotic segregation increased 2-like
100271 BE160081 Hs.256290 S100 calcium-binding protein A11 (calgiz
100355 AI907114 Hs.71465 squalene epoxidase
100522 X51501 Hs.99949 prolactin-induced protein
100552 AA019521 Hs.301946 lysosomal
100599 X77343 Hs.334334 transcription factor AP-2 alpha (activat
100676 X02761 Hs.287820 fibronectin 1
100690 AA383256 Hs.1657 estrogen receptor 1
100895 U01351 Hs.75772 nuclear receptor subfamily 3, group C, m
101046 K01160 NM_002122:Homo sapiens major histocompati .7
101086 AA382524 Hs.250959 histatin 1
101148 NM_002923 Hs.78944 regulator of G-protein signalling 2, 24k
101161 NM_006262 Hs.37044 peripheral
101201 L22524 Hs.2256 matrix metalloproteinase 7 (MMP7; uterin
101212 AI186220 Hs.83164 collagen, type XV, alpha 1
101441 AW468397 Hs.100000 S100 calcium-binding protein A8 (calgran
101447 M21305 gb:Human alpha satellite and satellite 3
101469 AA310162 Hs.169248 cytochrome c
101567 M33552 Hs.56729 lysosomal
101600 BE561617 Hs.119192 H2A histone family, member Z
101624 M55998 gb:Human alpha-1 collagen type I gene, 3
101674 NM_002291 Hs.82124 laminin, beta 1
101861 AA350659 Hs.83347 angio-associated, migratory cell protein
101977 AF112213 Hs.184062 putative Rab5-interacting protein
102193 AL036335 Hs.313 secreted phosphoprotein 1 (osteopontin,
102199 AA334592 Hs.79914 lumican
102304 AF015224 Hs.46452 mammaglobin 1
102345 NM_003480 Hs.300946 microfibril-associated glycoprotein-2
102457 NM_001394 Hs.2359 dual specificity phosphatase 4
102534 U96759 Hs.198307 von Hippel-Lindau binding protein 1
102541 AI379954 Hs.79025 KIAA0096 protein
102827 BE244588 Hs.6456 chaperonin containing TCP1, subunit 2 (b
102962 R50032 Hs.159263 collagen, type VI, alpha 2
102991 AW293542 Hs.75309 eukaryotic translation elongation factor
103119 X63629 Hs.2877 cadherin 3, type 1, P-cadherin (placenta
103175 X69089 Hs.79227 myomesin (M-protein) 2 (165kD)
103286 D38616 Hs.54941 phosphorylase kinase, alpha 2 (liver)
103319 X83492 Hs.82359 tumor necrosis factor receptor superfami
103372 BE536700 Hs.4888 seryl-tRNA synthetase
103419 T34708 Hs.272927 Sec23 (S. cerevisiae) homolog A
103471 Y00815 Hs.75216 protein tyrosine phosphatase, receptor t
103546 Z14244 Hs.75752 cytochrome c oxidase subunit VIIb
s>.0


OR 


A A 


1 0 
o.c 


c.o 


o.o 


1 A 

l.*t 


11.9 


0.4 


3.8 


12 


9,4 


9.4 


3 


7.8 


4.4 


4.4 


1 


3.9 


41.7 


4 


0.8 


4.1 


\2 


12 


3.1 


1.1 


4.4 


0.6 


3.1 


3.4 


0.9 


4.2 


29.9 


0.3 


0.8 


4.9 


1 


5.9 


2.8 


4 


3.1 


1.7 


1.5 


4.1 


3.1 


1.4 


1.3 


6.9 


1.9 


4.9 


22 


3.8 


*2 


0.7 


1.1 


4.2 


4,5 


0.5 


1.4 


42 


0.9 


3.9 


1.5 


10.9 


2.2 


6.2 


5.6 


5.7 


3.7 


0.5 


1.3 


4 


1.3 


3.8 


0.8 


4.6 


0.9 


8 


1.1 


5.1 
103658 NM_000088 Hs.172928 collagen, type I, alpha 1 3.2 3
103758 AA084874 gb:zn13e04.r1 Stratagene hNT neuron (937 0.9 10
103774 H24185 Hs.92918 hypothetical protein 1.9 15.9
103821 AA095971 Hs.198793 Homo sapiens cDNA: FLJ22463 fis, clone H 1.2 3.9
103869 BE439604 Hs.24322 ATPase, H+ transporting, lysosomal (vacu 1.4 3.9
103980 AW130242 Hs.293476 hypothetical protein FKSG44 1.6 4.1
104054 AK001913 Hs.7100 hypothetical protein 1.5 4.3
104115 AF183810 Hs.26102 opposite strand to trichorhinophalangeal 7 7
104189 AB040927 Hs.301804 KIAA1494 protein 2 4.6
104230 AB002347 Hs.15303 KIAA0349 protein 0.7 4.5
104278 AW583693 Hs.109253 N-terminal acetyltransferase complex ard 3.3 3.3
104295 AW365522 Hs.103657 hypothetical protein PRO2219 2.3 4.2
104319 AW804296 Hs.9950 Sec61 gamma 3.1 7
104425 AF283775 Hs.35380 x 001 protein 4 1.3
104432 X51501 Hs.99949 prolactin-induced protein 3.8 0.6
104464 AW966728 Hs.54642 methionine adenosyltransferase II, beta 0.8 6.7
104479 AK001731 Hs.106390 Homo sapiens mRNA; cDNA DKFZp586H0924 (f 1.7
104505 W94824 Hs.11565 RIKEN cDNA 2010100012 gene 2 7.5
104592 AW630488 Hs.325820 protease, serine, 23 1.9 7.4
104613 AF123303 Hs.24713 hypothetical protein 1.1 6.3
104636 R82252 Hs.106106 protein kinase (cAMP-dependent, catalyti 1.2 4
104782 AW270555 Hs.171774 hypothetical protein 1.4 3.9
104792 AA960961 Hs.305953 zinc finger protein 83 (HPF1) 1.5 42
104848 AA305351 Hs.274369 uncharacterized hypothalamus protein HAR 1.1 4.1
104849 AI279065 Hs.241507 ribosomal protein S6 1.3 4.6
104850 AL133035 Hs.8728 hypothetical protein DKFZp434G171 1.2 3.6
104852 W70164 Hs.20107 ESTs 0.8 4.2
104861 AA058630 Hs.29759 RNA POLYMERASE I AND TRANSCRIPT RELEASE 1.7
104873 W03831 Hs.20597 host cell factor homolog 0.8 5.4
104891 W44626 Hs.30627 ESTs 0.7 6.8
104920 AW955089 Hs.306083 Novel human gene mapping to chromosome 22 1 3.9
104926 BE298808 Hs.33363 DKFZP434N093 protein 3.3 3.3
104952 AW076098 Hs.74316 desmoplakin (DPI, DPII) 1.2 3.7
104963 AB029020 Hs.173694 KIAA1097 protein 1.1 5.5
104977 AI392640 Hs.18272 amino acid transporter system A1 3.2 1.4
105030 BE613061 Hs.337772 Homo sapiens, Similar to RIKEN cDNA 0610 1.6 11.4
105035 N39760 Hs.8859 Homo sapiens, Similar to RIKEN cDNA 5830 1.5 7.2
105068 BE410438 Hs.9006 VAMP (vesicle-associated membrane protei 1.1 3.5
105159 AF146277 Hs.265561 CD2-associated protein 1.2 10
105178 AA313825 Hs.21941 AD036 protein 3.6 8.3
105182 BE407961 Hs.18271 golgi phosphoprotein 3 1.7 6.8
105274 AI554929 Hs.281866 ATPase, H+ transporting, lysosomal (vacu 1.1 3.7
105303 BE243327 Hs.182626 chromosome 22 open reading frame 5 1.5 4
105413 AI015709 Hs.172089 Homo sapiens mRNA; cDNA DKFZp586I2022 (f 1.5
105426 W20027 Hs.23439 ESTs 4.3 2.9
105432 W03516 Hs.76698 stress-associated endoplasmic reticulum 1.5 5
105443 AA252372 Hs.12144 KIAA1033 protein 1.2 3.6
105483 AL137257 Hs.23458 Homo sapiens cDNA: FLJ23015 fis, clone L 1.7 15.8
105492 AI805717 Hs.289112 CGI-43 protein 2 4.8
105495 AL037715 Hs.28785 microfibrillar-associated protein 3 13 3.9
105539 AB040884 Hs.109694 KIAA1451 protein 2.7 11.4
105594 AB024334 Hs.25001 tyrosine 3-monooxygenase/tryptophan 5-mo 1.3 6.1
105623 BE504200 Hs.30127 hypothetical protein 1J 4.5
105807 AA788946 Hs.16869 ESTs, Moderately similar to CA1C RAT COL 3.9 24.6
105812 BE614149 Hs.20814 CGI-27 protein 1.8 3.6
105823 AI559444 Hs.293960 ESTs 1.9 6.6
105831 AA329449 Hs.247302 twisted gastrulation 1.5 4.3
105851 AI827976 Hs.24391 hypothetical protein FLJ13612 3.8 1.9
105879 BE392914 Hs.30503 Homo sapiens cDNA FLJ11344 fis, clone PL 1.7 4
105918 AW028485 Hs.26136 hypothetical protein MGC14156 1.7 7.4
105939 AL137728 Hs.12258 Homo sapiens mRNA; cDNA DKFZp434B0920 (f 1.2
105941 AB033075 Hs.10669 development and differentiation enhancin 1.3 4.6
105969 AB030658 Hs.17377 coronin, actin-binding protein, 1C 1.1 5.9
105990 AI690586 Hs.29403 hypothetical protein FLJ22060 2 4.6
106012 AI240665 Hs.8895 ESTs 4.1 1.2
106060 NM_001329 Hs.171391 C-terminal binding protein 2 2.6 7
106070 T74445 Hs.5957 Homo sapiens clone 24416 mRNA sequence 1.4 10.7

4.8
5.1
14
3.8
106083 H62087 Hs.31659 thyroid hormone receptor-associated prot 1.5 3.6
106155 AA425414 Hs.33287 nuclear factor I/B 5.4 1.2
106255 BE613206 Hs.279607 calpastatin 1.8 4
106414 BE568205 Hs.28827 mitogen-activated protein kinase kinase 5.1 6.1
106538 AK000274 Hs.278635 HDCMA18P protein 1.2 5.9
106568 AW051564 Hs.28285 patched related protein translocated in 1.8 5.4
106574 BSM4325 Hs.227280 U6 snRNA-associated Sm-like protein 2.3 112
106613 N88604 Hs.30212 thyroid receptor interacting protein 15 1.2 3.6
106617 H09548 Hs.5367 ESTs, Weakly similar to I38022 hypotheti 0.9 4.4
106619 AA459480 Hs.23956 hypothetical protein FLJ20502 1.3 3.6
106701 BE387614 Hs.25797 splicing factor 3b, subunit 4, 49kD 1.6 7.3
106721 AA741038 Hs.6670 ESTs 1.7 6.1
106776 AA206079 Hs.6693 hypothetical protein FLJ20420 1 5.4
105866 AA487416 Hs.268231 Homo sapiens cDNA: FLJ23111 fis, clone L 1.6 5.4
106868 BE185536 Hs.301183 molecule possessing ankyrin repeats indu 3.3 1.2
106887 BE503373 Hs.334335 hypothetical protein FLJ13576 1.4 6.3
105940 T85594 Hs.339808 hypothetical protein FLJ10120 3.3 1.8
106968 AF216751 Hs.26813 CDA14 3 3
107052 BE391904 Hs.12482 glyceronephosphate O-acyltransferase 1.7 7.6
107061 BE147611 Hs.6354 stromal cell derived factor receptor 1 4.3
107149 AI289507 Hs.299883 hypothetical protein FLJ23399 U 6.5
107222 BE172058 Hs.82689 tumor rejection antigen (gp96) 1 \2 6.9
107233 BE267795 Hs.22595 hypothetical protein FLJ10637 1.4 3.5
107295 AA186629 Hs.80120 UDP-N-acetyl-alpha-D-galactosamine:polyp 2.6 4.3
107679 AA011510 Hs.60512 ESTs 1.8 4
107914 AA027229 Hs.61329 ESTs, Weakly similar to T16370 hypotheti 1.3 3.5
107965 AF109219 Hs.108787 phosphatidylinositol glycan, class N 1.6 3.5
108033 AW368993 Hs.323748 Homo sapiens clone CDABP0086 mRNA sequen 1.8
108060 AA291440 Hs.73149 paired box gene 8 1.1 3.5
108081 AA093668 Hs.28578 muscleblind (Drosophila)-like 0.7 5.6
108137 AI283611 Hs.263479 ESTs, Weakly similar to HMG1_HUMAN HIGH 12
108186 AW068579 Hs.7780 Homo sapiens mRNA; cDNA DKFZp564A072 (fr 3.1
108215 AI879238 Hs.299315 collapsin response mediator protein-5; C 1.5 4.6
108297 AA333660 Hs.71331 hypothetical protein MGC5350 1.5 4
108339 AW151340 Hs.51615 ESTs, Weakly similar to ALU7_HUMAN ALU S 6.3
108371 AA074374 Hs.67639 ESTs 1.3 3.8
108399 AF086070 Hs.237519 EST 1 3.6
108469 AA079487 gb:zm97t08.s1 Stratagene colon HT29 (937 1.5 3.6
108470 AA079500 gb:zm96h10.s1 Stratagene colon HT29 (937 1.1 4.3
108564 M23114 Hs.1526 ATPase, Ca++ transporting, cardiac muscl 2 4.9
108641 AA112059 Hs.429 ATP synthase, H+ transporting, mitochond 1.1 3.5
108668 AA058522 Hs.185751 ESTs U 3.6
108694 AA036725 Hs.61847 ESTs 1.4 3.6
108824 AK001332 Hs.44672 hypothetical protein FLJ10470 1.4 3.5
108863 AA133456 Hs.102548 glucocorticoid receptor DNA binding fact 1.2 4
108893 BE276891 Hs.194691 retinoic acid induced 3 1.3 3.6
108992 AA152312 Hs.72047 ESTs 1.1 4.1
109072 AI732585 Hs.22394 hypothetical protein FLJ10893 U 3.5
109097 AA167512 gb:zp10f12.s1 Stratagene fetal retina 93 1.3 5
109160 BE220601 Hs.301997 hypothetical protein FLJ13033 4 6.1
109244 BE179030 Hs.64239 Human DNA sequence from clone RP5-1174N9 1.7
109481 AA878923 Hs.289069 hypothetical protein FLJ21016 3.8 7.7
109484 AA366263 Hs.72531 hypothetical protein FLJ11838 1.9 4
109795 AA173942 Hs.326416 Homo sapiens mRNA; cDNA DKFZp564H1916 (f 3.7
110009 BE075297 Hs.6614 ESTs, Weakly similar to A43932 mucin 2 p 4.6 7.4
110107 AW151660 Hs.31444 ESTs \2 3.5
110411 AW001579 Hs.9645 Homo sapiens mRNA for KIAA1741 protein, 3.7 3.3
110731 NM_014899 Hs.188006 KIAA0878 protein Z8 3.7
110756 N21207 Hs.182999 ESTs 1.6 3.5
110930 BE242691 Hs.14947 ESTs 3.1 1.2
110935 AI753230 Hs.323562 hypothetical protein DKFZp564K142 1.9 7.5
111051 AI681293 Hs.12186 hypothetical protein FLJ22558 2 4
111110 AK001566 Hs.23618 hypothetical protein FLJ10704 1.1 3.8
111356 BE301871 Hs.4867 mannosyl (alpha-1,3-)-glycoprotein beta- 1 8.2
111357 BE314949 Hs.87128 hypothetical protein FLJ23309 3.3 6.1
111770 R27975 Hs^69401 ESTs, Moderately similar to S65657 alpha 12 5.4
111900 AF131784 Hs.25318 Homo sapiens clone 25194 mRNA sequence 3.2 0.8

8.1
5.6
6.9
4.7
7.4
1.3
111903 NM_014906 Hs.166351 KIAA1072 protein 1 5.4
111951 NM_014927 Hs.100527 KIAA0902 protein 1 3.8
112141 AW137198 Hs.278682 Phosphatidylglycerophosphate Synthase 1.4 3.5
112193 R49499 Hs.138238 ESTs 1.5 3.6
112197 NM_003655 Hs.5637 ESTs 4.6 2
112610 AW500106 Hs.23643 serine/threonine protein kinase MASK 3.3 10.5
112971 Z42387 Hs.83883 transmembrane, prostate androgen induced 3.2 3
112984 T16971 Hs.289014 ESTs, Weakly similar to A43932 mucin 2 p 3.7 10.8
113056 AF019226 Hs.8036 glioblastoma overexpressed 4.5 3.7
i\j i Avvtouooj na. uouuu uypuuicuucu piuicm i.c
113497 AF143321 Hs.15572 hypothetical protein IMAGE 109914 0.9 3.6
113508 AL042936 Hs.211571 holocytochrome c synthase (cytochrome c 1.1 3.5
113531 AK001898 Hs.16740 hypothetical protein FLJ11036 1.2 3.9
113604 AI075407 Hs.296083 ESTs, Moderately similar to 154374 gene 1.7 5.3
113674 NM_014214 Hs.5753 inositol(myo)-1(or 4)-monophosphatase 2 0.8 6.1
113841 W30681 Hs.146233 Homo sapiens cDNA: FLJ22130 fis, clone H 1.7 6.2
113857 AW243158 Hs.5297 DKFZP564A2416 protein 1.2 4.6
113931 BE255499 Hs.3496 hypothetical protein MGC15749 1.5 4
113936 W17056 Hs.83623 nuclear receptor subfamily 1, group I, m 3.8 1
113987 AA345519 Hs.9641 complement component 1, q subcomponent 1.2 4.7
114132 AI342493 Hs.24192 Homo sapiens cDNA FLJ20767 fis, clone CO 0.3 4.3
114156 BE179882 Hs.336920 glutathione peroxidase 3 (plasma) 1.1 4.3
114213 N58309 Hs.19575 CGI-11 protein 1.6 9.2
114636 AA075488 gb:zm88d01.s1 Stratagene ovarian cancer 1.6 3.7
114760 AI929382 Hs.252692 hypothetical protein FLJ20343 1.4 4
114781 T10446 Hs.95388 ESTs 1 4.3
114795 AB037858 Hs.173484 hypothetical protein FLJ10337 1.6 9.2
114901 AV660012 Hs.196437 hypothetical protein FLJ10788 1.4 5.2
115096 AI683069 Hs.175319 ESTs 3.7 1
115518 BE541042 Hs.23240 Homo sapiens cDNA: FLJ21848 fis, clone H 3.2 4.2
115646 N36110 Hs.305971 solute carrier family 2 (facilitated glu 1.5 3.9
115764 AW582256 Hs.91011 anterior gradient 2 (Xenopus laevis) hom 1.3 5.9
115802 AW410233 Hs.206521 YME1 (S.cerevisiae)-like 1 1.7 6.6
115994 AB037836 Hs.109315 KIAA1415 protein 1.5 9.1
116032 BE383668 Hs.42484 hypothetical protein FLJ10618 0.9 4.3
116046 BE395293 Hs.94491 hypothetical protein FLJ20297 1.6 5.5
116274 AI129767 Hs.182874 guanine nucleotide binding protein (G pr 3.2 2.4
116310 Z24854 Hs.42299 ESTs 0.8 4.7
116356 AI371223 Hs.288671 Homo sapiens cDNA FLJ11997 fis, clone HE 2.4 3.9
116429 AF191018 Hs.279923 putative nucleotide binding protein, est 5.5 5.5
116461 AA313607 Hs.58633 Homo sapiens cDNA: FLJ22145 fis, clone H 5 1.3
116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 8.7 4.5
116578 D21262 Hs.75337 nucleolar and coiled-body phosphprotein 3.2 6.9
116579 AW888411 Hs.81915 leukemia-associated phosphoprotein p18 ( 3.2 3
116589 AI557212 Hs.17132 ESTs, Moderately similar to 154374 gene 3.1 8.3
116786 H25836 Hs.301527 ESTs, Moderately similar to unknown [H.s 3.2 4.5
117170 N25929 Hs.42500 ADP-ribosylation factor-like 5 7 5.5
117215 N20066 Hs.133207 PTPRF interacting protein, binding prote 1.2 6.2
117280 M18217 Hs.172129 Homo sapiens cDNA: FLJ21409 fis, clone C 4.5 2.4
117576 AI383467 Hs.44597 ESTs 1.4 4.2
117667 U59305 Hs.44708 Ser-Thr protein kinase related to the my 4.3 0.5
117881 AF161470 Hs.260622 butyrate-induced transcript 1 2.1 5.7
118336 BE327311 Hs.47166 HT021 3.6 7.7
118475 N66845 gb:za46c11.s1 Soares fetal liver spleen 4.2 0.5
118493 AL353944 Hs.50115 Homo sapiens mRNA; cDNA DKFZp761J1112 (f 3.5 3.3
118505 N67343 gb:yz50b07.s1 Morton Fetal Cochlea Homo 2.1 3.8
119159 AF142419 Hs.15020 homolog of mouse quaking QKI (KH domain 3.7 1.5
119307 BE048061 Hs.37054 ephrin-A3 3 1.1
119355 BE218319 Hs.5807 GTPase Rab14 1.1 5.6
119462 BE041667 Hs.314544 Homo sapiens cervical cancer suppressor- 1.4 4.3
119771 AI905687 Hs.2533 EST 3.2 1
119940 AL050097 Hs.272531 DKFZP586B0319 protein 4.3 0.7
119943 BE565849 Hs.14158 copine III 3.5 1.9
120407 AA235207 Hs.250456 hypothetical protein DKFZp762F2011 1.5 3.7
120493 AW968080 Hs.152939 Homo sapiens clone 24630 mRNA sequence 4 1.4
120677 AF150208 Hs.108327 damage-specific DNA binding protein 1 (1 1.6 6.8
120867 AA350781 Hs.96967 ESTs 1.1 3.6
121368 BE262956
121603 AA415785 
121723 AA243499 
122223 AF169797 
122378 AB032948 
122946 AI718702
123155 AF121856 
123158 AF161426 
123327 AA421581 
123495 W28673 
123526 AA608657 
123533 AA608751 
123768 AI932318 
123961 AL050184 

123999 AF084555 

124000 BE563957 
124038 AB037860 
124059 BE387335 
124083 AW195237 
124148 BE300094 
124153 AU077333 
124252 BE613340 
124314 AK001552 
124375 D87454 
124432 N39016 
124447 N48000
124539 D54120 
124543 AI393320 
124564 H66409 
124574 AL036596 
124605 AA749315 
124639 H60193 
124659 AI680737 
124737 BE270465 
124760 AW408586 
124763 BE410405 
124792 R44357 
124842 R56485 
124940 AF068846 
124949 AI903210
124960 AL023513 
124995 T52700 
125030 AA610577 
125034 BE548446 
125058 T83731 
125076 AA973971 
125090 T91518 
125103 AA570056 

125154 W38419 

125155 AA837043 
125159 AK000669 
125170 AL020996 
125181 R40815 
125193 W67577 
125260 H05635 
125262 AW884980 
125272 BE612888 
125388 W27235 
125824 Z45258 
125852 AW630088 
125970 AW504721 
126192 AW160399 
126469 BE384361 
126510 AA057593 
127095 AA340277 
127524 AI243596 
128312 J04182 



Hs.178292 

Hs.249495 

Hs.104800 

Hs.27413 

Hs.21356 

Hs.308026 

Hs.284291 

Hs.218329 

Hs.178443 

Hs.106747 



Hs.188762 

Hs.21610 

Hs.7351 

Hs.74861 

Hs.173933 

Hs.283713 

Hs.7734 

Hs.227751 

Hs.160483 

Hs.334725 

Hs.215766 

Hs.192966 

Hs.268869 

Hs.146409 

Hs.104573 

Hs.108275 

Hs.42322 

Hs.77171 

Hs.21143 

Hs.289068 

Hs.78793 

Hs.91052 

Hs.76288 

Hs.48712 

Hs.103804 

Hs.336780 

Hs.194766 

Hs.110044

Hs.187775 

Hs.5167 

Hs.3343 



Hs.122730 

Hs.143669 

Hs.274428 

Hs.8518 

Hs.12396 

Hs.84298 

Hs.294030 

Hs.171957

Hs.180224 

Hs.64311 

Hs.286013 

Hs.76550 

Hs.177516 

Hs.30376 

Hs.182885 

Hs.334762 

Hs.10248 

Hs.94830 

Hs.150101 



KIAA0180 protein

heterogeneous nuclear ribonucleoprotein
hypothetical protein FLJ10134
adaptor protein containing pH domain, PT
hypothetical protein DKFZp762K2015
major histocompatibility complex, class
sorting nexin 6



ESTs

serine carboxypeptidase 1 precursor prot
gb:ae55d04.s1 Stratagene lung carcinoma
gb:ae56h07.s1 Stratagene lung carcinoma 



1.5 
2.2 
3.4 
3.9 
1.4 
1.4 
1.2 
2.4 
0.9 
1.3 
2.1 
2.1 



ESTs, Moderately similar to H2BL_HUMAN H 1.1

DKFZP434B203 protein 1.1

cyclic AMP phosphoprotein, 19 kD 1.4

activated RNA polymerase II transcriptio 1.9

nuclear factor I/A 1.5

ESTs, Weakly similar to S64054 hypotheti 14.8

hypothetical protein FLJ22174 1.2

lectin, galactoside-binding, soluble, 1 2.5

erythrocyte membrane protein band 7.2 (s 1

Homo sapiens, Similar to RIKEN cDNA 9430 1.5

GTP-binding protein 1.8

KIAA0265 protein 1.1

ESTs, Weakly similar to ALUC_HUMAN IJ1I 1.3

gb:yy98e12.s1 Soares_multiple_sclerosis_ 2.7

cell division cycle 42 (GTP-binding prot 2.1

ESTs 1

ESTs 1.4

A kinase (PRKA) anchor protein 2 0.7

minichromosome maintenance deficient (S. 1.1

DKFZP586C1324 protein 1.4

Homo sapiens cDNA FLJ11918 fis, clone HE 1.5

protein kinase C, zeta 0.7

ESTs, Moderately similar to ALU5_HUMAN A 0.9

calpain 2, (m/II) large subunit 1.3

hypothetical protein FLJ20736 1.8

gb:yg93h09.s1 Soares infant brain 1NIB H 1

heterogeneous nuclear ribonucleoprotein 3.2

tubulin, beta polypeptide 1

seizure related gene 6 (mouse)-like 0.9

ESTs 0.9

ESTs \2

Homo sapiens mRNA; cDNA DKFZp434F152 (fr

phosphoglycerate dehydrogenase 0.9
gb»q02h08^1 NCI_CGAP_Lu5 Homo sapiens

gbye208)5.s1 Stratagene lung (937210) H 3.2

ESTs, Moderately similar to KIAA1215 pro 5.3

gb2c78a07^1 Pancreatic Islet Homo sapi 0.9

ESTs 1.1

TRF2-interacting telomeric RAP1 protein 1.1

selenoprotein N 1.1

ESTs, Weakly similar to 2004399A chromos 1

CD74 antigen (invariant polypeptide of m 1.2

topoisomerase-related function protein 4 1

triple functional domain (PTPRF interact 1.3

myosin regulatory light chain 1.1

a disintegrin and metalloproteinase doma 1.4

short coiled-coil protein 2.4
Homo sapiens mRNA; cDNA DKFZp564B1264 (f

high density lipoprotein binding protein 1.9

hypothetical protein 1.4

ESTs, Weakly similar to JC5024 UDP-galac 2

hypothetical protein FLJ14735 1.3
Homo sapiens cDNA FLJ20167 fis, clone CO 1.3

ESTs, Moderately similar to T03094 A-Wn 4.3

lysosomal 1.5
128453 X02761 Hs.287820 fibronectin 1 1.2 4.3
128460 T16206 Hs.237164 ESTs, Highly similar to LDHH_HUMAN L-LAC 3.1 44.4
128491 H08379 Hs.165563 hypothetical protein DKFZp434N1429 0.6 13.1
128495 NM_005904 Hs.100602 MAD (mothers against decapentaplegic, Dr 1.3 4
128546 NM_003478 Hs.101299 cullin 5 1 5.1
128574 AI185977 Hs.38260 ubiquitin specific protease 18 0.8 4
128611 NM_014721 Hs.102471 KIAA0680 gene product 1.3 3.7
128652 AA432202 Hs.103147 hypothetical protein FLJ21347 1.4 3.9
128653 D87432 Hs.10315 solute carrier family 7 (cationic amino 1.2 3.6
126655 AI246669 Hs.324275 WW domain-containing protein 1 o a 4.1
128684 BE246444 Hs.283685 hypothetical protein FLJ20396 3 1.6
128717 AK001564 Hs.104222 hypothetical protein FLJ10702 2.8 4.8
128774 AA476220 Hs.54457 CD81 antigen (target of antiproliferativ 1.1 10.6
128790 AF098899 Hs.105700 secreted frizzled-related protein 4 1 38
o.o 


128805 


AA1 94554 


Hs.183434 ATPase, H+ transporting, lysosomal (vacu 


5.3 


53 
o.o 


1 98897 


41838184 


Hs.106334 Homo sapiens clone 23836 mRNA sequence 2.2


0.0 


1 98840 


AI917R09 


Hs.106440 ESTs 


1 


4 <; 

M.O 


128869 


AA768242 


Hs.80618 hypothetical protein


0.8 


36 
o.o 


128889 


050905 


Hs.106909 DKFZP566D193 protein


4.6 


3.7 


198890 


41999090 


Hs.182364 CocoaCrisp 


3 


1 5 

I.O 


128915 


AK000140 


Hs.107139 hypothetical protein


0.2 


39 




AA899037 


Hs.166468 programmed cell death 5


2.5 


1*5 9 


128926 


API 55096 


Hs.107213 hypothetical protein FLJ20585


4 


4 


\CQ SOU 


A 4908988 

MAioOaOO 


Hs.10724 MDS023 protein 


1.2 


4 5 


1 98049 


AW947R38 


Hs.10729 hypothetical protein 


1.4 


c 

0 




AW983R99 


Hs.223025 RAB31 , member RAS oncogene family 


2.3 


o.o 


128953 

l£O300 




Hs.107362 KIAA0909 protein 


0.9 


3.9 


198979 

1 COS I O 


AW971217 


Hs^81434 Homo sapiens cDNA FU14028 fis, clone HE 1.5 


3.6 




AA988094 


Hs.10758 NM_002495*:Homo sapiens NADH dehydrogena 


08 
u.o 


199001 


Ml/ l UU£0 


Hs.13323 hypothetical protein FLJ22069


1.2 


57 

O.I 


129009 


C15105 


Hs.330716 Homo sapiens cDNA FLJ14368 fis, clone HE 2.1


9.9 


129013 


AA371156 

nnJ( 1 IOU 


Hs.107942 DKFZP564M112 protein


2.4 


3.8 


199088 

l£OUOO 


AlfrtA'W 

nlQOHOZX 


Hs.152925 KIAA1268 protein 


1.2 


38 

o.o 


129106 


AW504486 

/All JUttUU 


Hs.108689 sterol regulatory element binding transc 


1.2 


5.5 


199113 


RP543905 


Hs.288771 DKFZP586A0522 protein


0.5 


37 

O.f 


129125 


ABO 02450 


Hs.278391 CGI-109 protein 


1 


5.2 


129126 


AW881089 

MViOO IU03 


Hs.108806 Homo sapiens mRNA; cDNA DKFZp566M0947 (f


1.5 




N93018 

m£OU IO 


Hs.171391 C-terminal binding protein 2


2.1 


9 7 


129230 


AA335362 


Hs.109646 Empirically selected from AFFX single pr 


0.9 


8.6 


199934 


M18918 

IVI 1 09 ID 


Hs.282997 glucosidase, beta; acid (includes glucos


1.1 


o.o 


129238 


RF549914 


Hs.109697 ESTs 


1.1 


12,8 


199939 


vvo/ooo 


Hs.109701 ubiquitin-like 5


3.2 


5 1 

J. 1 


199941 


AI878887 
MlOf 000 1 


Hs.109706 hematological and neurological expressed 


1.9 


57 

O.f 


19Q943 




Hs.109727 TAK1-binding protein 2; KIAA0733 protein


1.2 


8 8 
0.0 


19Q9A7 


RA0Q9A 


Hs.109733 CGI-131 protein 


1.5 


3 8 
0.0 


l£3£OU 




Hs.109760 Empirically selected from multiple AFFX 


1 


<; 4 

0.4 




AAzouy/u 


Hs.251946 poly(A)-binding protein, cytoplasmic 1-1 


1.3 


4 1 




nruf i 4UU 


Hs.279813 hypothetical protein 


1.6 


0.9 




AAoa/ loo 


Hs.109918 ras homolog gene family, member H 


1.8 


A 9 




ABOO/osb 


Hs. 1 10 putative L-type neutral amino acid trans 


1.1 


0.1 


129284 AA318224 Hs.296141 ESTs 2.5 4.8 

129300 W94197 Hs.110165 ribosomal protein L26 homolog 1.6 5.1 

129318 AF189062 Hs.285976 tumor metastasis-suppressor 1.8 6.5 

129352 AW511656 Hs.170177 Meis1 (mouse) homolog O.fl 4 

129362 U30246 Hs.110736 solute carrier family 12 (sodium/potassi 1.4 9.2 

129379 BE278964 Hs.11085 CGI-111 protein 1 4.8 

129390 AA318271 Hs.250905 hypothetical protein 1 4.1 

129416 AA016188 Hs.111244 hypothetical protein 1.8 10.7 

129427 AI498631 Hs.111334 ferritin, light polypeptide 1.1 4.8 

129470 W92931 Hs.250899 heat shock factor binding protein 1 1.8 9.3 

129472 AL050260 Hs.323817 DKFZP547E1010 protein 1 5 

129475 NM_004477 Hs.203772 FSHD region gene 1 1.1 4.2 

129498 AA449789 Hs.75511 connective tissue growth factor 1.9 6.8 

129501 AI631811 Hs.180403 STRIN protein 1.1 97 

129527 AA769221 Hs.270847 delta-tubulin 1.1 4.3 

129545 R16087 Hs.323769 cisplatin resistance related protein CRR .1 4.2 

129579 AW517695 Hs.286218 junctional adhesion molecule 1 2.3 3.5 

3.8 

129606 AW958941 Hs.166254 hypothetical protein DKFZp566I133 2.4 4.4 

129619 AA209534 Hs.284243 tetraspan NET-6 protein 3.2 13 

129620 D79338 Hs.239720 CCR4-NOT transcription complex, subunit 1.6 4.6 

129621 AL110212 Hs.301005 purine-rich element binding protein B 1.1 57 

129634 AB020335 Hs.181300 sel-1 (suppressor of lin-12, C.elegans)- 0.9 4.3 

129663 AI207406 Hs.11866 translocase of inner mitochondrial membr 1.9 4.8 

129679 AW889132 Hs.11916 ribokinase 0.9 4.1 

129688 U53209 Hs.24937 transformer-2 alpha (htra-2 alpha) 1.3 4.7 

129691 M26939 Hs.119571 collagen, type III, alpha 1 (Ehlers-Danl 4.7 3.7 

129712 U46386 Hs.12102 sorting nexin 3 \2 3.6 

129747 AL050272 Hs.12305 DKFZP566B183 protein 1 8.9 

129788 BE397454 Hs.124969 Homo sapiens clone 24707 mRNA sequence 1.4 3.6 

129796 BE218319 Hs.5807 GTPase Rab14 2.9 5.1 

129797 M62839 Hs.1252 apolipoprotein H (beta-2-glycoprotein I) 0.3 5.1 

129800 AF052112 Hs.12540 lysosomal 1.6 8.8 

129834 AL080084 Hs.296155 CGI-100 protein 0.9 5.3 

129836 AW410233 Hs.206521 YME1 (S.cerevisiae)-like 1 1.8 9.9 

129843 NM_014840 Hs.200598 KIAA0537 gene product 0.9 3.6 

129374 AA626937 Hs.181551 hypothetical protein MGC2594 1.4 9.5 

129878 Z43161 Hs.283714 30 kDa protein 1.1 6.3 

129904 AL119499 Hs.13285 neuronal potassium channel alpha subunit 1 3.5 

129917 M30773 Hs.278540 protein phosphatase 3 (formerly 2B), reg 2 5.1 

129976 X14008 Hs.234734 lysosomal 0.9 4.9 

129982 Z14221 gb:H.sapiens germline transcript of Ig h 1.2 3.6 

130007 R15917 Hs.142570 Homo sapiens clone 24629 mRNA sequence 4.3 1.3 

130060 BE277024 Hs.146381 RNA binding motif protein, X chromosome 1.6 3.8 

130064 X57815.comp Empirically selected from AFFX single pr U 8.2 

130068 M93143 Hs.262869 plasminogen-like 1.4 7.9 

130090 H97878 Hs.132390 zinc finger protein 36 (KOX 18) 1.4 12.3 

130095 AK001635 Hs.14838 hypothetical protein FLJ10773 0.2 4.6 

130102 W61005 Hs.14696 DHHC1 protein 1 . 4.1 

130112 AA916785 Hs.180610 splicing factor proline/glutamine rich ( U 5.3 

130115 T47294 Hs.149923 X-box binding protein 1 3.8 0.8 

130123 NM_005095 Hs.150390 zinc finger protein 262 1 4.2 

130150 BE094848 Hs.15113 homogentisate 1,2-dioxygenase (homogenti 0.5 4 

130161 R42678 Hs.151385 KIAA0564 protein 1 3.7 

130210 M23115 Hs.1526 ATPase, Ca++ transporting, cardiac muscl 0.4 4.4 

130213 BE278370 Hs.15265 heterogeneous nuclear ribonucleoprotein 1.7 7.5 

130215 BE301883 Hs.152707 glioblastoma amplified sequence 1 5.6 

130232 U29463 gb:Human cytochrome b561 gen 1.2 A2 

130252 U92014 Hs.153527 Homo sapiens pTM5 mariner-like transposo 1.3 3.6 

130281 W78907 Hs.15395 similar to arginyl-tRNA synthetase (argi 1.5 4.4 

130343 AB040914 Hs.278628 KIAA1481 protein 2.9 7.5 

130385 AW067800 Hs.155223 stanniocalcin 2 3.2 0.2 

130414 AW842182 Hs.241392 small inducible cytokine A5 (RANTES) 1.4 10.6 

130417 AW163518 Hs.155485 huntingtin interacting protein 2 1.7 11.7 

130440 AA852868 Hs.132853 KIAA0171 gene product 1.1 5 

130442 NM_006245 Hs.118244 protein phosphatase 2, regulatory subuni 1.4 4.3 

130465 AW362955 Hs.15641 Homo sapiens cDNA FLJ14415 fis, clone HE 1.6 7.6 

130479 R44163 Hs.12457 hypothetical protein FLJ10814 0.9 4.1 

130499 AB007915 Hs.158286 KIAA0446 gene product 1 3.8 

130546 AI598022 Hs.193989 TAR DNA binding protein 1.3 4.7 

130568 AA232119 Hs.16085 putative G-protein coupled receptor 1.2 9.4 

130606 AI652143 Hs.288382 hypothetical protein FLJ13111 1 4.1 

130612 BE242873 Hs.16677 WD repeat domain 15 1.1 3.6 

130616 AL049963 Hs.284205 up-regulated by BCG-CWS 0.6 3.8 

130623 AUM5128 Hs.1691 glucan (1,4-alpha-), branching enzyme 1 0.9 6.6 

130629 AL042896 Hs.1697 ATPase, H+ transporting, lysosomal (vacu 0.9 3.9 

130632 AW073971 Hs.238954 ESTs, Weakly similar to KIAA1204 protein 0.9 6.9 

130639 AI557212 Hs.17132 ESTs, Moderately similar to I54374 gene 2.6 3.9 

130641 AF158555 Hs^39189 glutaminase 1.2 13^ 

130653 AI861791 Hs.278479 TSPY-like 1.3 4 

130655 AI831962 Hs.17409 cysteine-rich protein 1 (intestinal) 2.5 4 

130666 AL117508 Hs.194035 KIAA0737 gene product 1.3 6.2 

130669 AI928985 Hs.17680 hypothetical protein MGC1314 similar to 1.4 35 

130693 R68537 Hs.17962 ESTs 3^ 0.8 

130694 NIAJM627 Hs.17969 KIAA0663 gene product 1.1 4.8 

130696 AA325308 Hs.18016 Homo sapiens mRNA; cDNA DKFZp586H0324 (f 1.8 

130701 298883 Hs.16079 phosphatidylinositol glycan, class Q 1.1 6.7 

130707 AW190925 Hs.203559 hypothetical protein FLJ12701 1.2 4.1 

130731 AI932971 Hs.18593 Homo sapiens cDNA: FLJ21449 fis, clone C 1.4 6.9 

130787 AF072813 Hs.252831 reticulon 3 1.2 11.2 

130796 AA088809 Hs.19525 hypothetical protein FLJ22794 1.8 6.8 

130803 NM_001761 Hs.1973 cyclin F 1.3 4.1 

130863 Y10805 Hs.20521 HMT1 (hnRNP methyltransferase, S. cerevi 3.2 5.9 

130902 AB037750 Hs.21061 KIAA1329 protein 1 3.8 

130908 AW195747 Hs.21122 hypothetical protein FLJ11830 similar to 1.3 7.9 

130911 BE409769 Hs.21189 DnaJ (Hsp40) homolog, subfamily A, membe 2.7 3.7 

130913 BE390905 Hs.21198 translocase of outer mitochondrial membr 1.9 4 

130923 H96115 Hs.21293 UDP-N-acetylglucosamine pyrophosphorylas 1.9 10.3 

130959 AB023182 Hs.184523 KIAA0965 protein 1.5 6.8 

130987 AA393071 Hs.182579 leucine aminopeptidase 1.4 5.5 

130975 AA099923 Hs.283728 PEST-containing nuclear protein 1.3 3.8 

131037 BE243101 Hs.22391 chromosome 20 open reading frame 3 1.9 4.1 

131039 D87436 Hs.166318 lipin 2 1.6 3.5 

131060 AA194422 Hs.22564 myosin VI 4.5 5 

131097 AL137682 Hs.22937 I-kappa-B-interacting Ras-like protein 2 2 3.7 

131101 BE387561 Hs.22931 DKFZP586M1523 protein 1.6 4.5 

131104 W27770 Hs.301756 ESTs, Weakly similar to T31475 hypotheti 0.9 3.5 

131107 BE620886 Hs.75354 GCN1 (general control of amino-acid synt 2.1 4.5 

131109 BE564123 Hs.23060 DKFZP564F0522 protein 1.1 4.6 

131136 AB033099 Hs.23413 KIAA1273 protein 1.2 4.2 

131148 AW953575 Hs.303125 p53-induced protein PIGPC1 4.5 13.5 

131150 X77753 Hs.23582 tumor-associated calcium signal transduc 3.4 0.4 

131156 AI472209 Hs.323117 ESTs 0.8 4.9 

131164 AW013807 Hs.182265 keratin 19 3.3 2.4 

131181 H25094 Hs.293663 ESTs, Moderately similar to 138022 hypot 0.6 4 

131194 AW864222 Hs.24083 KIAA0997 protein 1.4 3.8 

131199 AW979155 Hs.298275 amino acid transporter 2 1.2 8.5 

131215 AL050107 Hs.24341 transcriptional co-activator with PDZ-bi 0.7 4.7 

131216 AI815486 Hs.243901 Homo sapiens cDNA FLJ20738 fis, clone HE 2.1 8.2 

131233 D89053 Hs.268012 fatty-acid-Coenzyme A ligase, long-chain 1.7 3.5 

131237 AW956668 Hs.24608 DKFZP564D177 protein 1.3 5.4 

131262 AU077158 Hs.24930 tubulin-specific chaperone a 1.6 4.8 

131263 AU077002 Hs.24950 regulator of G-protein signalling 5 1.4 4.4 

131367 AI750575 Hs.173933 nuclear factor I/A 3.3 

131372 AW293399 Hs.144904 nuclear receptor co-repressor 1 1.6 3.9 

131373 NM_006052 Hs.26146 Down syndrome critical region gene 3 1 11.1 

131388 NM_014810 Hs.92200 KIAA0480 gene product 5 2 

131492 AI452601 Hs.288869 nuclear receptor subfamily 2, group F, m 0.9 3.5 

131493 AW960146 Hs.284137 hypothetical protein FLJ12888 1 3.5 

131514 BE270734 Hs.2795 lactate dehydrogenase A 2 6.5 

131524 AB040927 Hs.301804 KIAA1494 protein 1.5 10.7 

131528 AU076408 Hs.28309 UDP-glucose dehydrogenase 13 4.7 

131534 AF157326 Hs.184786 TBP-interacting protein 1.3 4.9 

131555 T47364 Hs.278613 interferon, alpha-inducible protein 27 1.5 8 

131578 AA936296 Hs.234265 DKFZP586G011 protein 1.8 3.5 

131589 C18825 Hs.29191 epithelial membrane protein 2 1.3 8.2 

131609 D83032 Hs.169984 nuclear protein 2.8 3.9 

131626 BE514605 Hs.289092 Homo sapiens cDNA: FLJ22380 fis, clone H 1-3 112 

131670 H03514 Hs.10130 ESTs 1.3 4.8 

131697 C19034 Hs.288613 Homo sapiens cDNA FLJ14175 fis, clone NT 3.2 9.7 

131701 AF103798 Hs.30819 hypothetical protein 1.3 $2 

131703 AW160865 Hs.30888 cytochrome c oxidase subunit VIIa polype 1.3 7.8 

131739 AF017986 Hs.31386 secreted frizzled-related protein 2 10.6 14.7 

131764 AI805664 Hs.31731 peroxiredoxin 5 1.1 3.6 

131781 AF077036 Hs.31989 DKFZP586G1722 protein 1.6 3.7 

131791 X62111 gb:H.sapiens Vli-5 gene for immunoglobul 1.1 3.5 

131853 AI681917 Hs.3321 ESTs, Highly similar to IRX1 HUMAN IROQU 5.3 U 

131870 NM_014874 Hs.3363 KIAA0214 gene product 0.6 A2 

131903 NM_004642 Hs.3436 deleted in oral cancer (mouse, homolog) 2.4 4.9 

131913 AW207440 Hs.185973 degenerative spermatocyte (homolog Droso 2.4 6 

131930 M772603 Hs.69476 Homo sapiens cDNA FLJ12758 fis, clone NT 1.7 9.2 

131941 BE252983 Hs.35086 ubiquitin specific protease 1 0.5 5.2 

131947 AI123939 
131961 AA129782 
131964 AW381148 
131974 AF208856 
131983 AF1 19665 
131997 AF229181 
132006 AW162336 
132063 BE277910 
132065 BE379335 
132071 AF217798 
132079 AI701457 
132094 NMJ)16Q45 
132116 AW960474 
132164 AI752235 
132181 AW961231 
132208 AL031709 
132258 AA306325 
132303 BE1 77330 
132316 U28831 
132358 NM.003542 
132384 AA312135 
132397 AA021160 
132413 AW361383 
132442 AW970859 
132534 BE388673 
132540 BE396290 
132554 AF065391 
132575 AV660538 
132585 AF029750 
132602 AW606927 
132608 AA353044 

132718 NMJ304600 

132719 AI264357 
132730 AK000868 
132765 BE222975 
132782 F07424 
132793 AB020713 
132805 AW975748 
132863 BE268048 
132894 D63209 
132930 AA579258 

132932 AW1 18826 

132933 BE263252 
132965 AI246173 
132984 BE539199 
132990 X77343 
132998 Y00062 
133002 AW499985 

133011 NM.O06379 

133012 AA847843 
133040 AW502761 
133056 H12028 
133063 AI654133 
133067 AK000708 
133080 AF089816 
133110 AA808177 
133150 AV655783 
133175 AW955632 
133199 AF231981 
133203 AA464362 
133206 AB037773 
133221 W32474 
133229 AL137480 
133241 AW796524 
133257 8E617892 
133271 Z48633 
133273 N27672 



Hs.182997 ESTs 

Hs.3576 Homo sapiens mRNA full length insert cDN 
Hs.198365 2,3-bisphosphoglycerate mutase 
Hs.268122 hypothetical protein 
Hs.184011 pyrophosphatase (inorganic) 
Hs.136644 CS box-containing WD protein 
Hs.3709 low molecular mass ubiquinone-binding pr 
Hs.3833 3'-phosphoadenosine 5'-phosphosulfate sy 
Hs.211594 proteasome (prosome, macropain) 26S subu 
Hs.3850 LIS1-interacting protein NUDEL; endoolig 
Hs.38694 ESTs 
Hs.3945 CGI-107 protein 
Hs.40289 ESTs 

Hs.41270 procollagen-lysine, 2-oxoglutarate 5-dio 
Hs.16773 Homo sapiens clone TCCCIA00427 mRNA sequ 
Hs.241575 hypothetical protein CAB56184 
Hs.4311 SUMO-1 activating enzyme subunit 2 
Hs.325093 Homo sapiens cDNA: FLJ21210 fis, clone C 
Hs.44566 KIAA1641 protein 
Hs.46423 H4 histone family, member G 
Hs.46967 HSPC034 protein 
Hs.4750 hypothetical protein DKFZp564K0822 
Hs.260116 metalloprotease 1 (pitrilysin family) 
Hs.313503 ESTs 

Hs.5086 hypothetical protein MGC10433 
Hs.5097 synaptogyrin 2 
Hs.194718 zinc finger protein 265 
Hs.284162 60S ribosomal protein L30 isolog 
Hs.179600 TAP binding protein (tapasin) 
Hs.5306 hypothetical protein WFZp5$mm simfl 
Hs.5321 ARP3 (actin-related protein 3, yeast) ho 
Hs.554 Sjogren syndrome antigen A2 (60kD, ribon 
Hs.55405 hypothetical protein MGC16212 
Hs.5570 hypothetical protein FLJ10006 
Hs.56205 insulin induced gene 1 
Hs.279840 zinc finger protein 222 
Hs.56966 KIAA0906 protein 
Hs.5724 sclerostin 

Hs.236494 RAB10, member RAS oncogene family 
Hs.5944 solute carrier family 11 (proton-coupled 
Hs.6083 Homo sapiens cDNA: FLJ21028 fis, clone C 
Hs.6093 Homo sapiens cDNA: FLJ22783 fis, clone K 
Hs.6101 hypothetical protein MGC3178 
Hs.191460 hypothetical protein MGC12936 
Hs.62112 zinc finger protein 207 
Hs.334334 transcription factor AP-2 alpha (activat 
Hs.170121 protein tyrosine phosphatase, receptor t 
Hs.42915 ARP2 (actin-related protein 2, yeast) ho 
Hs.171921 sema domain, immunoglobulin domain (Ig), 
Hs.62711 Homo sapiens, clone IMAGE:3351295, mRNA1 
Hs.30909 KIAA0430 gene product 
Hs.6396 jumping translocation breakpoint 
Hs.30212 thyroid receptor interacting protein 15 
Hs.169764 hypothetical protein FLJ20701 
Hs.6454 chromosome 19 open reading frame 3 
Hs.65228 ESTs 

Hs.661 Empirically selected from AFFX single pr 
Hs.66666 ESTs, Weakly similar to S19560 proline-r 
Hs.250175 homolog of yeast long chain polyunsatura 
Hs.6748 hypothetical protein PP1665 
Hs.6762 hypothetical protein 
Hs.301746 RAP2A, member of RAS oncogene family 
Hs.6834 KIAA1014 protein 
Hs.68644 Homo sapiens microsomal signal peptidase 
Hs.6895 actin related protein 2/3 complex, subun 
Hs.283742 H.sapiens mRNA for retrotransposon 
dendritic cell protein 
133287 AW797437 

133291 BE297855 

133292 AA3Q4961 
133294 AJ001388 
133300 AF1 16666 
133302 X04898 
133308 U56979 
133347 BE257758 
133370 AF245505 
133404 AS007916 
133408 AI738719 
133422 AB033061 
133442 AL137663 

133448 M27749 

133449 AF038962 
133501 AI962602 
133504 NM.Q04415 
133506 BE562958 
133532 D87452 
133574 H97991 
133586 AI929645 
133589 L37368 
133591 AI423369 
133606 U10564 
133617 BE244334 
133651 AI301740 
133660 H14843 
133663 AJ006239 

" 133668 L77964 
133671 AW503116 
133681 AI352558 
133694 W17187.comp 
133708 AI018666 . 
133737 AW001130 
133743 AI929587 
133750 BE410769 
133765 M62194 
133776 BE268649 

133799 W24087 

133800 AF075337 
133802 AW239400 
133806 D25969 
133817 AW578716 
133829 AW630088 
133841 AA345824 
133845 AA147026 
133863 AI815523 
133887 X07767 
133892 AW859528 

133913 AU076964 

133914 AI458213 
133917 AL031177 
133947 L41066 

133986 M54968 

133987 L15409 

133989 AL040328 

133990 R48316 
134029 BE150882 
134040 NMJJ03470 
134042 AI027881 
134049 AF1 17236 
134095 NMJJ04354 
134098 BE513171 
134207 Z43039 
134210 AF035606 
134218 U77735 
134270 X68194 



Hs.69771 
Hs.69855 
Hs.699 
Hs.69997 
Hs.70333 
Hs.237658 
Hs.250651 
Hs.71475 
Hs.72157 
Hs.214646 
Hs.198427 
Hs.73287 
Hs.7378 
Hs.288168 
Hs.7381 
Hs.74284 
Hs.74316 
Hs.74346 
Hs.74579 
Hs.193313 
Hs.225936 
Hs.75104 
Hs.75111 
Hs.75188 
Hs.75249 
Hs.173381 
Hs.303154 
Hs.75438 
Hs.271980 
Hs.301819 
Hs.75544 
Hs.232400 
Hs.75667 
Hs.75824 
Hs.75847 
Hs.75873 
Hs.75929 
Hs.177766 
Hs.76285 
Hs.76293 
Hs.76297 
Hs.76325 
Hs.7644 
Hs.76550 
Hs.76688 
Hs.76704 
Hs.76930 
Hs.77271 
Hs.301497 
Hs.7753 
Hs.77542 
Hs.7756 
Hs.77810 
Hs.184050 
Hs.174007 
Hs.78202 
Hs.7822 
Hs.143601 
Hs.78683 
Hs.7869 
Hs.78825 
Hs.79069 
Hs.79086 
Hs.170198 
Hs.80019 
Hs.80205 
Hs.80919 



B-factor, properdin 1.3 
NRAS-related gene 1.4 
peptidylprolyl isomerase B (cyclophilin 2^2 
zinc finger protein 238 1.5 
hypothetical protein MGC10753 1.4 
apolipoprotein A-II 0.2 
H factor 1 (complement) 0.6 
acid cluster protein 33 \2 
DKFZP564I1922 protein 3.7 
KIAA0447 gene product 1.4 
hexokinase 2 0 9 

KIAA1235 protein \2 
Homo sapiens mRNA; cDNA DKFZp434G227 (fr 
immunoglobulin lambda-like polypeptide 1 1.1 
voltage-dependent anion channel 3 0.7 
hypothetical protein MGC2714 3.1 
desmoplakin (DPI, DPII) 4.3 
hypothetical protein MGC14353 1.8 
KIAA0263 gene product 1.2 
Empirically selected from AFFX single pr 1.4 
synapsin I 0.8 
RNA-binding protein S1, serine-rich doma 2 
protease, serine, 11 (IGF binding) 2.1 
wee1+ (S. pombe) homolog 3.3 
ADP-ribosylation factor-like 6 interacti 2.3 
dihydropyrimidinase-like 2 0.8 
popeye protein 3 1 
quinoid dihydropteridine reductase 0.5 
mitogen-activated protein kinase 6 1.1 
zinc finger protein 146 1.8 
tyrosine 3-monooxygenase/tryptophan 5-mo 1.5 
heterogeneous nuclear ribonucleoprotein 2 
synaptophysin 0.6 
KIAA0174 gene product 1.2 
CREBBP/EP300 inhibitory protein 1 1.5 
zyxin \2 
cadherin 11, type 2, OB-cadherin (osteob 3.2 
ADP-ribosyltransferase (NAD+; poly (ADP- 2.1 
DKFZP564B167 protein 1.9 
thymosin, beta 10 2.6 
G protein-coupled receptor kinase 6 1 
step II splicing factor SLU7 0.5 
H1 histone family, member 2 1.5 
Homo sapiens mRNA; cDNA DKFZp564B1264 (f 
carboxylesterase 1 (monocyte/macrophage 0.3 
ESTs 5.5 
synuclein, alpha (non A4 component of am 0.6 
protein kinase, cAMP-dependent, catalyti 1 
arginyltransferase 1 0.9 
calumenin 2.8 
ESTs 1>8 
proteasome (prosome, macropain) 26S subu 15 
nuclear factor of activated T-cells, cyt 1.5 
v-Ki-ras2 Kirsten rat sarcoma 2 viral on 0.9 
von Hippel-Lindau syndrome 2.3 
SWI/SNF related, matrix associated, acti 3.3 
Homo sapiens mRNA; cDNA DKFZp564C1216 (f 
hypothetical protein hCLA-iso 1 
ubiquitin specific protease 7 (herpes vi 1.7 
matrin 3 
cyclin G2 

mitochondrial ribosomal protein L3 
KIAA0009 gene product 
programmed cell death 6 
pim-2 oncogene 
synaptophysin-like protein 
134277 NM_004369 Hs.80988 collagen, type VI, alpha 3 

134280 NM_000712 Hs.81029 biliverdin reductase A 

134288 AI022650 Hs.8117 erbb2-interacting protein ERBIN 

134296 R00603 Hs.8128 phosphatidylserine decarboxylase 

134300 NM_001430 Hs.8136 endothelial PAS domain protein 1 

134310 AL037800 Hs.8148 selenoprotein T 

134343 D50683 Hs.82028 transforming growth factor, beta recepto 

134364 X76534 Hs.82226 glycoprotein (transmembrane) nmb 

134374 N22687 Hs.8236 ESTs 

134378 AL035786 Hs.82425 actin related protein 2/3 complex, subun 

134382 BE512856 Hs.109051 SH3 domain binding glutamic acid-rich pr 

134415 AI750762 Hs.82911 protein tyrosine phosphatase type IVA, m 

134417 NM_006416 Hs.82921 solute carrier family 35 (CMP-sialic aci 

134421 AU077196 Hs.82985 collagen, type V, alpha 2 

134439 Z23024 Hs.138860 Rho GTPase activating protein 1 

134454 NM_013230 Hs.286124 CD24 antigen (small cell lung carcinoma 

134494 D86981 Hs.84084 amyloid beta precursor protein (cytoplas 

134501 W84869 Hs.211568 eukaryotic translation initiation factor 

134505 AW960673 Hs.177530 ATP synthase, H+ transporting, mitochond 

134520 GE091005 Hs.74861 activated RNA polymerase II transcriptio 

134528 M23161 Hs.84775 Human transposon-like element mRNA 

134545 AI902899 Hs.85155 butyrate response factor 1 (EGF-response 

134553 AI203545 Hs.296169 S-phase response (cyclin-related) 

134573 NM_016142 Hs.279617 steroid dehydrogenase homolog 

134576 AB033017 Hs.8594 KIAA1191 protein 

134577 BE244323 Hs.85951 exportin, tRNA (nuclear export receptor 

134579 AW936928 Hs.85963 DKFZP564M182 protein 

134582 AA927177 Hs.86041 CGG triplet repeat binding protein 1 

134600 AF078859 Hs.86347 hypothetical protein 

134655 AF265208 Hs.123090 SWI/SNF related, matrix associated, acti 

134700 AK000606 Hs.8868 golgi SNAP receptor complex member 1 

134737 D17530 Hs.89434 drebrin 1 

134762 T51986 Hs.283108 hemoglobin, gamma G 

134843 AA428520 Hs.90061 progesterone binding protein 

134854 J03464 Hs.179573 collagen, type I, alpha 2 

134865 AA587775 Hs.66295 multi-PDZ-domain-containing protein 

134868 AB020689 Hs.90419 KIAA0882 protein 

134874 AI803761 Hs.90458 serine palmitoyltransferase, long chain 

134685 AJ002030 Hs.9071 progesterone membrane binding protein 

134891 R51083 Hs.90787 ESTs 

134908 BE089782 Hs.9877 hypothetical protein 

134934 AF005043 Hs.91390 poly (ADP-ribose) glycohydrolase 

134970 BE560779 Hs.284233 NICE^S protein 

134982 AK002085 Hs.92308 Homo sapiens cDNA FLJ11223 fis, clone PL 

135011 AB037835 Hs.92991 KIAA1414 protein 

135032 AW301984 Hs.173685 hypothetical protein FLJ12619 

135035 AL034344 Hs.284186 forkhead box C1 

135051 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 

135060 AK001887 Hs.259842 protein kinase, AMP-activated, gamma 2 n 

135062 AK000967 Hs.93872 KIAA1682 protein 

135077 AW503733 Hs.9414 KIAA1488 protein 

135082 AB017363 Hs.94234 frizzled (Drosophila) homolog 1 

135107 T97257 Hs.337531 ESTs, Moderately similar to 138022 hypot 

135143 AA132813 Hs.69559 KIAA1096 protein 

135156 BE563088 Hs.9552 binder of Arl Two 

135172 AB028956 Hs.12144 KIAA1033 protein 

135181 BE250865 Hs.279529 px19-like protein 

135222 AA534009 Hs.183487 interferon stimulated gene (20kD) 

135232 AL038812 Hs.96800 ESTs, Moderately similar to ALU7_HUMAN A 

135289 AW372569 Hs.9788 hypothetical protein MGC10924 similar to 

135290 AA331901 Hs.184736 hypothetical protein FLJ10097 

135291 T83882 Hs.97927 ESTs 

135349 AA114212 Hs.9930 serine (or cysteine) proteinase inhibito 

135357 AI565004 Hs.79572 cathepsin D (lysosomal aspartyl protease 

135398 M16029 Hs.287270 ret proto-oncogene (multiple endocrine n 

135399 W79431 Hs.326249 ribosomal protein L22 

135400 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 
302665 
302692 
302963 
303131 
303150 
310125 
312662 
319429 
320591 
406779 
410691 
410763 
415738 
420186 
422055 
425815 
426218 
427397 
427466 
427505 
427723 
428673 
430219 
430450 



R99693 Hs.224410 

AW176909 Hs.42346 

AW673106 Hs.151945 

AW081061 Hs.103180 



AA887146 
AA147979 
AA233808 
6E616412 
AA054761 
M412048 



Hs.8217 

Hs.285005 

Hs.286241 

Hs.286218 

Hs.169149 

Hs.279574 



AW239226 Hs.65450 

AF279145 Hs.8966 

BE539367 Hs.295953 

NM_015925 Hs.95697 

NWL014320 Hs.111029 

R94023 Hs.337531 



AF1 19043 
AI929685 
AA523543 
AA361562 
AI355260 



Hs.168005 

Hs.177656 

Hs.7678 

Hs.178761 

Hs.279789 



AW601325 Hs.324278 
X99209 Hs.235887 



R23553 Hs.241489 

25 432866 BE395875 Hs.279609 

BE407127 Hs.8997 

AB001636 Hs.5683 

BE616412 Hs.286218 

R60366 Hs.5822 

BE513940 Hs.6101 

AL042986 Hs.7857 

BE281316 Hs.47334 

H51066 Hs.23581 



433423 
437562 
437667 
437754 
440252 
441471 
448292 
449404 
449964 
451389 
452685 



AW001741 Hs.273193 
N73222 Hs.279009 



AJ634651 
RC_H15847_s 
RC.W84712 
X14008_ma1 f 
RC_H86543 f 
H07011 

RC>A164586 s 

RCLAA070485 

RC_H98714_s 

RCJW406145J 

AA458584 

AA031548 

X02761 

RCJW87193 

R25326 

RCJU393805 

RCJVA449333 

ROjAA287681_s 

RCJVA490864 

RC_C14243J 

R21443 

RC>A251902 

M21121.S 

C00038_9 

Y00503 

RO.R27006J 

RCJVA416886 

RCJ^460450 

RCJW488433 

RCLAA2784Q0J 

U28831 

RC.AA199588 



Hs.30250 



Homo sapiens cDNA FLJ12843 fis, clone NT 3.6 
calcineurin-binding protein calsarcin-1 3.3 
mitochondrial ribosomal protein L43 0.9 
DC2 protein 3 
stromal antigen 2 6.2 
mitochondrial import receptor Tom22 1.2 
protein kinase, cAMP-dependent, regulato 1 
junctional adhesion molecule 1 1.5 

karyopherin alpha 1 (importin alpha 5) 1.2 
CGI-39 protein; cell death-regulatory pr 1.3 
reticulon 4 1.2 
hypothetical protein FLJ21776 2 
ESTs, Weakly similar to AF220049 1 uncha 1.3 
liver-specific bHLH-Zip transcription fa 1.5 
putative heme-binding protein 2 
ESTs, Moderately similar to 138022 hypot 1.7 
Homo sapiens cDNA FLJ13372 fis, clone PL 3.3 
calmodulin 1 (phosphorylase kinase, delt 1.3 
cellular retinoic acid-binding protein 1 1.1 
26S proteasome-associated pad1 homolog 3.2 
histone deacetylase 3 2.8 
Homo sapiens mRNA; cDNA DKFZp566M063 (fr 
HMT1 (hnRNP methyltransferase, S. cerevi 1.8 
hypothetical protein 
mitochondrial carrier homolog 2 
heat shock 70kD protein 1A 
DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 
junctional adhesion molecule 1 
Homo sapiens cDNA: FLJ22120 fis, clone H 
hypothetical protein MGC3178 
erythrocyte membrane protein band 
hypothetical protein FLJ14495 
leptin receptor gene-related protein 
hypothetical protein FLJ10706 
matrix Gla protein 

v-maf musculoaponeurotic fibrosarcoma (a 
peptidylprolyl isomerase B (cyclophilin B) 
calumenin 

lysozyme (renal amyloidosis) 
ESTs 

ESTs; Weakly similar to SAS [H.sapiens] 



1.1 
1.5 
1.3 
1.6 
1.3 
2 

1.1 
0.5 
2.5 
1.1 
1.4 
4 

0.8 
1.8 
3.5 
0.9 
1.8 
1.8 
ESTs 
3.4 
1.6 
ESTs 
3.4 



Homo sapiens clone 23967 
ESTs 

SRY (sex determining region Y)-box 4 
cell division cycle 42 (GTP-binding protein; 25kD) 
fibronectin 1 

secreted frizzled-related protein 4 
Homo sapiens mRNA for putative vacuolar 
ESTs; Weakly similar to (defline not 
ESTs 



ESTs; Highly similar to heat shock factor 
ESTs; Highly similar to heat shock factor 
ESTs 

Homo sapiens lysophospholipase (LPL1) 

small inducible cytokine A5 (RANTES) 

ESTs 

keratin 19 

ESTs 

ESTs; Weakly similar to predicted using 
fibroblast growth factor receptor 2 (bacteria- 
ESTs; Weakly similar to deduced amino acid 



3.6 
4.7 
0.9 
1.1 
2.9 
ESTs 
1.4 
1.7 
1.6 
2.2 
0.9 
2.8 
3.1 
1.6 
3.1 
1.5 
1.1 



3.6 

I. 6 
4.2 
17.3 
4 

6.6 
3.5 
4.7 
, 5.6 
3.5 
135 
5.1 
3.9 
6.2 

II. 3 
3.6 
2.8 
4.7 
3.7 
2.5 
22 

I. 1 
8.8 
5.6 
6.1 
7.6 
6.5 
3.5 
5.7 
6.2 
3.7 
4.9 
3.6 
3.5 

II. 2 
5.6 
4.8 
4.6 
4.5 
6.6 
3.9 
6.2 
2.6 
3.5 
4.6 
0.4 
3.1 
15.2 
4 

5 

8.4 

4.6 

1.3 

5 

5 

3.7 
3.8 
9.9 
4.8 
1.1 
3.7 
3.1 
3.7 
4 



5.2 



0.6 



3.9 



Homo sapiens HR1HFB2115 mRNA; partial cds 
Human protein immuno-reactive with anti-PTH 4.4 0.6 
Homo sapiens actin-related protein Arp3 (ARP3) 1.8 4.7 
AF006082 
RC_H90899 
RCJV95070 
RC.T90946J 
D87258 
AA313414_S 
RC_H73484_s 

AFFX-HUMISGF3A/M97935_3 
AFFX-HUMRGE/M10Q98_5 
AFFX-M27830J 
AFFX-M27830.5 
RC.AA063431J 

RC T63769 J ferritin; light polypeptide 



Homo sapiens actin-related protein Arp2 (ARP2) 
desmoplakin (DPI; DPII) 5.4 
desmoplakin (DPI; DPII) 5 
Human mRNA for K1AA2S3 gene; complete cds 
protease; serine; 11 (IGF binding) 2.4 
ESTs; Weakly similar to cDNA EST EMBLT1 157 
ESTs; Weakly similar to similar to Yeast 1.3 



1.1 



ESTs 
1.1 



1.6 
5.5 
2.6 
1.1 
3.5 
15 
6.3 
2.3 
7.9 
0.5 
0.6 
0.8 
3.7 



10.9 

3.9 

5.3 

13,5 

7.4 
5.4 
4.1 






WO 02/059377 



PCT/US02/02242 



TABLE 8A 



Table 8A shows the accession numbers for those Pkeys lacking UniGene IDs in Table 8. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 
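
For readers who want to work with Table 8A programmatically, the three-column layout described above can be read with a few lines of Python. This is only an illustrative sketch and not part of the disclosed methods: the names `ClusterRow` and `parse_cluster_row` are hypothetical, and the code assumes each row is whitespace-delimited with the Pkey first, the CAT number second, and every remaining token a Genbank accession.

```python
from typing import List, NamedTuple


class ClusterRow(NamedTuple):
    """One Table 8A row (hypothetical container, for illustration only)."""
    pkey: int              # Eos probeset identifier
    cat_number: str        # gene cluster number, kept as text
    accessions: List[str]  # Genbank accessions making up the cluster


def parse_cluster_row(line: str) -> ClusterRow:
    """Split a whitespace-delimited row into Pkey, CAT number, accessions."""
    fields = line.split()
    if len(fields) < 3:
        raise ValueError("expected Pkey, CAT number and at least one accession")
    return ClusterRow(int(fields[0]), fields[1], fields[2:])


# Example row in the shape listed under Table 8A.
row = parse_cluster_row("125076 190299.1 AA973971 T88817 AA253263")
print(row.pkey, row.cat_number, len(row.accessions))
```

Rows whose accession list wraps over several printed lines would need to be joined into one line before parsing; the sketch does not attempt that.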



Pkey CAT number Accessions 

108469 116761_1 AA079487 AA128547 AA128291 AA079587 AA079600 
125076 190299.1 AA973971 T88817 AA253263 

114636 109698_1 AA075488 AA129081 AA074851 AA082852 AA074732 AA084908 M084751 AA076042 AA131172 AA085374 AA079519 
AA074510 AA113824 AA102437 AA070833 AA070143 AA084693 AA084389 M076373 AA075492 AA062834 AA084335 
AA078829 AA079344 AA069916 AA079275 AA070914 AA654069 AA081976 AA080957 AA083115 AA070942 AA085296 

123526 genbank_AA608657 AA608657 

123533 genbank_AA608751 AA608751 

125090 genbank_T91518 T91518 

125154 genbank_W38419 W38419 

118475 genbank_N66845 N66845 

118505 genbank_N67343 N67343 

101046 entrez_K01160 K01160 

129982 221_67 Z14221 AW381862 M97920 AW401444 Z66542 M2947Q AW406502 X61011 M34024 M327072 Z14166 Z14167 Z14165 

AW403806 Z14200 AA383972 Z14205 Z14201 M18513 Z14202 AW403684 X14584 AF062221 U43760 X65892 X65883 X62107 
Z80847 X65885 X65893 AF062142 X65891 X17675 Z47274 Z47277 Z47276 X65888 Z47275 X62109 AF062140 L01278 
AF062134 AF062139 X81723 Z80840 X81733 X81743 X81744 X81732 Z80843 AW402942 AW403516 X65919 AF062190 
AF062177 AF062222 AF0621 15 Z47240 AF062263 AF062261 AF062223 AF06221 1 Z47238 AW401714 AW404008 AW404991 
AF062280 M99595 Z47214 Z47232 Z47218 M26995 AF062184 X65895 L38433 X81731 Z11946 Z47226 AF062205 AF174012 
101276 AF062168 AF062136 X81755 XB1748 AF174019 AF062285 L06924 L11699 AW402665 L09070 L28049 L08090 
AW407843 214171 AW402944 L08083 AW405627 L33035 126907 M17750 L28052 M17751 AJ239360 U19885 L14821 X56526 
AB014341 L12087 L12098 U6B231 L12184 AF062242 AF022000 U64499 U00570 A1268604 Y15773 X64239 X62969 U00506 
X73605 X99360 U00577 D83677 AB021539 AF035796 Z33899 U00588 AJ239353 AF174062 Z33901 X98899 AF174058 X63080 
D83676 AB021529 218318 U00488 L01412 X81746 U21262 U21272 U00560 AF174060 U00547 U00561 Z18321 S73957 
M26435 AF115130 AF1 151 17 Z92896 U21254 Z92895 AF1 151 12 AF062290 T28938 Y09386 AF174087 U27189 Y09384 
U77373 AF174057 M17749 X69692 AF174038 U64478 U64486 AF174098 X99365 AF174077 AF174083 AF174089 AF015133 
AF127792 AJ006171 U00510 U58144 AF004323 AF1 15109 AF004324 AF015130 AF090414 AF090418 AB021536 AB021519 
Z96957 AF021986 Z92898 Z96956 L34164 AF062251 AF052524 U00549 U00541 D83687 AF052522 AF087420 AF052520 
AF02201 1 L43085 U00536 L19915 L43083 AF090420 AB021530 AB021534 Z96955 Z96954 Z33903 X62964 AF103282 U00509 
AF062298 AF062289 AF062307 AW408326 L33037 UJ4337 L04328 X81742 L04336 L03385 AF032360 S56184 AF062191 
AF191092 AI906954 AF103184 L04343 AW364860 AF001424 AF103163 Z98717 AF103143 AJ008250 L04323 AF103321 
AF103335 104342 L03818 L03817 AW404978 AW403690 U86801 AF103150 AJ010334 AF035027 AJ007327 AF1031 15 
AF017458 AJ008207 AJ008183 AJ008196 AJ008241 AJ008208 AF103210 AF068668 AF068670 AF068671 AF068664 
AF068669 AF068666 AF068665 AF068672 AW371244 AW403670 AW408074 AW404575 AW3621 53 AW403803 AW406702 
AW351514D78345 T29140 J00231 NfVL0Q2179AW405146 AA301091 X04646 H64660 AW402990 AW406534 T93007 
AI857980 AW368899 AI905833 AW406586 AA482084 A1872299 AA715266 AW404328 A1831674 AI709348 AA603112 
AW514864 AA485775 H64492 AW404789 AA487630 AA715498 AA295885 T27613 T98113 

108470 genbank_AA079500 AA079500 
101447 entrez_M21305 M21305 
124447 genbank_N48000 N48000 
101624 entrez_M55998 M55998 

131791 221.260 X621 11 S67984 AJ131056 Z47243 Z47235 AF062268 Z47237 AJ131058 AF062130 AF062282 X62108 AA385989 AA464794 
X69861 AW4029S4 M90808 Z98735 Z98734 Z98736 AF035018 X79161 U00545 AF174046 AF174071 U00552 U96288 
AF021989 AF062255 AF174061 S73953 AF062135 AF062155 X64147 U00555 AF174070 AF129754 U86789 298714 Z98738 
S75886 AJ008175 AJ008186 AJ008188 Y08403 H15753 H22208 AA327241 AW405737 H42300 H39677 H26239 H26903 
H45128 R86072 AA327565 AI660584 AW361537 AW383759 S71043 H15014 H45570 H42819 H45523 H45134 R72043 
H24543 H27636 H27610 T28147 H25496 AW364071 AW354072 H45561 H45556 H42605 M735017 T47421 R48719 H27570 
H44599 A1459598 H42347 H41938 H24993 AA345888 H22339 AI538691 AJ012264 AA664201 A1880450 AA327310 AI991250 
AI833028 AW001210 AI956075 H30467 AA326915 H41943 AI749266 AI744441 AA327377 AW512326 AI735170 H01634 
AI587047 AI571623 AA327486 AA327103 AA327195 AA326973 T28143 



130064 221J64 X57815 L29157 AA367448 AW32B098 AW404536 L29156 L29163 H27683 R83195 AA295096 AW327822 H25458 AW404692 



X57819 X57823 AW405604 AW4Q4447 Z34914 AW406542 AAA27726 AA604389 AW405606 AW405918 AW4D51 17 
AW407182 L03632 AW405058 L03627 AW407470 R72738 L21959 AW375738 X87888 AF124166 AF054638 AJ241406 
U09902 AW405284 M12025 AA360219 103631 L03629 AW327727 AF194810 AF058076 AF194686 AF0S3774 AF063737 
AF063755 AF063740 AF063739 AF063738 S73129 L43088 AF063775 L33026 AF060138 AF194813 AF194812 AF194811 
AF194809 AF194814 AF194808 AF064505 AF064503 AFQ63765 AF063757 AF063752 AF063716 AF063736 AF194806 
AF058077 AF063747 AF063772 AF063781 AF060137 AF194805 AF060134 AF060132 AF058074 AF063754 AF063704 
U38589 Z18332 AF060122 AF194807 AF060135 AFO64506 AF064504 AF063773 L26540 AF058072 AF060131 AF064500 
AF060136 AF064513 AF194683 AF194711 AF058075 AF063717 Z19546 AF194581 X72746 U96393 U09890 X98897 
AF194592 M80916 AJ241405 AF194632 AJ388659 AF194625 Z74694 AF194588 AF194601 U09901 U09911 U09892 
AJ010336 AJ006162 AJ249377 AJ241414 AA327392 U97248 X72747 Z46850 X95739 AW406701 AJ243109 AF194609 
S80758 AF194595 AF194596 L22483 Z70262 084141 AW405758 AF001788 AF194580 D84143 U76684 AF194593 L03630 
X87892 X91134 U21249 Z46346 AJ132426 AF103659 AJ233718 AF021038 AJ233727 H24657 U09882 S75627 AA573599 
AF047231 AF047232 AF103616 AW404484 Z46848 S76132 AF103663 AF103713 Y17940 AF047216 AF103595 Y17956 
AF052799 AF052797 AF052802 AF052798 AF052801 AF052794 AF052796 AF047218 AF052800 AF047217 AF052795 
AF032351 AF103701 AF103708 AF103710 AF103706 S62232 AF103645 AF103632 AF103647 AF103644 AF103640 
AF103692 AF093581 AF103620 AW405934 AW45389 AW383753 M360256 AF099676 H21654 H39501 AI820828 H53689 
W26785 AW384496 AW407708 AA541663 AA911602 AI821461 AA588300 AA327050 H42717 AI951280 AA421322 AI923193 
AA864302 H25133 D87023 J03011 M61771 D87017 AA526865 AA253450 



130232 18831.2 U29463 U06715 W24970 AA584303 T39581 M155603 AA305043 AA429426 W05664 AA102382 AA482044 W24487 AA31906O 
T88946 F10106 AA232161 AA243117 AA158937 AA100864 
109097 genbank_AA167512 AA167512 



124842 217726J F 
103758 AA084874J_at 



R56485R37248 R59992 
t AA084874J 
TABLE 9: Figure 9 from BRCA 001-2 US 

Table 9 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 

Pkey: Unique Eos probeset identifier number 
ExAccn: Exemplar Accession number, Genbank accession number 
UnigeneID: Unigene number 
Unigene Title: Unigene gene title 



Pkey  ExAccn  UnigeneID 
100630  AA383256  Hs.1657 
102211  BE314524  Hs.78776 
103587  BE270266  Hs.82128 
104115  AF183810  Hs.26102 
105038  AW503733  Hs.9414 
105500  AW602166  Hs.222399 
105990  AI690566  Hs.29403 
106155  AA425414  Hs.33287 
106373  AW503807  Hs.21907 
106414  BE568205  Hs.28827 
110009  BE075297  Hs.6614 
111900  AF131764  Hs.25318 
114540  AI904232  Hs.75323 
116470  AI272141  Hs.83484 
117280  M18217  Hs.172129 
119771  AI905687  Hs.2533 
121723  AA243499  Hs.104800 
124059  BE387335  Hs.283713 
131148  AW953575  Hs.303125 
132371  AA235448  Hs.46677 
134169  AI690916  Hs.178137 
302235  AL049987  Hs.166361 
452410  AL133619  Hs.29383 

Unigene Title 
1 
putative transmembrane protein 
5T4 oncofetal trophoblast glycoprotein 
opposite strand to trichorhinophalangeal syndrome 1 
KIAA1488 protein 
CEGP1 protein 
hypothetical protein FLJ22060 
nuclear factor I/B 

mitogen-activated protein kinase kinase kinase 2 
ESTs, Weakly similar to A43932 mucin 2 precursor, intestinal 
Homo sapiens clone 25194 mRNA sequence 
prohibitin 
SRY (sex determining region Y)-box 4 
Homo sapiens cDNA: FLJ21409 fis, clone COL03924 
EST 
hypothetical protein FLJ10134 
ESTs, Weakly similar to S64054 hypothetical protein YGL050w 
p53-induced protein PIGPC1 
PRO2000 protein 
transducer of ERBB2, 1 
Homo sapiens mRNA; cDNA DKFZp564F112 
Homo sapiens mRNA; cDNA DKFZp434 
TABLE 10: Figure 10 from BRCA 001-3 PCT 

Table 10 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 

Pkey: Unique Eos probeset identifier number 
ExAccn: Exemplar Accession number, Genbank accession number 
UnigeneID: Unigene number 
Unigene Title: Unigene gene title 
R1: Ratio of tumor to normal body tissue 
R2: Ratio of 90th percentile tumor to body 
R3: Ratio of 75th percentile body to tumor 
R4: Ratio of tumor to normal breast tissue 
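
The legend names percentile-based ratios but does not spell out how they are calculated. The following is a minimal sketch of one plausible reading, assuming a ratio formed from an upper percentile of tumor expression values over a percentile of normal-tissue values. The function names `percentile` and `percentile_ratio`, the `floor` guard against very small denominators, and the sample numbers are all hypothetical and are not taken from the source.

```python
from typing import Sequence


def percentile(values: Sequence[float], pct: float) -> float:
    """Return the pct-th percentile (0-100) using linear interpolation."""
    if not values:
        raise ValueError("values must be non-empty")
    ordered = sorted(values)
    # Fractional position of the requested percentile in the sorted data.
    rank = (len(ordered) - 1) * pct / 100.0
    lo = int(rank)
    hi = min(lo + 1, len(ordered) - 1)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (rank - lo)


def percentile_ratio(
    tumor: Sequence[float],
    body: Sequence[float],
    tumor_pct: float = 90.0,
    body_pct: float = 75.0,
    floor: float = 1.0,
) -> float:
    """Ratio of a tumor percentile to a normal-tissue percentile.

    `floor` is an assumed lower bound on the denominator so that
    near-zero background values do not inflate the ratio.
    """
    numerator = percentile(tumor, tumor_pct)
    denominator = max(percentile(body, body_pct), floor)
    return numerator / denominator


if __name__ == "__main__":
    # Illustrative expression values only; not data from the tables.
    tumor_samples = [100.0, 200.0, 300.0, 400.0, 500.0]
    body_samples = [10.0, 20.0, 30.0, 40.0, 50.0]
    print(percentile_ratio(tumor_samples, body_samples))
```

Using percentiles rather than means keeps a few extreme samples from dominating the comparison, which may be why the legend refers to the 90th and 75th percentiles.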



Pkey  ExAccn  UnigeneID  Unigene Title  R1  R2  R3  R4 
100082  AA130080  Hs.4295  proteasome (prosome, macropain) 26S subu  4.2  152  36  12.2 
100103  AA380887  Hs.5085  dolichyl-phosphate mannosyltransferase p  9.8  123  13  5 
100131  D12485  Hs.11951  ectonucleotide pyrophosphatase/phosphodi  132  244  19  9.9 
100147  D13666  Hs.136348  osteoblast specific factor 2 (fasciclin  15.7  1030  66  5 
100154  H60720  Hs.81892  KIAA0101 gene product  4.1  320  78  10.6 
100157  D14661  Hs.119  Wilms' tumour 1-associating protein  4.7  119  26  3 
100169  AUJ 37228  Hs.82043  D123 gene product  5.1  105  21  9.2 
100203  BE242284  Hs.172199  adenylate cyclase 7  4.7  47  1  4.3 
100210  D26361  Hs.3104  KIAA0042 gene product  4.7  47  4  0.7 
100219  AW972300  Hs.118110  bone marrow stromal cell antigen 2  3.8  350  93  1.9 
100234  D29677  Hs.3085  KIAA0054 gene product; Helicase  4.1  64  16  3 
100248  NM_015156  Hs.78398  KIAA0071 protein  3.4  77  23  5.9 
100252  NM_006207  Hs.170040  platelet-derived growth factor receptor-  4.5  45  4  4 
100260  D38491  Hs.322478  KIAA0117 protein  5.9  59  1  2.6 
100279  D42084  Hs.82007  KIAA0094 protein  3.5  96  28  1.3 
100286  BE247550  Hs.86859  growth factor receptor-bound protein 7  3.1  306  98  1.5 
100294  AA331881  Hs.75454  peroxiredoxin 3  12.8  128  1  11.7 
100335  AW247529  Hs.6793  platelet-activating factor acetylhydrola  A2  187  44  5.4 
100365  AI878927  Hs.79284  mesoderm specific transcript (mouse) hom  4.5  129  29  3.1 
100375  D80004  Hs.75909  KIAA0182 protein  3.5  78  23  4.8 
100409  D86957  Hs.80712  KIAA0202 protein  10.2  102  1  4.8 
100410  D86961  Hs.79299  lipoma HMGIC fusion partner-like 2  4  40  1  3.8 
100414  NM_014735  Hs.82292  KIAA0215 gene product  3.2  32  2  29 
100418  D86978  Hs.84790  KIAA0225 protein  3.6  36  7  3.2 
100438  AA013051  Hs.91417  topoisomerase (DNA) II binding protein  5.6  76  14  2 
100439  AA347720  Hs.122669  KIAA0264 protein  3.5  35  9  3.1 
100448  AF234887  Hs.57652  cadherin, EGF LAG seven-pass G-type rece  5.5  145  27  2.2 
100449  D87470  Hs.75400  KIAA0280 protein  3.4  34  1  1.2 
100522  X51501  Hs.99949  prolactin-induced protein  217  760  34  1.4 
100552  AA019521  Hs.301946  lysosomal  14.4  144  9  4.7 
100643  NM_005032  Hs.4114  plastin 3 (T isoform)  4.1  259  63  1.9 
100661  BE623001  Hs.132748  Homo sapiens ribosomal protein L39 mRNA,  3.3  116  36  2.2 
100666  L05424  Hs.169610  CD44 antigen (homing function and Indian  8.5  85  1  3.2 
100667  L05424  Hs.169610  CD44 antigen (homing function and Indian  3  594  201  2.3 
100745  BE207168  Hs.144630  nuclear receptor subfamily 2, group F, m  5  82  17  0.9 
100774  J05581  Hs.89603  mucin 1, transmembrane  3.5  37  11  2.8 
100783  AF078847  Hs.191356  general transcription factor IIH, polype  9.7  97  10  72 
100821  M26460  gb:Homo sapiens (clone 104) retinoblasto  3.3  33  1  0.8 
100864  BE563957  Hs.74861  activated RNA polymerase II transcriptio  3.7  477  130  3.1 
100877  X80821  Hs.27973  KIAA0874 protein  6.3  63  4  5.7 
100892  BE245294  Hs.180789  S164 protein  4.7  47  1  A2 
101038  BE297139  Hs.79411  replication protein A2 (32kD)  3.8  115  30  7.1 
101046  K01160  NM_002122:Homo sapiens major histocompat  3.9  390  100  11.1 
101079  BE264901  Hs.250502  carbonic anhydrase VIII  3.9  39  8  3.6 
101084  AW409934  Hs.75528  nucleolar GTPase  4.1  53  13  4 
101104  AW862258  Hs.169266  neuropeptide Y receptor Y1 
107318 T74445 Hs.5957 
107485 AL042613 Hs.262476 
107612 AI498986 Hs.60090 
107638 AI580492 Hs.42743 
107727 AA149707 Hs.173091 
107859 AW732573 Hs.47584 
107876 AW372451 Hs.61184 
107884 AA054949 Hs.61307 
107886 AA025782 Hs.61284 
107908 AF087999 Hs.42826 
107922 BE153855 Hs.61460 

nuclear factor I/B 9.9 
KIAA1323 protein 6.7 
Homo sapiens clone 015M2 My015 protein 3.3 
KIAA1463 protein 3.8 
hypothetical protein FLJ20550 3.3 
hypothetical protein FLJ10955 4.5 
ESTs 7 
KIAA0406 gene product 3.2 
ESTs 3 8 
Homo sapiens mRNA; cDNA DKFZp586M0723 (f 16 
Homo sapiens cDNA: FLJ21487 fis, clone C 4.9 
Homo sapiens clone 25142 mRNA sequence 3.1 
hypothetical protein FLJ20303 3.1 
glioma-amplified sequence-41 3.1 
cofactor required for Sp1 transcriptiona 5.5 
ESTs 4.4 
KIAA1116 protein 7.4 
ESTs 15.2 
Homo sapiens cDNA FLJ10071 fis, clone HE 3.8 
ESTs, Moderately similar to ALU7_HUMAN A 4.9 
ATP-dependant interferon response protei 5.8 
Homo sapiens clone 23570 mRNA sequence 5.3 

Homo sapiens mRNA; cDNA DKFZp564O0122 (f 5.4 

DIPB protein 3.6 
hypothetical protein FLJ22060 5.7 
hypothetical protein MGC4248 4.4 
TIA1 cytotoxic granule-associated RNA-bi 4.3 
triple functional domain (PTPRF interact 4.6 
KIAA0753 gene product 3.5 
KIAA1323 protein 5.4 
molecule possessing ankyrin repeats indu 3.3 
hypothetical protein FLJ20477 3.8 
mannosidase, alpha, class 1A, member 1 4.3 
hypothetical protein FLJ11269 3.6 
RAN binding protein 6 4.5 
Homo sapiens, clone IMAGE:3685398, mRNA, 5.7 
ESTs 3.5 
COA14 5.5 
ESTs, Weakly similar to I38022 hypotheti 3.2 
KIAA1593 protein 5.1 
gb:ae36h12.s1 Gessler Wilms tumor Homo s 3.3 
succinate-CoA ligase, GDP-forming, beta 3.1 
programmed cell death 8 (apoptosis-induc 3.1 
ectonucleotide pyrophosphatase/phosphodi 3.1 
hypothetical protein DKFZp566G1424 3.9 
KIAA1344 protein 6.3 
ESTs 4.6 
GK001 protein 2.5 
ESTs 15.6 
DKFZP586E1621 protein 4.8 
tumor rejection antigen (gp96) 1 3.4 
ESTs 3.6 
nuclear receptor co-repressor/HDAC3 comp 5.4 
UDP-N-acetyl-alpha-D-galactosamine:polyp 4.6 
hypothetical protein MGC4606 1 2.5 
Homo sapiens mRNA; cDNA DKFZp586I0324 (f 3.2 
Homo sapiens clone 24416 mRNA sequence 3.5 
S-adenosylmethionine decarboxylase 1 5.6 
Homo sapiens cDNA FLJ13595 fis, clone PL 3.2 
hypothetical protein 4.4 
ubiquitin-like 3 3.5 
potassium voltage-gated channel, delayed 5.7 
CGI-79 protein 3.5 
ESTs 4.3 
ESTs 3.1 
ESTs 4.7 
Ig superfamily receptor LNIR 9 
107994  AA036811  Hs.48469  LIM domains containing 1  4.5  45  1  3.8 
108040  AL121031  Hs.159971  SWI/SNF related, matrix associated, acti  6.5  65  2  6 
108055  AJ404672  Hs.334483  hypothetical protein FLJ23571  7.4  74  8  6 
108063  BE548479  Hs.14838  hypothetical protein FLJ10773  3.4  34  1  2.3 
108339  AW151340  Hs.51615  ESTs, Weakly similar to ALU7_HUMAN ALU S  18.7  187  1  17 
108467  AI478658  Hs.94631  brefeldin A-inhibited guanine nucleotide  3.8  38  1  3.2 
108539  AA084677  Hs.54558  hypothetical protein FLJ22222  5.7  57  1  4.9 
108634  AW022410  Hs.69507  ESTs  3.2  32  5  17 
108647  BE546947  Hs.44276  homeobox C10  87  247  29  57 
108695  AB029000  Hs.70823  KIAA1077 protein  3.7  625  168  3.8 
108778  AF133123  Hs.90847  general transcription factor IIIC, polyp  3.7  37  1  3.2 
108806  AF070578  Hs.71168  Homo sapiens clone 24674 mRNA sequence  3.4  34  1  2.8 
108807  AI652236  Hs.49376  hypothetical protein FLJ20644  3.5  35  1  3.2 
108810  AW295647  Hs.71331  hypothetical protein MGC5350  5.3  53  1  2.8 
108846  AL117452  Hs.44155  DKFZP586G1517 protein  4.8  96  20  6.5 
108857  AK001468  Hs.62100  anillin (Drosophila Scraps homolog), act  5.4  54  1  4 
108893  BE276891  Hs.194691  retinoic acid induced 3  3.1  529  170  4.1 
108917  AI380268  Hs.173648  ESTs, Weakly similar to Zinc-finger prot  3.3  33  5  17 
109010  NM_007240  Hs.44229  dual specificity phosphatase 12  3.4  34  1  2.6 
109060  BE062109  Hs.241551  chloride channel, calcium activated, fam  3.1  31  8  2 
109101  AW608930  Hs.52184  hypothetical protein FLJ20618  3.4  71  21  2.4 
109112  AW419196  Hs.257924  hypothetical protein FLJ13782  4.1  334  82  3.4 
109124  AK000684  Hs.183887  hypothetical protein FLJ22104  3.3  33  1  2.9 
109128  H89083  Hs.181915  ESTs  4  40  7  1.1 
109160  BE220601  Hs.301997  hypothetical protein FLJ13033  3.8  233  62  3.8 
109166  AA219691  Hs.73625  RAB6 interacting, kinesin-like (rabkines  8.8  199  23  16.1 
109173  AA179962  Hs.73643  EST  3.2  32  1  2.2 
109178  AW976516  Hs.283707  Homo sapiens cDNA: FLJ21354 fis, clone C  3.2  32  10  2.9 
109235  AI381800  Hs.300684  calcitonin gene-related peptide-receptor  4.9  121  25  10.4 
109273  AA375752  Hs.82719  Homo sapiens mRNA; cDNA DKFZp586F1822 (f  Z9  114  39  9.9 
109292  AW975746  Hs.188662  KIAA1702 protein  7.1  71  1  6.5 
109391  AL096858  Hs.184245  KIAA0929 protein Msx2 interacting nuclea  6.9  69  5  6.2 
109411  R98881  Hs.109655  sex comb on midleg (Drosophila)-like 1  3.3  39  12  1.5 
109412  BE543313  Hs.209473  hypothetical protein FLJ10520  4.2  56  14  2.2 
109415  U80736  Hs.110826  trinucleotide repeat containing 9  12.3  123  1  1U 
109481  AA878923  Hs.289069  hypothetical protein FLJ21016  3.2  286  91  57 
109517  AI631874  Hs.155140  casein kinase 2, alpha 1 polypeptide  8.3  83  8  1.9 
109597  AA989362  Hs.293780  ESTs  5.9  59  10  4.2 
109729  F10024  Hs.268740  ESTs  3.2  41  13  3.3 
109795  AA173942  Hs.326416  Homo sapiens mRNA; cDNA DKFZp564H1916 (f  5.9  208  36  1.8 
109799  AW965076  Hs.180378  hypothetical protein 669  5  50  5  4.1 
109883  R68827  Hs.95011  syntrophin, beta 1 (dystrophin-associate  3.7  37  4  2 
109912  AW390822  Hs.301528  L-kynurenine/alpha-aminoadipate aminotra  14.2  142  1  9.5 
109937  AI084066  Hs.20072  myosin regulatory light chain interacts  4.1  41  7  17 
109958  AA001266  Hs.133521  ESTs  \2  58  14  0.8 
109984  AI796320  Hs.10299  Homo sapiens cDNA FLJ13545 fis, clone PL  3.2  136  43  3.6 
110009  BE075297  Hs.6614  ESTs, Weakly similar to A43932 mucin 2 p  6.3  693  110  7.? 
110240  AJ668594  Hs.176588  ESTs, Weakly similar to CP4Y_HUMAN CYTOC  4.6  913  199  2.9 
110369  AK000768  Hs.107872  hypothetical protein FLJ20761  3.8  38  7  2.8 
110426  AI610702  Hs.28212  ESTs, Weakly similar to TRHY_HUMAN TRICH  6.7  78  12  3 
110478  H11236  Hs.31034  peroxisomal biogenesis factor 11A  3.7  37  1  2.1 
110481  AF075089  Hs.36823  ESTs  3.6  36  10  2.5 
110581  H61560  gb:yr22g03.s1 Soares fetal liver spleen  3.3  33  1  1.8 
110674  AA071276  Hs.19469  KIAA0859 protein  3.5  35  8  1.9 
110705  AB007902  Hs.32168  KIAA0442 protein  3.6  282  79  1.7 
110721  H97678  Hs.31319  ESTs  4.4  103  24  3.8 
110731  NM_014899  Hs.188006  KIAA0878 protein  3.3  138  42  3.6 
110769  BE000831  Hs.23837  Homo sapiens cDNA FLJ11812 fis, clone HE  13.5  135  1  5.1 
110775  N22414  gb:yw39a07.s1 Weizmann Olfactory Epithel  5.4  54  1  37 
110787  AA831267  Hs.12244  hypothetical protein FLJ20097  4.7  47  4  42 
110799  AI089660  Hs.323401  dpy-30-like protein  5  50  1  4.3 
110818  AL157503  Hs.27552  Homo sapiens mRNA; cDNA DKFZp586N2424 (f  3.1  31  1  27 
110839  AF153330  Hs.30246  solute carrier family 19 (thiamine trans  8.4  84  1  5.3 
110844  AI740792  Hs.167531  methylcrotonoyl-Coenzyme A carboxylase 2  10.5  105  4  7.1 
110854  BE612992  Hs.27931  hypothetical protein FLJ10607 similar to  7.9  79  1  6.2 
110882  AW963705  Hs.301183  molecule possessing ankyrin repeats indu  3.9  353  90  \2 
110908  AI433165  Hs.9856  ESTs  3.1  31  1  1.3 
110915  BE092285  Hs.29724  hypothetical protein FLJ13187  20.9  209  1  19.5 
110930  BE242691  Hs.14947  ESTs  3.4  115  34  2.4 
110970  Y19062  Hs.96870  staufen (Drosophila, RNA-binding protein  3.5  35  9  3.2 
111084  H44186  Hs.15456  PDZ domain containing 1  4.3  43  1  2 
111125  N63823  Hs.269115  ESTs, Moderately similar to Z195_HUMAN Z  5.4  54  1  4.3 
111132  AB037807  Hs.83293  hypothetical protein  12  72  10  6.1 
111164  N46180  Hs.122489  Homo sapiens cDNA FLJ13289 fis, clone OV  7.7  77  1  5 
111179  AK000136  Hs.10760  asporin (LRR class 1)  25.1  288  12  67 
111184  AI815486  Hs.243901  Homo sapiens cDNA FLJ20738 fis, clone HE  3.9  146  37  9.8 
111190  AK002055  Hs.151046  hypothetical protein FLJ11193  6.3  63  1  5.8 
111221  AB037782  Hs.15119  KIAA1361 protein  3.7  119  33  6.7 
111223  AA852773  Hs.334838  KIAA1866 protein  3.6  402  112  4.9 
111229  AW389845  Hs.110855  ESTs  4.3  43  1  1 
111234  AA902656  Hs.21943  NIF3 (Ngg1 interacting factor 3, S.pombe  3.3  33  1  1.1 
111241  AA345644  Hs.288880  PAN2 protein  4.8  61  13  5.6 
111345  AW253155  Hs.14559  hypothetical protein FLJ10540  4.3  43  5  2.2 
111353  W20090  Hs.6616  ESTs  4.1  41  1  2.6 
111357  BE314949  Hs.87128  hypothetical protein FLJ23309  3.8  425  111  4 
111378  AW160993  Hs.326292  hypothetical gene DKFZp434A1114  4.3  65  15  5.7 
111389  AK000987  Hs.169111  oxidation resistance 1  3.4  314  91  2.4 
111540  U82670  Hs.9786  zinc finger protein 275  3.5  35  1  2.1 
111806  BE071382  Hs.279008  hypothetical protein FLJ20170  3.5  105  30  9.6 
111884  AW502285  Hs.127236  hypothetical protein FLJ12879  3.2  37  12  3.5 
111923  BE383234  Hs.25925  Homo sapiens, clone MGC:15393, mRNA, com  6.2  62  2  5.9 
111929  AF027208  Hs.112360  prominin (mouse)-like 1  8.1  328  41  1.7 
111942  R40576  Hs.21590  hypothetical protein DKFZp564O0523  4.2  125  30  7.4 
111987  NM_015310  Hs.6763  KIAA0942 protein  6.5  65  10  1.5 
112092  R44538  gb:yg29c02.s1 Soares infant brain 1NIB H  3.3  33  10  13 
112134  R41823  Hs.7413  ESTs; calsyntenin-2  6.1  185  31  6.6 
112197  NM_003655  Hs.5637  ESTs  3.5  507  145  3.3 
112198  AM32672  Hs.288539  hypothetical protein FLJ22191  3.5  40  12  2.5 
112244  AB029000  Hs.70823  KIAA1077 protein  5.7  567  100  6.7 
112253  R51818  gb:yg77h12.s1 Soares infant brain 1NIB H  4  70  18  6.8 
112269  R53734  Hs.25978  ESTs, Weakly similar to 2109260A B cell  3.7  37  1  3 
112275  AW972635  Hs.301904  hypothetical protein FLJ12671  4.3  45  11  4.4 
112280  AA863360  Hs.26040  ESTs, Weakly similar to fatty acid omega  2.8  751  270  1.3 
112305  AK000914  Hs.26244  hypothetical protein FLJ10052  3.5  41  12  3.7 
112483  AW969785  Hs.285885  Homo sapiens cDNA FLJ11321 fis, clone PL  4.2  42  6  3.6 
112513  R68425  Hs.13809  hypothetical protein FLJ10648  4.7  54  12  4.5 
112571  AA412205  Hs.140996  ESTs  4*  48  2  3.4 
112971  Z42387  Hs.83883  transmembrane, prostate androgen induced  4.5  390  87  5.3 
113023  AL134324  Hs.7312  ESTs  3.2  99  31  3.1 
113047  AI571940  Hs.7549  ESTs  9.6  124  13  9 
113073  N39342  Hs.103042  microtubule-associated protein 1B  9.1  91  6  8.3 
113083  AA283057  Hs.266957  hypothetical protein FLJ14281  6.5  65  6  4.8 
113287  T66847  Hs.194040  ESTs, Weakly similar to I38022 hypotheti  3.5  35  1  1.4 
113296  AW449560  Hs.89576  inner mitochondrial membrane peptidase 2  3.5  35  4  3.3 
113523  AI791905  Hs.95549  hypothetical protein  7.6  76  1  42 
113604  AI075407  Hs.296083  ESTs, Moderately similar to I54374 gene  3.1  453  148  7 
113617  AI869372  Hs.17207  Homo sapiens cDNA FLJ11922 fis, clone HE  3.6  36  4  2.6 
113702  T97307  gb:ye53h05.s1 Soares fetal liver spleen  12.3  129  11  11.7 
113783  AL359588  Hs.7041  hypothetical protein DKFZp762B226  4.6  46  4  4.3 
113791  AI269096  Hs.135578  chitobiase, di-N-acetyl-  3.6  36  1  1.2 
113794  T62849  Hs.11090  membrane-spanning 4-domains, subfamily A  3.3  744  227  2.5 
113804  BE247683  Hs.14611  dual specificity phosphatase 11 (RNA/RNP  3.3  180  54  2.1 
113808  W44735  Hs.9286  Homo sapiens cDNA: FLJ21278 fis, clone C  5.1  51  5  4.5 
113847  NM_005032  Hs.4114  plastin 3 (T isoform)  32  238  75  2.1 
113849  AA457211  Hs.8858  bromodomain adjacent to zinc finger doma  4.3  43  8  3.6 
113867  AW0G2834  Hs.24095  ESTs  6.1  110  18  10.2 
113886  W76027  Hs.23920  hypothetical protein FLJ11105  4  48  12  4 
113923  AW953484  Hs.3849  hypothetical protein FLJ22041 similar to  3.7  239  65  3.6 
113936  W17056  Hs.83623  nuclear receptor subfamily 1, group I, m  4.3  819  191  \2 
113950  AI267652  Hs.30504  Homo sapiens mRNA; cDNA DKFZp434E082 (fr  10.7  123  12  7 
114030  AI825386  Hs.164478  hypothetical protein FLJ21939 similar to  4.4  44  6  22 
114051  AB026436  Hs.177534  dual specificity phosphatase 10  4.5  45  4  2.6 
114057  AF116653  Hs.34192  Homo sapiens PRO0323 mRNA, complete cds  3.5  35  6  Z2 
114082  AK001612  Hs.26962  Homo sapiens cDNA FLJ10750 fis, clone NT  3.1  31  5  1j5 
114124 
114138 
114162 
114196 
114208 
114239 
114251 
114306 
114460 
114542 
114652 
114767 
114768 
114774 
114798 
114821 
114860 
114918 
114940 
114965 
114969 
114988 
115004 
115054 
115061 
115140 
115142 
115188 
115206 
115221 
115262 
115291 
115536 
115583 
115600 
115622 
115646 
115674 
115675 
115719 
115725 
115764 
115821 
115825 
115839 
115844 
115892 
115967 



W57554 Hs.125019 
AW384793 Hs.15740 
AF155661 Hs.22265 



AF017445 
AL049466 
AL137667 
H15261 



Hs.150926 
Hs.7859 
Hs.267445 
Hs.21948 



116097 
116107 
116127 
116129 
116204 
116226 
116238 
116250 
116256 
116298 
116336 
116351 
116365 
116379 
116429 
116450 
116461 
116470 



AF100143 Hs.6540 
AF183810 Hs.26102 
AW970128 Hs.91011 
AI521936 Hs.107149 
AI859865 Hs.154443 
AF212848 Hs.182339 
AV656017 Hs.184325 
AA159181 Hs.54900 
AI648602 Hs.55468 
AL157545 Hs.42179 
BE165762 Hs.23518 
BE092696 Hs.75928 
AI733881 Hs.72472 
AW162998 Hs.24684 
AA251089 
AA329340 Hs.4867 
AW265668 Hs.87729 
AI751438 Hs.41271 
NM_014158 Hs.279938 
AI623693 Hs.191533 
AK000219 Hs.88367 
AW183695 Hs.186572 
AW365434 Hs.79741 
AI422867 Hs.88594 
BE545072 Hs.122579 
AK001468 Hs.62180 
NM_012317 Hs.45231 
AA081395 Hs.42173 
AI088691 Hs.208414 
N36110 Hs.305971 
AW992356 Hs.8364 
W87707 Hs.82065 
AW992405 Hs.59622 
AW899053 Hs.76917 
AW582256 Hs.91011 
AW338063 Hs.130985 
R50956 Hs.159993 
BE300266 Hs.28935 
AI373062 Hs.332938 
AA291377 Hs.50831 
AI745379 Hs.42911 
AW673312 Hs.50848 
AI198719 Hs.176376 
AL133916 Hs.172572 
AF126743 Hs.279884 
AF189011 Hs.49163 
AW861622 Hs.108646 
AW976438 Hs.17428 
AV660717 Hs.47144 
N76712 Hs.44829 
AA328153 Hs.88201 
AI955411 Hs.94109 
AL133033 Hs.4084 
AL133623 Hs.62501 
N50174 Hs.46765 
AA448588 Hs.71252 
AF191018 Hs.279923 
AI654450 Hs.47274 
AA313607 Hs.58633 
AI272141 Hs.83484 



lymphoid nuclear protein (LAF-4) mRNA 24.2 
Homo sapiens mRNA; cDNA DKFZp434E033 (fr 6.7 
pyruvate dehydrogenase phosphatase 3.8 
fucose-1-phosphate guanylyltransferase 4.4 
ESTs 5.7 
Homo sapiens mRNA; cDNA DKFZp434B231 (fr 3.3 
ESTs 4.2 
fibroblast growth factor 13 4.5 
trichorhinophalangeal syndrome I 4.4 
anterior gradient 2 (Xenopus laevis) hom 4.7 
novel protein similar to archaeal, yeast 5.2 
minichromosome maintenance deficient (S. 4.6 
ets homologous factor 1 3.7 

CGI-76 protein 3.3 
serologically defined colon cancer antig 7.4 
ESTs 4.7 
bromodomain and PHD finger containing, 3 9.1 
hypothetical protein from BCRA2 region 10.1 
ESTs 6.4 
BMP-R1B 35.9 
KIAA1376 protein 9.4 
gb:zs04f05.s1 NCI_CGAP_GCB1 Homo sapiens 11.5 
mannosyl (alpha-1,3-)-glycoprotein beta- 4.2 
hypothetical protein FLJ12428 5.1 
Homo sapiens mRNA full length insert cDN 4.5 
HSPC067 protein 4.8 
ESTs 3.2 
hypothetical protein FLJ20212 3.3 
ESTs 5.8 
hypothetical protein FLJ10116 5.5 
ESTs 11.2 
hypothetical protein FLJ10461 4.5 
anillin (Drosophila Scraps homolog), act 5.9 
leucine zipper, down-regulated in cancer 9.8 
Homo sapiens cDNA FLJ10366 fis, clone NT 4.6 
Homo sapiens mRNA; cDNA DKFZp564D0472 (f 4.4 

3.2 
10.2 
5.2 
7,6 
3.1 
5.7 
3.9 
4.2 
5.8 
6.2 
3.2 
8.4 
3.6 
5.1 
3.4 
3.5 
4.5 
5.2 
3.8 
5.1 
13.3 
3.3 
4.8 
3.2 
3.7 
3.9 
5.6 
3.6 



solute carrier family 2 (facilitated glu 
Homo sapiens pyruvate dehydrogenase kina 
interleukin 6 signal transducer (gp130, 
Homo sapiens, clone IMAGE:3507281, mRNA, 
F-box only proteins 

anterior gradient 2 (Xenopus laevis) hom 
zinc-finger protein ZBRK1 



transducin-like enhancer of split 1, hom 
hypothetical protein MGC5370 
ESTs 
ESTs 

hypothetical protein FLJ20331 
ESTs 

hypothetical protein FLJ20093 
DNAJ domain-containing 
putative ribonuclease III 
Homo sapiens cDNA FLJ14934 fis, clone PL 
RBP1-like protein 
DKFZP586N0819 protein 
ESTs, Weakly similar to I38022 hypotheti 
ESTs, Weakly similar to A Chain A, Cryst 
Homo sapiens cDNA FLJ13634 fis, clone PL 
KIAA1025 protein 

similar to mouse Xrn1 / Dhm2 protein 
ESTs 
AU076916  Hs.5398  guanine monphosphate synthetase 
132668  AB018319  Hs.5460  KIAA0776 protein  4.2  171  41  12.6 
132742  AA025480  Hs.292812  ESTs, Weakly similar to T33468 hypotheti  6.5  65  -  5.6 
132790  AW242243  Hs.168670  peroxisomal farnesylated protein  3.7  37  -  2.2 
13281V  U25435  Hs.57419  CCCTC-binding factor (zinc finger protei  7  115  17  5.4 
132852  AL120050  Hs.58220  Homo sapiens cDNA: FLJ23005 fis, clone L  3.3  61  19  5.1 
132856  NM_001448  Hs.58367  glypican 4  4.8  48  1  3.6 
132880  BE077155  Hs.177537  hypothetical protein DKFZp761B1514  116  126  8  9.9 
132902  AI938442  Hs.59838  hypothetical protein FLJ10808  11  187  17  10.4 
132906  BE613337  Hs.234896  geminin  3.3  f06  33  2.6 
132914  AL047045  Hs.60293  Homo sapiens clone 122482 unknown mRNA  3.5  110  32  2.1 
132968  AF234532  Hs.61638  myosin X  4.1  62  15  4.9 
132977  AA093322  Hs.301404  RNA binding motif protein 3  22.1  221  9  17.8 
132990  X77343  Hs.334334  transcription factor AP-2 alpha (activat  12.7  311  25  2.4 
132994  AA112748  Hs.279905  clone HQ0310 PRO0310p1  3  380  127  5.5 
133011  NM_006379  Hs.171921  sema domain, immunoglobulin domain (Ig),  7.3  271  37  2.3 
133015  AJ002744  Hs.246315  UDP-N-acetyl-alpha-D-galactosamine:polyp  4.6  427  93  10.4 
133070  U92649  Hs.64311  a disintegrin and metalloproteinase doma  3.6  36  -  3.1 
133091  AK001628  Hs.64691  KIAA0483 protein  5.2  117  23  - 
133192  AA218564  Hs.67052  vacuolar protein sorting 26 (yeast homol  3.1  359  118  2.5 
133197  AI275243  Hs.180201  hypothetical protein FLJ20671  5.1  58  12  5.7 
133199  AF231981  Hs.250175  homolog of yeast long chain polyunsatura  -  816  275  3.9 
133221  W32474  Hs.301746  RAP2A, member of RAS oncogene family  3.1  234  76  8.6 
133240  AK001489  Hs^42894  ADP-ribosylation factor-like 1  8.1  81  -  4.6 
133271  Z48633  Hs^83742  H.sapiens mRNA for retrotransposon  12.4  124  -  10.8 
133291  BE297855  Hs.69855  NRAS-related gene  3.3  33  1  - 
133294  AJ001388  Hs.69997  zinc finger protein 238  7.9  234  30  18.9 
133350  AI499220  Hs.71573  hypothetical protein FLJ10074  4.6  46  -  3.5 
133362  AK001519  Hs.7194  CGI-74 protein  -  110  22  9.7 
133370  AF245505  Hs.72157  DKFZP564I1922 protein  3.2  725  227  3.2 
133407  AF017987  Hs.7306  secreted frizzled-related protein 1  4.1  374  91  1.1 
133422  AB033061  Hs.73287  KIAA1235 protein  4.3  43  1  3.9 
133435  AI929357  Hs.323966  Homo sapiens clone H63 unknown mRNA  5.5  186  34  16.5 
133479  W01556  Hs.238797  ESTs, Moderately similar to I38022 hypot  3.5  35  7  2.1 
133493  AW998046  Hs.194369  arginine-glutamic acid dipeptide (RE) re  3.6  39  11  0.4 
133504  NM_004415  Hs.74316  desmoplakin (DPI, DPII)  4.1  640  158  3 
133517  NM_000165  Hs.74471  gap junction protein, alpha 1, 43kD (con  3.2  351  111  5.2 
133536  W25797  Hs.177486  amyloid beta (A4) precursor protein (pro  3.2  226  71  2.8 
133578  AU077050  Hs.75066  translin  3.4  178  53  6.8 
133633  D21262  Hs.75337  nucleolar and coiled-body phosphprotein  4.7  47  1  4 
133640  AW246428  Hs.75355  ubiquitin-conjugating enzyme E2N (homolo  8.5  85  1  72 
133669  NM_006925  Hs.166975  splicing factor, arginine/serine-rich 5  3.6  36  1  0.4 
133681  AI352558  Hs.75544  tyrosine 3-monooxygenase/tryptophan 5-mo  3.4  234  68  10.7 
133746  AW410035  Hs.75862  MAD (mothers against decapentaplegic, Dr  9.3  93  1  7.8 
133765 M62194 Hs.75929 
133780 AA557660 Hs.76152 
133784 BE622743 Hs.301064 
133814 NM_002462 Hs.76391 
133829 AW630088 Hs.76550 
133845 AA147026 Hs.76704 
133913 AU076964 Hs.7753 
133968 AA355986 Hs.232068 
133990 R48316 Hs.7822 
133999 AA535244 Hs.78305 
134032 NM_005025 Hs.78589 
134064 AF091622 Hs.78893 
134087 U51166 Hs.173824 
134089 R51273 Hs.79029 
134095 NM_004354 Hs.79069 
134098 BE513171 Hs.79086 
134110 U41060 Hs.79136 
134125 NM_014781 Hs.50421 
134246 D28459 Hs.80612 
134257 C05768 Hs.8078 
134272 X76040 Hs.278614 
134282 R45621 Hs.81057 
134288 AI022650 Hs.8117 
134321 BE538082 Hs.8172 
134326 AW903838 Hs.81800 
134328 AW959281 Hs.8184 
134348 AW291946 Hs.82065 
134359 NM_001982 Hs.199067 
134367 AA339449 Hs.82285 
134374 N22687 Hs.8236 
134380 AU077143 Hs.179565 
134395 AA456539 Hs.8262 
134401 AI916662 Hs.211577 
134405 AW067903 Hs.82772 
134415 AI750762 Hs.82911 
134417 NM_006416 Hs.82921 
134419 W95642 Hs.82961 
134421 AU077196 Hs.82985 
134436 U29344 Hs.83190 
134485 X82153 Hs.83942 
134487 AF061739 Hs.83954 
134495 D63477 Hs.84087 
134520 BE091005 Hs.74861 
134542 M14156 Hs.85112 
134570 U66615 Hs.17228G 
134590 AW903849 Hs.173840 
134604 NM_002884 Hs.865 
134612 AW068223 Hs.171581 
134643 AW299723 Hs.87223 
134654 AK001741 Hs.8739 
134656 AI750878 Hs.87409 
134672 AF271212 Hs.322901 
134700 AK000606 Hs.8868 
134711 X04011 Hs.88974 
134722 AF129536 Hs.284226 
134856 BE281128 Hs.9030 
134880 AI879195 Hs.90606 
134917 X87241 Hs.166994 
134921 AL137491 Hs.125511 
134982 AK002085 Hs.92308 
134989 AW968058 Hs.92381 
135029 H58818 Hs.187579 
135035 AL034344 Hs.284186 
135051 AI272141 Hs.83484 
135062 AK000967 Hs.93872 
135073 W55956 Hs.94030 
135098 AW274526 Hs.277721 




cadherin 1 1 , type 2, OB-cadherin (osteob 3.2 

decorin 5.4 

arfaptin 1 4.7 

myxovlnjs (influenza) resistance 1 , homo 3.3 
Homo sapiens mRNA; cDNA DKFZp564B1264 {f 6.7 

ESTs 6.2 

calumenin 3.3 

transcription factor 8 (represses tnteri 3.7 
Homo sapiens mRNA; cDNA DKFZp564C1216 (f 3.4 

RAB2. member RAS oncogene family 7.8 

serine (or cysteine) proteinase inhibtto 5.9 

KIAA0244 protein 5.8 

Ihymlne-DNA glycosylase 6.4 

ESTs 5.1 

cyc!inG2 5 

mitochondrial ribosomal protein L3 4.8 

UV-1 protein, estrogen regulated 4.5 

K1AA0203 gene product 4.6 

ubiquith-conjugating erayme E2A (RAD6 h 7 

Homo sapiens clone FBD3 Cri-du-chat crit 3.4 

protease, serine, 15 3.6 

hypothetical protein MGC2718 6.7 

erbb2-interacting protein ERBIN 4.5 

ESTs, Moderately similar to A46010 X-On 5.2 

chondroltin sulfate proteoglycan 2 (vers 8.6 

ESTs 4.8 

interleukin 6 signal transducer (gp130, 7.1 

v-erb-b2 avian erythroblastic leukemia v 3 

phosphoribosyigiycinamlde fofrnyhransfer 4.4 

ESTs 13.3 

minlchromosome maintenance deficient (S. 4.5 

lysosomal 6 

kinecfin 1 (kinesin receptor) 4.1 

collagen, type XI, alpha 1 4.6 

protein tyrosine phosphatase type IVA, m 4.9 

solute earner family 35 (CMP-siaiic aci 4.9 

trefoil factor 3 (intestinal) 3.2 

collagen, type V, alpha 2 6.3 

fatty acid synthase 3.3 

cathepsinKfpycnodysostosis) 34.3 

protein associated with PRK1 4.8 

WAA0143 protein 3.1 

activated RNA polymerase 11 transcriptio 3.3 

insulin-like growth {actor 1 (somatomedi 4.2 

SWI/SNF related, matrix associated, acti 3.9 

HUEL (C4orf1Hnteracting protein 37 

RAP1A, member of RAS oncogene family 5,2 

ubiquitin C-terminal hydrolase UCH37 4.9 

bone morphogenetic protein receptor, typ 5.2 

hypothetical protein FU10879 6.4 

thrombospondin 1 12.6 

disrupter of silencing 1 0 5.4 

golgl SNAP receptor complex member 1 3.4 

cytochrome b-245, beta polypeptide (chro 3.2 

F-box only protein 6 7 

TONDU 3.1 

15 kDa selenoprotein 5.7 

FAT tumor suppressor (Drosophfla) bomoJo 3.2 
Homo sapiens mRNA; cDNA DKFZp434P1530 (f 4 
Homo sapiens cDNA FU1 1 223 fis, clone PL 5.1 

nudix (nucleoside diphosphate linked mot 8.2 

hydroxysterold (17-beta) dehydrogenase 7 1 1 .5 

forkheadboxCI 5.4 

SRY (sex determining region Y)-box 4 3.3 

K1AA1682 protein 3.8 
Homo sapiens mRNA; cDNA DKF2p586E1624 (f 8.1 
ovarian carcinoma antigen CA125 3.3 
135117 W52493 Hs.94694 Homo sapiens cDNA FLJ10561 fis, clone NT 5.3
135144 NM_016255 Hs.95260 Autosomal Highly Conserved Protein 7.4
135154 AK001835 Hs.267812 sorting nexin 4 6.6
135155 AI207958 Hs.166556 Homo sapiens, Similar to TEA domain fami 6.1
135172 AB028956 Hs.12144 KIAA1033 protein 3.4
135242 AI583187 Hs.9700 cyclin E1 3.1
135243 BE453721 Hs.S7101 putative G protein-coupled receptor 3.4
135269 NM_003403 Hs.97496 YY1 transcription factor 3.4
135356 BE312948 Hs.18104 hypothetical protein FLJ11274 3.1
135357 AI555004 Hs.79572 cathepsin D (lysosomal aspartyl protease 4.7
135389 U05237 Hs.99872 fetal Alzheimer antigen 20.6
135397 L14922 Hs.166563 replication factor C (activator 1) 1 (14 3.2
135400 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 3.2

AI471525 Hs.247486 ESTs 3.8
X70683 Hs.93668 ESTs 1.8
L14922 Hs.82128 5T4 oncofetal trophoblast glycoprotein 5
M23263 Hs.904 amylo-1;6-glucosidase; 4-alpha-glucanotransferas
AI267886 Hs.148027 polymerase (RNA) II (DNA directed) polypeptide B
AA044840 Hs.241676 stromal cell-derived factor 1 47
N90960 Hs.227459 ESTs; Moderately similar to !!!! ALU SUBFAMILY
AA873285 Hs.137947 ESTs 4.7
T56679 Hs.865 RAP1A; member of RAS oncogene family 4
AA305536 EST176522 Colon carcinoma (Caco-2) cell line II
AI369384 arylsulfatase D 3.5
AA219081 Hs.242396 ESTs; Moderately similar to !!!! ALU SUBFAMILY
TABLE 10A

Table 10A shows the accession numbers for those pkeys lacking UnigeneID's for Table 10.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



Pkey CAT number Accession

123619 371681_1 AA602984 AA609200
104602 524482_2 H47610 R86920
121581 283769_1 AA416568 AA442889 AA417233 AA442223
123523 genbank_M608588 M608588
100821 tigr_HT4306 M26460 U09116
125091 genbank_T91518 T91518
125150 NOT_FOUND_entrez_W38240 W38240
118475 genbank_N66845 N66845
104787 genbank_AA027317 AA027317
106055 genbankJM17034 AM17034
113702 genbank_T97307 T97307
101046 entrez_K01160 K01160
101447 entrez_M21305 M21305
101624 entrez_M55998 M55998
124677 genbank_R01073 R01073
110581 genbank_H61560 H61560
119023 genbank_N98488 N98488
110775 genbank_N22414 N22414
112092 genbank_R44538 R44538
112253 genbank_R51818 R51818
107014 genbank_AA598820 AA598820
114988 genbank_AA251089 AA251089
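The Pkey, CAT number and Accession columns of Table 10A can be read into a lookup structure. The following is a minimal sketch only, assuming whitespace-separated columns with the Pkey first, the CAT number second and one or more accessions after it. The names `ClusterEntry` and `parse_cluster_table` are hypothetical and are not part of the specification, and the sample rows are transcribed from the table with the underscore form of the CAT number assumed.

```python
from typing import Dict, List, NamedTuple


class ClusterEntry(NamedTuple):
    """One Table 10A style row (hypothetical container name)."""
    cat_number: str
    accessions: List[str]


def parse_cluster_table(text: str) -> Dict[str, ClusterEntry]:
    """Map each Pkey to its CAT number and Genbank accessions.

    Assumes whitespace-separated columns; lines whose first field is
    not numeric (for example the header row) are skipped.
    """
    entries: Dict[str, ClusterEntry] = {}
    for line in text.splitlines():
        fields = line.split()
        if len(fields) < 3 or not fields[0].isdigit():
            continue
        pkey, cat_number, *accessions = fields
        entries[pkey] = ClusterEntry(cat_number, accessions)
    return entries


sample = """\
Pkey CAT number Accession
121581 283769_1 AA416568 AA442889 AA417233 AA442223
100821 tigr_HT4306 M26460 U09116
113702 genbank_T97307 T97307
"""

table = parse_cluster_table(sample)
for pkey, entry in table.items():
    print(pkey, entry.cat_number, ",".join(entry.accessions))
```

Keying on the Pkey reflects its description as a unique probeset identifier; a cluster with several member sequences simply yields a longer accession list.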
TABLE 11: Figure 11 from BRCA 001-3 PCT

Table 11 depicts a preferred group of genes upregulated in tumor tissue compared to normal breast tissue.

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of tumor to normal body tissue
R2: Ratio of 90th percentile tumor to normal body
R3: Ratio of 75th percentile normal body to tumor
R4: Ratio of tumor to normal breast tissue
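The key above names percentile-based comparisons of tumor and normal samples but does not state how the values are calculated. The following sketch shows one plausible reading, offered purely as an assumption: a high percentile of the tumor expression values is divided by a percentile of the normal expression values, with a floor on the denominator so that very low normal signals do not inflate the result. The function names, the linear-interpolation percentile and the `floor` parameter are all hypothetical and are not taken from the specification.

```python
from typing import Sequence


def percentile(values: Sequence[float], pct: float) -> float:
    """Percentile by linear interpolation between order statistics.

    pct is given on a 0-100 scale. This is one common convention;
    the specification does not say which convention was used.
    """
    if not values:
        raise ValueError("values must be non-empty")
    ordered = sorted(values)
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    frac = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


def tumor_to_normal_ratio(
    tumor: Sequence[float],
    normal: Sequence[float],
    tumor_pct: float = 90.0,
    normal_pct: float = 75.0,
    floor: float = 10.0,  # hypothetical noise floor, an assumption
) -> float:
    """Ratio of a tumor percentile to a floored normal percentile."""
    high_tumor = percentile(tumor, tumor_pct)
    high_normal = percentile(normal, normal_pct)
    return high_tumor / max(high_normal, floor)


tumor_samples = [100.0, 200.0, 300.0, 400.0, 500.0]
quiet_normals = [1.0, 2.0, 3.0, 4.0, 5.0]
active_normals = [20.0, 40.0, 60.0, 80.0, 100.0]

print(tumor_to_normal_ratio(tumor_samples, quiet_normals))
print(tumor_to_normal_ratio(tumor_samples, active_normals))
```

Using a high percentile rather than the mean of the tumor samples favors genes that are strongly expressed in a subset of tumors, which is one reason such a summary might be chosen; whether the tables were built this way is not stated.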
Pkey ExAccn UnigeneID Unigene Title
100131 D12485 Hs.11951 



' £££ EX? ?«o^ 8 os ^tspedficfador2(fascidin 
^ 1501 Hs " 949 pnolactWnduced protein 

25 S So« KSS 10 CD44 ant|9en Mon and Indian 
101104 AW862258 Hs.169266 neuropeptide Y receptor Y1

NM_002890 Hs.758 RAS p21 protein activator (GTPase activa
n°7^c^ I] 8 ' 620 ^touspemphigoidanUgen 1(23G/24o£) 
mm IZIL u illtl 9 ^^toxicgranule^sodatedRNAV 
in 3215?? ^ L049610 Hs.95243 transcription elongation factor A (SUV 
102165 BE313280 Hs.159627 death associated protein 3
102304 AF015224 Hs.46452 mammaglobin 1

MM 5 ri 9 «Q,» S 1I« 9 ^ydedehydrogenaseSfamny.member 
102457 NM_001394 Hs.2359 dual specificity phosphatase 4
35 S S t^**47 TRAF family member-associated NFKB activ 
102823 D85390 Hs.5057 carboxypeptidase D
103557 AL133415 Hs.297753 vimentin

103613 NM_000346 Hs.2316 SRY (sex determining region Y)-box 9 (ca

40 ssasssaBHsaaa?"*- 

104896 AW015318 Hs.23165 ESTs epeai<oniainin 9^ 
104943 AF072873 Hs.114218 frizzled (Drosophila) homolog 6
105038 AW503733 Hs.9414 KIAA1488 protein 

105329 AA234561 Hs.22862 ESTs

105500 AW602166 Hs.222399 CEGP1 protein 
K 25XSE u S ' 3 ? 738 ^etical protein FU10407 
S^ 77314 ^' 5364 DKFZP564!052proteb 

_ A 106012 AI240665 Hs.8895 ESTs 

!££ 22325 " S "™ ^kefactor5(etsdomaintransai P t 
106155 AA425414 Hs.33287 nuclear factor I/B
107102 AB037765 Hs.30652 KIAA1344 protein 
107136 AV661958 Hs.8207 GK001 protein 
107151 AW378065 Hs.8687 ESTs 
32S 2 65153855 Hs.61460 IgsuperfamiiyreceptorlNIR 
2 S^U 3 ^ P T ^WeaklyslmilartoALU7JlUMANAlJUS 
Sll? £££215 Hs.257924 hypothetical protein FU13782 
109292 AW975746 Hs.188662 KIAA1702 protein 
109415 U80736 Hs.110826 trinucleotide repeat containing 9 
i 1 2£2 ^f 1528 L ^enine;a^r^rK)adiUaminotra 

U25 , HS ' 6614 E STs.WeaklyslmiIartoA43932mudn2p 

110915 BE092285 Hs.29724 hypothetical protein FLJ13187
111164 N46180 Hs.122489 Homo sapiens cDNA FLJ13289 fis, clone OV
111179 AK000136 Hs.10760 asporin (LRR class 1) 25.1 288 12 6.7
111190 AK002055 Hs.151046 hypothetical protein FLJ11193 6.3 63 1 5.8
111223 AA852773 Hs.334838 KIAA1866 protein 3.6 402 112 4.9
111357 BE314949 Hs.87128 hypothetical protein FLJ23309 3.6 425 111 4
112244 AB029000 Hs.70823 KIAA1077 protein 5.7 567 100 6.7
113047 AI571940 Hs.7549 ESTs 9.6 124 13 9
113702 T97307 gb:ye53h05.s1 Soares fetal liver spleen 1Z3 129 11 11.7
114124 W57554 Hs.125019 lymphoid nuclear protein (LAF-4) mRNA 24.2 242 10 5.6

114138 AW384793 Hs.15740 Homo sapiens mRNA; cDNA DKFZp434E033 (fir 6.7 


67 
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6.3 


114768 AF212848 Hs.182339 ets homologous factor 13.7 137 1 8.9
114860 AL157545 Hs.42179 bromodomain and PHD finger containing, 3 9.1 91 1 7.6
114965 AI733881 HsJ2472 BMP-R1B 35.9 359 10 29.7
114988 AA251089 gb:zs04f05.s1 NCI_CGAP_GCB1 Homo sapiens 11.5 115 1 6.9


115206 AW1A3G95 Hs 18G572 ESTs 


5.8 


58 


1 


5 


115719 AW992405 Hs 59622 Homo saDiens done IMAGE-3507281 mRNA 


7.6 


144 


19 


13.9 


115844 AI373082 Hs 33293ft hvooihetical orotein MGC5370 


6.2 • 


62 


1 


5.4 


11R47H A1272141 Hs R34R4 ^RY Iwi riphvmWna realon YVhnx 4 


1.8 


1047 596 


1.6 


11fi7Afi H95fl3fi Hq ^01597 PQTq MnrtArafptv clmtlartafmknnumfHs 
110/00 nioooo no.ouiJZf cots, ivwuciaiciy suiwai ujunwiuwiiino 


22.8 


228 


9 


12.4 


117280 M18217 Hs 172129 Homo saDiens cDNA: FLJ21409 fis done C 


3.9 


322 


83 


4.4 


117419 KrtO^fi V4c AORAK end ito pernor famlh/ 1R fnvtnnr^rivtyvtlr 
Ml Hit. fMO&OOO no/ttOHO oUIUU* LoliRSI loll Illy 10 ^lltUIIUUalUUAyiiU 


17.4 


174 


9 


6.9 


11R479 Al 157545 H<? 4917Q hrnmnHnrrwin and PHD fmnpf mntatninn 3 


145 


145 


1 


2.4 


11Q271 Al0fi111fl H<? fi532fl Fanmnl flnpjnta rnmntemenlatton nroun F 


8.2 


82 


1 


6.4 


119771 AI905687 Hs2533 EST 


3.5 


2073 595 


Z1 


120562 RF9445R0 Hs3099fi7 hvnnthpfiral motpjn R J10330 
liUJOi. Dc^ftoou no. JUixu/ iiypuuicuiai piuicui ru iujou 


8.5 


197 
let 


15 

IO 


1.6 


121463 AKQ002fl2 Hs23°.6R1 hvoofhptieal orote'm FU20275 


10.3 


103 
IUO 


4 

1 


9.3 


121723 AA243499 Rs 104S0O hvnnfhetical nrotein FLJ10134 


2.Q, 


214 

C l*t 


74 


3.7 


122963 AA47R446 RsRQ559 K1AA109B nrntein 


7.2 


79 




5.7 


mi*V7 Alft7?Q1*X He MOfiftfi P<?Tc lAfaaklv similar In IP0350 Anterior 


q q 


351 


36 


13.9 




8.5 


85 


1 


4.3 


173709 AA70RQ10 Hq 119749 F^Tq 
l COI Wj Mn/U09lv no. 11*14* coio 


3.9 


60 


"16 


4.8 


i0Af)f\R AM471R*; We 770niR PQTo 


5 ft 


321 


55 


17 


194059 RF3A7135 Hc9A3711 P^Tc Wpakh/ clmitar tr\ Qfi4054 hvnrithpfi 
i^hujo ocoof ooo no. too/ 10 coi 5, v? ccuuy duiiiku iu oohuoh nypuumu 


104 


880 


85 


5.3 


lOil'tnn AA9^0097 Ue 9<H*Jrt7 rilvienmol nmlaln Cfi 

1Z40UO pj\z.<*y\)c( ns.z4iou/ nrosuTOai proiein 00 


10 5 
lv.0 


105 


1 


9.9 


140£fo nS-4//y rJAAl 1 0U pfOieu) 


10. 1 


131 


1 


5.1 


4'>CC17 AAOfl^iyM Uo 4CXQCO COT« 


R 7 


67 


1 


6 


1974^0 nA09^7 He 143Rft QHT Hn main hJnrlinn ntirtamir artrLrirh nr 

I^/Hjo uouzo/ ns.iHooo ono oomain DuKJing yiuiarnK/auiMicii pr 


30 R 
ou.o 


306 


4 


265 


izoow Ajyt>4yoo ns.z/buuy mamx oia proiem 


7 S 


75 


1 


6.5 


128482 AI694143 Hs.296251 programmed cell death 4 7.2 72 1 5.8
128790 AF026692 Hs.105700 secreted frizzled-related protein 4 17.4 409 24 7.8
128925 R67419 Hs.21851 Homo sapiens cDNA FLJ12900 fis, clone NT 7.1 392 56 3.6
129017 AA115333 Hs.107988 ESTs 8.2 82 1 7.4
129229 AF013758 Hs.109643 polyadenylate binding protein-interactin 7.1 71 1 6.2
129337 NM_014918 Hs.110488 KIAA0990 protein 9.5 95 1 8.5
129366 BE220806 Hs.184697 Homo sapiens clone 23785 mRNA sequence 7.1 150 21 14.5
129821 AB028945 Hs.12696 cortactin SH3 domain-binding protein 11.4 114 1 10
130036 BE061916 Hs.125849 chromosome 8 open reading frame 2 6.7 67 1 5.7
130057 AF027153 Hs.324787 solute carrier family 5 (inositol transp 1 1 1 1
130095 AK001635 Hs.14838 hypothetical protein FLJ10773 14.6 219 15 7.6
130343 AB040914 Hs.278628 KIAA1481 protein 132 331 25 12.4
130385 AW067800 Hs.155223 stanniocalcin 2 72.2 722 1 1.9
130407 BE385099 Hs.334727 hypothetical protein MGC3017 6.5 65 4 5.3
130441 U63630 Hs.155637 protein kinase, DNA-activated, catalytic 6.1 61 1 5.7
130455 D90041 Hs.155956 N-acetyltransferase 1 (arylamine N-acety 10.8 706 66 92
130604 AA383256 Hs.1657 estrogen receptor 1 32.2 322 1 4.7
130617 M90516 Hs.1674 glutamine-fructose-6-phosphate transamin 10 100 1 7.6
130712 AJ271881 Hs.279762 bromodomain-containing 7 175 175 2 12.8
131148 AW953575 Hs.303125 p53-induced protein PIGPC1 3.8 585 153 3.7
131388 NM_014810 Hs.92200 KIAA0480 gene product 7.6 76 1 5
131584 T93500 Hs.28792 Homo sapiens cDNA FLJ11041 fis, clone PL 4.7 381 81 6.4
131742 AA961420 Hs.31433 ESTs 11.7 117 1 10.1
131877 J04088 Hs.155346 topoisomerase (DNA) II alpha (170kD) 6.8 68 1 5.6
131985 AA503020 Hs.36563 hypothetical protein FLJ22418 40.2 402 1 4
132316 U28831 Hs.44566 KIAA1641 protein 18.6 186 10 15
132528 T78736 Hs.50758 SMC4 (structural maintenance of chromoso 9.3 93 1 8.4
132742 AA025480 Hs.292812 ESTs, Weakly similar to T33468 hypotheti 6.5 65 1 5.6
132990 X77343 Hs.334334 transcription factor AP-2 alpha (activat 12.7 311 25 2.4
133015 AJ002744 Hs.246315 UDP-N-acetyl-alpha-D-galactosamine:polyp 4.6 427 93 10.4
133199 AF231981 Hs.250175 homolog of yeast long chain polyunsatura
133240 AK001489 Hs.242894 ADP-ribosylation factor-like 1
133271 Z48633 Hs.283742 H.sapiens mRNA for retrotransposon
133640 AW246428 Hs.75355 ubiquitin-conjugating enzyme E2N (homolo
133746 AW410035 Hs.75862 MAD (mothers against decapentaplegic, Dr
133999 AA535244 Hs.78305 RAB2, member RAS oncogene family
134110 U41060 Hs.79136 LIV-1 protein, estrogen regulated
134485 X82153 Hs.83942 cathepsin K (pycnodysostosis)
134654 AK001741 Hs.8739 hypothetical protein FLJ10879
134880 AI879195 Hs.90606 15 kDa selenoprotein
135029 H58818 Hs.187579 hydroxysteroid (17-beta) dehydrogenase 7
135389 U05237 Hs.99872 fetal Alzheimer antigen
128305 AI954968 Hs.279009 matrix Gla protein



3 


816 


275 


3.9 


8.1 


81 


1 


4.6 


12.4 


124 


6 


10.8 


8.5 


85 


1 


72 


9.3 


93 


1 


7.8 


7.8 


78 


1 


5.6 


4.5 


1472 330 


2.1 


34.3 


411 


12 


5.1 


6.4 


64 


1 


5.1 


5.7 
115 


57 


4 
1 


c 

3 


115 


1 


10 


20.6 


206 


4 


19.1 


9.4 


94 


3 


5.3 
TABLE 11A

Table 11A shows the accession numbers for those pkeys lacking UnigeneID's for Table 11.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey CAT number Accession

123619 371681_1 AA602964 AA609200
113702 genbank_T97307 T97307
114988 genbank_AA251089 AA251089
TABLE 12: Figure 12 from BRCA 001-3 PCT

Table 12 depicts a preferred group of genes upregulated in tumor tissue compared to normal
breast tissue.

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of tumor to normal body tissue
R2: Ratio of 90th percentile tumor to body
R3: Ratio of 75th percentile body to tumor
R4: Ratio of tumor to normal breast tissue

Pkey ExAccn UnigeneID Unigene Title R1 R2 R3 R4

100131 D12485 Hs.11951 phosphodiesterase I (PC-1) 13.2 244 19 9.9
105500 AW602166 Hs.222399 ESTs 25.4 508 20 3
112244 AB029000 Hs.70823 KIAA1077 protein 5.7 567 100 6.7
114124 W57554 Hs.125019 ESTs 24.2 242 10 5.6
119771 AI905687 Hs.2533 ESTs 3.5 2073 595 2.1
121723 AA243499 Hs.104800 ESTs 2.9 214 74 3.7
128790 AF026692 Hs.105700 secreted frizzled-related protein 4 17.4 409 24 7.8
131148 AW953575 Hs.303125 ESTs 3.8 585 153 3.7
131985 AA503020 Hs.36563 ESTs 40.2 402 1 4
133199 AF231981 Hs.250175 Homo sapiens clone 23904 mRNA sequence 3 816 275 3.9
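Rows laid out as Pkey, ExAccn, UnigeneID, Unigene Title and four ratios can be split mechanically, which also helps to flag rows whose numeric fields did not survive transcription. The sketch below is illustrative only. It assumes a six-digit Pkey, an optional UnigeneID of the form `Hs.` followed by digits, and exactly four trailing numeric fields; the names `ExpressionRow` and `parse_row` are hypothetical.

```python
import re
from typing import List, NamedTuple, Optional


class ExpressionRow(NamedTuple):
    """One table row (hypothetical container name)."""
    pkey: str
    accession: str
    unigene_id: Optional[str]
    title: str
    ratios: List[float]


# Pkey, accession, optional UnigeneID, then title plus ratio fields.
ROW_PATTERN = re.compile(
    r"^(?P<pkey>\d{6})\s+(?P<acc>\S+)\s+"
    r"(?:(?P<unigene>Hs\.\d+)\s+)?(?P<rest>.+)$"
)


def parse_row(line: str, n_ratios: int = 4) -> Optional[ExpressionRow]:
    """Return the parsed row, or None when the row is malformed."""
    match = ROW_PATTERN.match(line.strip())
    if match is None:
        return None
    tokens = match.group("rest").split()
    if len(tokens) < n_ratios:
        return None
    try:
        # The last n_ratios tokens must all be numeric (R1 to R4).
        ratios = [float(token) for token in tokens[-n_ratios:]]
    except ValueError:
        return None
    title = " ".join(tokens[:-n_ratios])
    return ExpressionRow(
        match.group("pkey"),
        match.group("acc"),
        match.group("unigene"),
        title,
        ratios,
    )


good = parse_row(
    "100131 D12485 Hs.11951 phosphodiesterase I (PC-1) 13.2 244 19 9.9"
)
damaged = parse_row(
    "112244 AB029000 Hs.70823 KIAA1077 protein 5.7 567 100 b./"
)
print(good)
print(damaged)
```

Taking the ratios from the end of the line rather than the start keeps titles that themselves end in a number, such as a protein name followed by "4", intact.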
TABLE 13: Table 1 from BRCA 001-5 US
Table 13 depicts a preferred group of genes upregulated in breast cancer cells.

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of tumor to normal body tissue

Pkey ExAccn UnigeneID Unigene Title R1

100038 M97935 control 16.7
100039 M97935 control 6.3
100040 M97935 control 8.3
100041 M97935 control 14.8
100082 AB003103 Hs.4295 proteasome (prosome; macropain) 26S sub 7.5
100091 AF000177 Hs.111783 Lsm1 protein 4.9
100100 AF006084 Hs.11538 actin related protein 2/3 complex; subunit 4.7
100103 AF007875 Hs.5085 dolichyl-phosphate mannosyltransferase p 13.4
100114 D00596 Hs.62962 thymidylate synthetase 15.9
100121 D10495 Hs.155342 protein kinase C; delta 4.6
100123 D10523 Hs.168669 oxoglutarate dehydrogenase (lipoamide) 7.5
100128 D11094 Hs.61153 proteasome (prosome; macropain) 26S sub 4.4
100131 D12485 Hs.11951 phosphodiesterase I/nucleotide pyrophosp 8.7
100137 D13627 Hs.15071 chaperonin containing TCP1; subunit 8 (t 9.5
100144 D13643 Hs.75616 Human mRNA for KIAA0018 gene; comp 6
100147 D13666 Hs.136348 osteoblast specific factor 2 (fasciclin I-like 8.5
100154 D14657 Hs.81892 KIAA0101 gene product 10.5
100164 D14812 Hs.173714 MORF-related gene X 4.6
100169 D14878 Hs.82043 D123 gene product 7.9
100190 D21090 Hs.178658 RAD23 (S. cerevisiae) homolog B 5.6
100203 D25538 Hs.172199 adenylate cyclase 7 9.9
100209 D26308 Hs.76289 biliverdin reductase B (flavin reductase (N 4.9
100215 D26598 Hs.82793 proteasome (prosome; macropain) subunit 14.2
100216 D26599 Hs.1390 proteasome (prosome; macropain) subunit 11.3
100219 D28137 Hs.118110 bone marrow stromal cell antigen 2 5.7
100227 D28915 Hs.82316 interferon-induced; hepatitis C-associated 5.7
100248 D31888 Hs.78398 KIAA0071 protein 7.4
100287 D43950 Hs.1600 chaperonin containing TCP1; subunit 5 (e 5.6
100294 D49396 Hs.75454 antioxidant protein 1 12.9
100307 D50525 Hs.699 hypothetical protein 8.4
100335 D63391 Hs.6793 platelet-activating factor acetylhydrolase; 6.8
100340 D63487 Hs.82563 KIAA0153 protein 4.4
100355 D78129 Hs.71465 Homo sapiens mRNA for squalene epoxid 12.6
100363 D78514 Hs.78563 ubiquitin-conjugating enzyme E2G 1 (hom 4.6
100368 D79987 Hs.153479 extra spindle poles; S. cerevisiae; homolo 6.5
100372 D79997 Hs.184339 KIAA0175 gene product 8.4
100375 D80004 Hs.75909 KIAA0182 protein 4.5
100379 D82060 Hs.278721 Ke4 gene; mouse; human homolog of 8.1
100387 D83777 Hs.75137 KIAA0193 gene product 10.7
100393 D84145 Hs.39913 novel RGD-containing protein 72
100398 D84557 Hs.155462 minichromosome maintenance deficient (m 72
100405 D85425 Hs.82733 nidogen 2 5.4
100406 D86479 Hs.118397 AE-binding protein 1 4.3
100409 D86957 Hs.80712 KIAA0202 protein 11.9
100421 Hs.79276 Human mRNA for KIAA0232 gene; comp 9.7
100446 D87464 Hs.10037 KIAA0274 gene product 6.4
100447 D87465 Hs.74583 KIAA0275 gene product 10
100448 Hs.57652 EGF-like-domain; multiple 2 6.2
100467 D89052 Hs.7476 ATPase; H+ transporting; lysosomal (vacu 7.5
100468 D89289 Hs.118722 fucosyltransferase 8 (alpha (1;6) fucosyltr 5
100486 HT1112 Hs.10842 Ras-Like Protein Tc4 16.9
100497 HT1400 Hs.79137 Carboxyl Methyltransferase, Aspartate, A 5.6
100618 HT2710 Hs.114599 Collagen, Type VIII, Alpha 1 7.5
100661 HT3018 Hs.132748 Ribosomal Protein L39 Homolog 4.4
100667 HT3127 Hs.169610 Epican, Alt. Splice 11 4.6
100668 HT3938 Hs.169610 Epican, Alt. Splice 12 4.4
100676 HT3742 Hs.287820 Fibronectin, Alt. Splice 1 9
100775 HT26388 Hs.69603 Mucin 1, Epithelial, Alt. Splice 9 4.7
100783 HT4018 Hs.191356 Basic Transcription Factor, 44 Kda Subun 13.7
100829 HT4343 Hs.278544 Cytosolic Acetoacetyl-Coenzyme A Thio 10.6
100830 HT4344 Hs.4756 Rad2 5.5
100840 HT4392 Hs.183418 Protein Kinase Pitslre, Alpha, Alt. Splice 4.1
100850 HT417 Hs.297939 Cathepsin B 4
100866 HT4582 Hs.75113 Transcription Factor IIIa 4.9
100906 HT5158 Hs.5398 Guanosine 5'-Monophosphate Synthase 8.7
100914 HT511 Hs.324178 Ras Inhibitor Inf 7.2
100916 HT544 Hs.73946 Endothelial Cell Growth Factor 1 5.9
100945 HT884 Hs.180686 Oncogene E6-Ap, Papillomavirus 4.6
100975 J02923 Hs.76506 lymphocyte cytosolic protein 1 (L-plastin 30.1
100988 J03589 Hs.76480 ubiquitin-like 4 8.3
100996 J03909 Hs.14623 interferon; gamma-inducible protein 30 6.9
100999 J03934 Hs.80706 diaphorase (NADH/NADPH) (cytochrom 4.3
101011 J04430 Hs.1211 acid phosphatase 5; tartrate resistant 5.9
101017 J04599 Hs.821 biglycan 5.1
101031 J05070 Hs.151738 matrix metalloproteinase 9 (gelatinase B; 37.2
101038 J05249 Hs.79411 replication protein A2 (32kD) 6.1
101054 K02405 Hs.73931 Human MHC class II HLA-DQ-beta mRN 4.3
101061 K03515 Hs.180532 glucose phosphate isomerase 4.3
101091 L06132 Hs.149155 voltage-dependent anion channel 1 7.4
101097 L06797 Hs.89414 chemokine (C-X-C motif); receptor 4 (fus 4.6
101104 L07615 Hs.169266 Human neuropeptide Y receptor Y1 (NPY 18.3
101143 L12723 Hs.90093 heat shock 70kD protein 4 17.4
101152 L13800 Hs.9884 Homo sapiens liver expressed protein gen 7.6
101183 L19779 Hs.795 H2A histone family; member O 10.9
101216 L25876 Hs.84113 cyclin-dependent kinase inhibitor 3 (CDK 7.4
101233 L29008 Hs.878 sorbitol dehydrogenase 14.6
101247 L33801 Hs.78802 glycogen synthase kinase 3 beta 7.5
101282 L38810 Hs.79387 proteasome (prosome; macropain) 26S sub 4.4
101326 L42572 Hs.78504 inner membrane protein; mitochondrial (m 5.8
101332 L47276 Hs.156346 Homo sapiens (cell line HL-6) alpha topo 18.9
101348 L77213 Hs.30954 phosphomevalonate kinase 7.5
101352 L77701 Hs.16297 COX17 (yeast) homolog; cytochrome c ox 9.3
101378 M13755 Hs.833 interferon-stimulated protein; 15 kDa 18.1
101396 M15796 Hs.78996 proliferating cell nuclear antigen 8.6
101404 M16342 Hs.182447 heterogeneous nuclear ribonucleoprotein C 4.5
101439 M20902 Hs.268571 apolipoprotein C-I 6.1
101464 M22538 Hs.51299 NADH dehydrogenase (ubiquinone) flavo 8.7
101469 M22877 Hs.169248 Human somatic cytochrome c (HCS) gene 4.2
101472 M22960 Hs.118126 protective protein for beta-galactosidase ( 6.5
101478 M23379 Hs.758 RAS p21 protein activator (GTPase activa 14
101484 M24594 Hs.20315 interferon-induced protein 56 d2
101539 M30818 Hs.926 myxovirus (influenza) resistance 2; homol 5.1
101540 M30938 Hs.84981 X-ray repair complementing defective rep 4.7
101544 M31169 Human propionyl-CoA carboxylase beta-s 5.5
101552 M31642 Hs.82314 hypoxanthine phosphoribosyltransferase 1 8.5
101580 M34677 Hs.83363 DNA segment on chromosome X (unique) 4.5
101600 M37583 Hs.119192 H2A histone family; member Z 5.7
101663 M60750 Hs.2178 H2B histone family; member A 5.8
101664 M60752 Hs.121017 H2A histone family; member A 135
101667 M60858 Hs.79110 nucleolin 4
101684 M63256 Hs.75124 cerebellar degeneration-related protein (62 7.6
101702 M64929 Hs.179574 protein phosphatase 2 (formerly 2A); regu 4.2
101754 M77142 Hs.239489 TIA1 cytotoxic granule-associated RNA-b 4.5
101758 M77836 Hs.79217 pyrroline-5-carboxylate reductase 1 5.7
101767 M81057 Hs.180884 carboxypeptidase B1 (tissue) 21.7
101770 M81601 Hs.78869 transcription elongation factor A (SII); 1 4.6
101791 M83822 Hs.62354 cell division cycle 4-like 9.7
101803 M86546 Hs.155691 pre-B-cell leukemia transcription factor 1 5.5
101809 M86849 Hs.323733 Homo sapiens connexin 26 (GJB2) mRNA 22.5
101839 M93036 Hs.692 membrane component; chromosomal 4; su 4
101851 M94250 Hs.82045 midkine (neurite growth-promoting factor 7.6
101888 M99701 Hs.95243 transcription elongation factor A (SII)-like 11.4
101973 S82597 Hs.80120 UDP-N-acetyl-alpha-D-galactosamine:po 4.6
101991 U00968 Hs.166 Human SREBP-1 mRNA; complete cds 4.1
102009 U02680 Hs.82643 protein tyrosine kinase 9 4.4
102025 U03911 Hs.78934 mutS (E. coli) homolog 2 (colon cancer; n 4
102047 U07158 Hs.83734 syntaxin 4A (placental) 6.1
102051 U07550 Hs.1197 heat shock 10kD protein 1 (chaperonin 10 4.4
102083 U10323 Hs.75117 interleukin enhancer binding factor 2; 45k 10.4
102095 U11313 Hs.75760 sterol carrier protein 2 9.5
102130 U15009 Hs.1575 small nuclear ribonucleoprotein D3 polyp 6.6
102133 U15173 Hs.155596 BCL2/adenovirus E1B 19kD-interacting p 4.3
102148 U16954 Hs.75823 ALL1-fused gene from chromosome 1q 6.9
102179 U19713 Hs.76364 allograft inflammatory factor 1 4.8
102180 U19718 Hs.63551 microfibrillar-associated protein 2 7.2
102193 U20758 Hs.313 secreted phosphoprotein 1 (osteopontin; b 7.2
102198 U2W0 Hs.74598 polymerase (DNA directed); delta 2; regu 4.3
102202 U21931 Hs.574 fructose-bisphosphatase 1 4.5
102209 U22970 Hs.265827 interferon; alpha-inducible protein (clone 9.9
102211 U23070 Hs.78776 putative transmembrane protein 4.9
102220 U24389 Hs.65436 lysyl oxidase-like 1 8.5
102224 U24704 Hs.148495 proteasome (prosome; macropain) 26S sub 5.4
102234 U26312 Hs.278554 chromobox homolog 3 (Drosophila HP1 g 7.7
102250 U28014 Hs.74122 caspase 4; apoptosis-related cysteine prot 5.4
102260 U28386 Hs.159557 karyopherin alpha 2 (RAG cohort 1; impo 6.3
102261 U28488 Hs.155935 complement component 3a receptor 1 5.7
102273 U30888 Hs.75981 ubiquitin specific protease 14 (tRNA-guan 6.1
102298 U32849 Hs.54483 N-myc (and STAT) interactor 4.1
102302 U33052 Hs.69171 protein kinase C-like 2 4.3
102305 U33286 Hs.90073 chromosome segregation 1 (yeast homolo 5.4
102320 U34683 Hs.82327 glutathione synthetase 4.1
102330 U35451 Hs.77254 chromobox homolog 1 (Drosophila HP1 b 4
102348 U37519 Hs.87539 aldehyde dehydrogenase 8 9.4
102361 U39400 Hs.75859 chromosome 11 open reading frame 4 5.2
102362 U39412 Hs.75932 N-ethylmaleimide-sensitive factor attachm 9.3
102369 U39840 Hs.299867 hepatocyte nuclear factor 3; alpha 7.7
102395 U41767 Hs.92208 a disintegrin and metalloproteinase domai 10.4
102409 U43266 Hs.1W25 selenophosphate synthetase 2 6.2
102418 U43923 Hs.79058 suppressor of Ty (S. cerevisiae) 4 homolog 4.1
102425 U44772 Hs.3873 palmitoyl-protein thioesterase (ceroid-lipo 4.8
102457 U48807 Hs.2359 dual specificity phosphatase 4 6.3
102465 U49352 Hs.81548 2;4-dienoyl CoA reductase 1; mitochondri 9.4
102495 U51240 Hs.79356 Lysosomal-associated multispanning mem 6.5
102534 U56833 Hs.198307 von Hippel-Lindau binding protein 1 8.6
102546 U57877 Hs.3577 succinate dehydrogenase complex; subuni 4.3
102549 U58046 Hs.198899 eukaryotic translation initiation factor 3; s 6.3
102557 U58766 Hs.264428 tissue specific transplantation antigen P35 5
102562 U59309 Hs.75653 fumarate hydratase 6
102568 U59877 Hs.223025 RAB31; member RAS oncogene family 9.1
102580 U60808 Hs.152981 CDP-diacylglycerol synthase (phosphatid 7.9
102581 U61145 Hs.77256 enhancer of zeste (Drosophila) homolog 2 7.6
102590 U62136 Hs.79300 Homo sapiens enterocyte differentiation a 7
102591 U62325 Hs.324125 amyloid beta (A4) precursor protein-bindi 4
102592 U62389 Hs.11223 Human putative cytosolic NADP-depende 5
102617 U65928 Hs.198767 Jun activation domain binding protein 6.1
102618 U65932 Hs.81071 extracellular matrix protein 1 23.2
102638 U67319 Hs.9216 caspase 7; apoptosis-related cysteine prot 8.9
102663 U70322 Hs.168075 karyopherin (importin) beta 2 7.1
102666 U706Q0 Hs.2799W ATX1 (antioxidant protein 1; yeast) homo 4.7
102679 U72661 Hs.11342 ninjurin 1; nerve injury-induced protein-1 4.7
102687 U73379 Hs.93002 ubiquitin carrier protein E2-C 7.7
102704 U76638 Hs.54089 BRCA1 associated RING domain 1 5.6
102705 U77180
102721 U79241
102729 U79254
102739 U79282
102742 U79293
102761 U82130
102788 U86602
102790 U87269
102801 U89606
102808 U90426
102817 U90904
102823 U90914
102827 U91327
102838 U94592
102841 U95006
102844 U96113
102868 X02419
102907 X06985
102919 X12447
102929 X13238
102973 X16663
102983 X17620
102985 X17644
103003 X52003
103018 X53296
103023 X53793
103036 X54925
103060 X57766
103073 X59417
103075 X59543
103080 X59798
103094 X60787
103105 X61970
103121 X63679
103149 X66363
103180 X69433
103182 X69819
103188 X70040
103191 X70218
103193 X70476
103194 X70649
103195 X70940
103206 X72755
103207 X72790
103208 X72841
103216 X74262
103226 X75042
103230 X75861
103262 X78565
103278 X79882
103297 X81788
103302 X82103
103316 X63301
103330 X85373
103349 X89059
103352 X89398
103364 X90872
103374 X91788
103380 X92396
103395 X94754
103402 X95404
103410 X96506
103420 X97065
103421 X97074
103427 X97303
103430 X97544
103438 X98263



Hs.50002 

H8.1 18666 

Hs.181311 

Hs.155572 

Hs.159264 

Hs.1 18910 

Hs.74407 

Hs.154196 

Hs.38041 

Hs.179606 

Hs.83724 

Hs.5057 

Hs.6456 

Hs.80658 

Hs.37616 

Hs.324275 

Hs.77274 

Hs.202833 

Hs.74649 

Hs.14601 

Hs.1 18638 

Hs.2707 

Hs.1406 

Hs.81134 

Hs.117950 

Hs.83169 

Hs.155324 

Hs.74077 

Hs.2934 

Hs.82932 

Hs.296281 

Hs.76913 

Hs.4147 

Hs.171834 

Hs.5337 

Hs.99995 

Hs.2942 

Hs.2903 

Hs.75724 

Hs.78580 

Hs.2642 

Hs.77367 

Hs.31314 

Hs.16003 

Hs.44313 

Hs.74637 

Hs.289114 

Hs.80680 

Hs.9078 

Hs.3059

Hs.324728 

Hs.77496 

Hs.78853 

Hs.279929 

Hs.84974 

Hs.24167 

Hs.279946 

Hs.180370

Hs.295362 

Hs.173497

Hs.119591

Hs.250655 

Hs.20716 

Hs.152720 



small inducible cytokine subfamily A (Cy 


11.8 


Human clone 23759 mRNA; partial cds


15 


asparaginyl-tRNA synthetase


5 


Human clone 23801 mRNA sequence 


6 


Human clone 23948 mRNA sequence


13.1 


tumor susceptibility gene 101 


7 


nucleolar protein p40 


4.1 


E4F transcription factor 1


7.1 


pyridoxal (pyridoxine; vitamin B6) kinase


4.7 


nuclear RNA helicase; DECD variant of D


7.5 


Human clone 23773 mRNA sequence


15.2 


carboxypeptidase D


6.6 


chaperonin containing TCP1; subunit 2 (b


6 


Human uncoupling protein homolog (UCP 


6.1 


Human 09 splice variant 8 mRNA; comp 


4.2 


Homo sapiens Nedd-4-like ubiquitin-prot 


6.8 


plasminogen activator; urokinase 


4 


heme oxygenase (decycling) 1


22.7 


aldolase A; fructose-bisphosphate 


9.9 


cytochrome c oxidase subunit VIc


5.4


hematopoietic cell-specific Lyn substrate 


4.8


non-metastatic cells 1; protein (NM23A) 


4.6 


G1 to S phase transition 1 


20.6 


trefoil factor 1 (breast cancer, estrogen-ind


10.7 


interleukin 1 receptor antagonist


5.8 


multifunctional polypeptide similar to SA 


4 


matrix metalloproteinase 1 (interstitial col


7.3 


matrix metalloproteinase 11 (stromelysin


17.8 


proteasome (prosome; macropain) subunit 


5.6 


ribonucleotide reductase M1 polypeptide 


4.2 


cyclin D1 (PRAD1; parathyroid adenomat


6.7 


interleukin enhancer binding factor 1


5.7 


proteasome (prosome; macropain) subunit 


5.8 


translocating chain-associating membrane


4.2 


PCTAIRE protein kinase 1 


12 


isocitrate dehydrogenase 2 (NADP+); mit


18.9 


intercellular adhesion molecule 3 


10.7 


macrophage stimulating 1 receptor (c-met 


4.1 


protein phosphatase 4 (formerly X); cataly


10.7 


coatomer protein complex; subunit beta 2 


8.2 


DEAD/H (Asp-Glu-Ala-Asp/His) box pol


13.7 


eukaryotic translation elongation factor 1


13.4 


monokine induced by gamma interferon 


15.1 


Human endogenous retrovirus mRNA for 


5.3 


retinoblastoma-binding protein 7


12.3 


retinoblastoma-binding protein 4


4.1 


v-rel avian reticuloendotheliosis viral onco


6.9 


testis enhanced gene transcript 


79 


hexabrachion (tenascin C; cytotactin)


c 
o 


lung resistance-related protein 


5.7 


Immature colon carcinoma transcript 1 


4.6 


coatomer protein complex; subunit beta 


4.5 


SMA5 


7.1 


small nuclear ribonucleoprotein polypepti


4 


serine/threonine kinase 9 


4.7 


uracil-DNA glycosylase


5.3 


SULT1C sulfotransferase


4 


chloride channel; nucleotide-sensitive; 1A


4.2 


synaptobrevin-like 1 


13.6 


methionine-tRNA synthetase


14.2 


cofilin 1 (non-muscle) 


4.6 


DR1-associated protein 1 (negative cofact


8.3 


Sec23 (S. cerevisiae) homolog B


4.9 


adaptor-related protein complex 2; sigma 


5 


H.sapiens mRNA for Ptg-12 protein


7 


translocase of inner mitochondrial membr


4.5 


M-phase phosphoprotein 6


4.5 






103464 Y00285 Hs.76473 insulin-like growth factor 2 receptor 4.2 

103470 Y00796 Hs.174103 integrin; alpha L (antigen CD11A (p180); 4.5

103494 Y08991 Hs.83050 phosphatidylinositol 3-kinase-associated p 4.1

103505 Y09912 Hs.33102 transcription factor AP-2 beta (activating 4.5

103547 Z14982 Hs.180062 proteasome (prosome; macropain) subunit 4.3

103551 Z15115 Hs.75248 topoisomerase (DNA) II beta (180kD) 4

103565 Z22548 Hs.146354 thioredoxin-dependent peroxide reductase 7.6

103587 Z29083 Hs.82128 5T4 oncofetal trophoblast glycoprotein 14.6

103621 Z47727 Hs.150675 polymerase (RNA) II (DNA directed) pol 6.3

103622 Z48042 Hs.278672 membrane component; chromosome 11; s 4.4

103658 Z74615 Hs.172928 collagen; type I; alpha 1 5.9

103680 Z93784 Homo sapiens DNA sequence from PAC 4.4

103772 AA092473 Hs.278554 chromobox homolog 3 (Drosophila HP1 g 4.9

103774 AA092898 Hs.92918 ESTs; Weakly similar to R07G3^[C.eleg 6.1

103821 AA157623 Hs.198793 KIAA0750 gene product 23.3

103835 AA172215 Hs.93748 ESTs; Moderately similar to TRANSCRIP 4 

103886 AA236384 Hs.105737 ESTs; Weakly similar to gene 9306 protei 4.9

103890 AA236843 Hs.72085 ESTs; Weakly similar to unknown [S.cere 7.8 

103892 AA243523 Hs.239189 ESTs 4.8 

104054 AA393432 Hs.7100 hypothetical protein 5.3

104115 AA428090 Hs.26102 ESTs 28.7 

104136 AA442669 Hs.268371 zv68f6.r1 Soares_total_fetus_Nb2HF8_9w 5.7

104147 AA451992 Hs.283037 ESTs; Highly similar to HSPC039 protein 6.9 

104173 AA476564 Hs.76561 ESTs; Weakly similar to finger protein HZ 5.2 

104181 AA479521 Hs.283740 ESTs 7.8

104183 AA480838 Hs.114309 ESTs 5.1

104192 AA486946 Hs.21321 Homo sapiens mRNA; cDNA DKFZp564 4.3 

104209 AB000221 Hs.16530 small inducible cytokine subfamily A (Cy 12.3

104234 AB002357 Hs.168212 kinesin family member 3B 6.2

104271 C01687 Hs.7381 ATP synthase; H+ transporting; mitochon 4.2

104276 C02582 Hs.109253 ESTs; Highly similar to N-terminal acetyl 4.5

104307 D52818 Hs.111680 endosulfine alpha 4.7

104309 D55869 Hs.284123 Homo sapiens mRNA full length insert cD 4.2 

104370 H19376 Hs.21851 Homo sapiens mRNA; cDNA DKFZp586 6.4 

104446 L44497 Hs.7351 ESTs 4.9

104453 M19169 Hs.123114 cystatin SN 11.6

104476 N33807 Hs.324275 protease; serine; 15 5.6 

104558 R56678 Hs.88959 Human DNA sequence from clone 967N2 6.3

104592 R81003 Hs.325820 serine protease; umbilical endothelium 13.6

104634 AA004274 Hs.19151 ESTs 6.3

104636 AA004415 Hs.106106 ESTs 10.1 

104658 AA007145 Hs.27268 Homo sapiens mRNA; cDNA DKFZp564 4.3 

104667 AA007234 Hs.30098 ESTs 16.6 

104675 AA009596 Hs.301553 ESTs; Moderately similar to !!!! ALU SU 4.6

104767 AA025534 Hs.8852 ESTs 4.8

104785 AA027163 Hs.7942 ESTs 8.1 

104791 AA029046 Hs.301871 ESTs; Moderately similar to cAMP induc 10.9

104804 AA031357 Hs.31803 ESTs; Weakly similar to N-WASP [H.sap 5.5

104807 AA032147 Hs.23296 ESTs 10.4 

104837 AA039469 Hs.21126 ESTs; Weakly similar to KIAA0299 [H.s 4.6

104849 AA040270 Hs.241507 Homo sapiens mRNA; cDNA DKFZp564 4.3

104867 AA045481 Hs.225979 Human gene from PACs 37M17 and 305B 4.5 

104884 AA053021 Hs.14511 SCO (cytochrome oxidase deficient; yeast 4.7

104906 AA055809 Hs.26802 ESTs; Weakly similar to phosphoprotein f 8.8

104919 AA057193 Hs.25252 ESTs 5.5

104921 AA057839 Hs.1508 ESTs 4.2 

104926 AA058846 Hs.33363 DKFZP434N093 protein 7 

104938 AA064627 Hs.318725 ESTs; Highly similar to CGI-72 protein [H 7.1

104943 AA065217 Hs.114218 ESTs 5.7 

104957 AA074919 Hs.10026 ESTs; Weakly similar to ORF YJL063c [S 4.7

104961 AA076672 Hs.33905 ESTs 5.5 

104968 AA084602 Hs.29869 ESTs 4.3 

104975 AA086071 Hs.50758 chromosome-associated polypeptide C 8.3

104977 AA088228 Hs.18272 ESTs 6.2 

104978 AA088458 Hs.19322 ESTs 6.7

104987 AA101723 Hs.11861 ESTs 9.2 

105002 AA113266 Hs.182704 ESTs; Moderately similar to alternatively 6.9 
105012 AA116036 Hs.9329 chromosome 20 open reading frame 1

105019 AA121879 Hs.9280 proteasome (prosome; macropain) subunit

105029 AA126855 Hs.13268 ESTs 

105033 AA127964 Hs.274329 TP53 target gene 1 

105035 AA128486 Hs.8859 ESTs

105039 AA130349 Hs.36475 ESTs 

105062 AA134958 Hs.36529 ESTs 

105076 AA142858 Hs.37810 ESTs 

105087 AA147884 Hs.9812 ESTs 

105091 AA148859 Hs.179909 ESTs; Weakly similar to !!!! ALU SUBFA

105093 AA149051 Hs.32405 ESTs

105107 AA152302 Hs.25035 DKFZP566G223 protein 

105127 AA158132 Hs.301957 ESTs; Weakly similar to contains similari

105132 AA159501 Hs.247280 HBV associated factor 

105143 AA165333 Hs.24808 ESTs

105154 AA171736 Hs.35947 methyl-CpG binding domain protein 4

105162 AA176690 Hs.4084 KIAA1025 protein 

105186 AA191512 Hs.28005 Homo sapiens mRNA; cDNA DKFZp564 

105209 AA205072 Hs.227743 KIAA0980 protein 

105223 AA211388 Hs.7750 ESTs

105252 AA227428 Hs.9728 ESTs; Weakly similar to KIAA0512 prote

105253 AA227448 Hs.5003 KIAA0456 protein 
105261 AA227871 Hs.6361 MEK partner 1 
105263 AA227926 Hs.6682 ESTs 

105274 AA228122 Hs.281866 ATPase; H+ transporting; lysosomal (vacu

105297 AA233451 Hs.183858 transcriptional intermediary factor 1 

105309 AA233790 Hs.4104 ESTs; Weakly similar to cDNA EST yk38 

105312 AA233854 Hs.23348 S-phase kinase-associated protein 2 (p45)

105342 AA235286 Hs.157078 ESTs 

105376 AA236559 Hs.8768 ESTs; Weakly similar to !!!! ALU SUBFA

105386 AA236950 Hs.8115 ESTs

105397 AA242868 Hs.7395 ESTs; Weakly similar to house-keeping p

105399 AA243007 Hs.16420 ESTs; Highly similar to SH3 domain-bind

105400 AA243052 Hs.65648 RNA binding motif protein 8 
105404 AA243303 Hs.21187 ESTs

105409 AA243562 Hs.301855 ESTs 

105436 AA252172 Hs.237856 ESTs; Moderately similar to cAMP induc

105483 AA255874 Hs.23458 ESTs 

105493 AA256268 Hs.10283 ESTs 

105495 AA256317 Hs.28785 Homo sapiens mRNA; cDNA DKFZp586

105496 AA256323 Hs.301997 DKFZP434N126 protein 

105500 AA256485 Hs.222399 CGI-96 protein

105507 AA256678 Hs.226318 ESTs; Moderately similar to CCR4-associ

105538 AA258860 Hs.32597 ring finger protein (C3H2C3 type) 6 

105544 AA261954 Hs.24678 ESTs

105546 AA262032 Hs.268281 ESTs; Weakly similar to 62D9.a [D.melan

105549 AA262417 Hs.5415 ESTs 

105551 AA262477 Hs.25292 ribonuclease HI; large subunit

105560 AA262783 Hs.306915 ESTs 

105565 AA278302 Hs.18349 ESTs; Weakly similar to partial CDS [C.e

105566 AA278323 Hs.17481 Homo sapiens clone 24606 mRNA sequen 

105575 AA278717 Hs.12772 ESTs 

105584 AA279012 Hs.3454 ESTs; Weakly similar to KIAA0665 prote 

105596 AA279418 Hs.18490 ESTs 

105604 AA279787 Hs.15467 ESTs; Moderately similar to putative pho

105610 AA279991 Hs.99872 ESTs; Weakly similar to trithorax homolo 

105621 AA280865 Hs.6375 Homo sapiens mRNA; cDNA DKFZp564 

105627 AA281245 Hs.23317 ESTs 

105638 AA281599 Hs.247817 Homo sapiens mRNA for for histone H2B 

105645 AA282138 Hs.11325 ESTs

105650 AA282347 Hs.25635 ESTs; Highly similar to HSPC003 [H.sap 

105666 AA283930 Hs.34906 ESTs 

105674 AA284755 Hs.279789 CDW52 antigen (CAMPATH-1 antigen) 

105687 AA286809 Hs.28423 ESTs 

105700 AA287643 Hs.35254 ESTs; Weakly similar to hypothetical pro

105705 AA290767 Hs.101282 Homo sapiens mRNA; cDNA DKFZp434

105709 AA291268 Hs.26761 DKFZP586L0724 protein
105731 AA292711 Hs.29131 ESTs 6.4

105753 AA299789 Hs.110857 ESTs 7 

105774 AA348014 Hs.23412 ESTs 7.1 

105784 AA350771 Hs.17850 ESTs 13.4 

105791 AA358038 Hs.14358 SH3-binding domain glutamic acid-rich p 4.3

105807 AA393803 Hs.16889 ESTs; Moderately similar to COLLAGEN 5.3

105808 AA393808 Hs.286131 KIAA0438 gene product 4.1

105812 AA394126 Hs.20814 ESTs; Highly similar to CGI-27 protein [H 14.6 

105813 AA394140 Hs.18585 ESTs 4.9 
105819 AA397920 Hs.28783 Homo sapiens mRNA; cDNA DKFZp564 4.9

105870 AA399623 Hs.101067 ESTs 4.8

105874 AA400074 Hs.171118 ESTs 4 

105896 AA400999 Hs.7838 Human ring zinc-finger protein (ZNF127- 4.8 

105934 AA404248 Hs.16577 ESTs 5.2 

105935 AA404277 Hs.263727 ESTs; Weakly similar to bisphosphate 3'- 4

105966 AA406105 Hs.5344 adaptor-related protein complex 1; gamma 8.3 

105974 AA406321 Hs.6224 KIAA0895 protein 4.6 

105990 AA410336 Hs.29403 ESTs; Weakly similar to PROBABLE AT 4.5

105995 AA410510 Hs.5345 ESTs 4.9 

106000 AA410972 Hs.20726 ESTs 5.8

106007 AA411462 Hs.11042 ESTs; Weakly similar to Ve01 [H.sapiens 6.9

106016 AA411819 Hs.8164 KIAA0898 protein 5

106034 AA412473 Hs.14928 ESTs 6.6 

106042 AA412700 Hs.169895 ubiquitin-conjugating enzyme E2L 6 4.6

106057 AA417067 Hs.289074 ESTs 4.5

106065 AA417558 Hs.25206 ESTs 123 

106070 AA417761 Hs.5957 Homo sapiens clone 24416 mRNA sequen 5 

106103 AA421104 Hs.12094 ESTs 15.4 

106126 AA424006 Hs.22972 ESTs; Moderately similar to H5AR (M.m 6.4 

106154 AA425304 Hs.6994 ESTs 5.1

106157 AA425367 Hs.34892 ESTs 111 

106166 AA425872 Hs.19561 NADH dehydrogenase (ubiquinone) 1 alp 19.3 

106204 AA428024 Hs.21479 ESTs 47 

106210 AA428239 Hs.10338 ESTs 5.7 

106220 AA428582 Hs.32196 ESTs; Moderately similar to metargidin p 77

106236 AA429951 Hs.21104 ESTs 8 

106240 AA430074 Hs.18552 ESTs; Weakly similar to Yb218cp [S.cere 4.4 

106263 AA431462 Hs.28329 ESTs 4.9 

106288 AA435536 Hs.24336 ESTs 8.8 

106293 AA435591 Hs.301444 signal sequence receptor; gamma (transloc 8.7

106310 AA436244 Hs.17240 ESTs 4.5 

106317 AA436568 Hs.108124 ESTs 4 

106328 AA436705 Hs.28020 KIAA0766 gene product 4.4

106341 AA441798 Hs.5243 ESTs; Moderately similar to plL2 hypoth 23.7 

106348 AA442253 Hs.10702 ESTs 4.7

106350 AA442763 Hs.194698 cyclin B2 6.1

106371 AA443923 Hs.170310 ESTs 6.8 

106389 AA446949 Hs.6236 ESTs 4.7 

106394 AA447223 Hs.25320 Homo sapiens clone 25142 mRNA sequen 4.4 

106426 AA448282 Hs.16206 ESTs; Weakly similar to F55C12.5 [C.ele 4.5

106459 AA449741 Hs.4029 glioma-amplified sequence-41 4.8

106462 AA449912 Hs.30532 ESTs; Highly similar to CGI-77 protein [H 5.2

106468 AA450047 Hs.14770 ESTs 6.8

106479 AA450351 Hs.75251 ESTs 12.4 

106494 AA452108 Hs.18387 transcription factor AP-2 alpha (activating 4.5

106503 AA452411 Hs.29679 ESTs; Highly similar to mediator [H.sapie 5.1

106507 AA452584 Hs.267819 protein phosphatase 1; regulatory (inhibit) 4.9 

106533 AA453786 Hs.145998 ESTs 8.3 

106568 AA455970 Hs.28285 patched related protein translocated in ren 7.6

106586 AA456598 Hs.57787 ESTs 6.2

106589 AA456646 Hs.28661 ESTs 4.8 

106606 AA457730 Hs.283437 Homo sapiens clone 23851 mRNA sequen 4.4

106611 AA458904 Hs.26267 ESTs; Weakly similar to torsinA [H.sapie 7

106614 AA458934 Hs.256150 ESTs 4.5 

106628 AA459657 Hs.12311 Homo sapiens clone 23570 mRNA sequen 6.5

106637 AA459951 Hs.250824 ESTs 5.5 

106644 AA460239 Hs.12680 ESTs 4.4 
106664 AA460969 Hs.7510


mitogen-activated protein kinase kinase ki


8.4 


106698 AA463745 Hs.29403


ESTs; Weakly similar to PROBABLE AT


5.3 


106719 AA465171 Hs.236844


ESTs 


5.6 


106726 AA465339 Hs.3886 


ESTs 


10.1 


106747 AA476473 Hs.171957


triple functional domain (PTPRF interacti


10.4 


106759 AA477263 Hs 2*^84 


ESTs 


4.2 


106765 AA477717 Hs.306117


interleukin 13 receptor; alpha 1


6.9 


106784 AA478558 Hs.227913




5.1 


IUU0JI fWtUAUIH n9.£.?*twJ 




<; 1 


106S36 AA4A2112 Hs 238707 


ESTs 


4.8 


1flRA4ft AA4R2R4A HeRR34 
IUO0D0 Art** 00 loo no.^oo l£J
62


IfiRPKR AA4A797A He 1Q47Q 
IUO00O rtrtNOI C£X3 no. lirtf J 


F<?Te 


4.5 


1flfifl7R AA4AAA7? He 12314 

lUOOlO /VVtOQOlA TO. IcJIH 


Homo saofens mRMA- cDMA DKPZd586 


7.9 


IHRAAR AA4M1M He 24734 
iuoooo rt/vtojiui na.t*»io*t 




6.4 


IflRAQR AAAAQRRR He 7S94R 
tuoooo /vv+oovwj no.tJt.HJ 


PSTs 

CO 19 


4.6 


mKQAQ AAAQTftT'l He 9WY7A7 


ouwhj- i acuVaung enzyiuo suuunu i 


4 2 


inftQIQ AA/IOnQQC U« *J17Cfc 

luoyiy AA4yuooD ns.zi/D0 


cols 




i uoyzu Artiyuoyy ns.zyoo^o 


CCTo 
CO IS 


Q.C. 


1ARQA1 AAAQR90A He 9T7Q71 
lugoHi rtrtH jozuh no. 4.0/ o/i 


P<?Te 

CO I 9 


4 


AAAQ£^vt7 Vie H14A 
1Uv9h£ AA4a0j4i nS.jlOlH 


reunoDiasiorna-oinainQ proiein i 


4 A 


1AftQ/1fl AAj40£7flR U«s 01fV77 


Mrtrtuo j£ proiein 


H 


mCOCfl A A COACH Uc ORR11 


Co 1 5, vvsaKiy suniiar 10 nypuuicucai ho. l 


4 4 


10£G73 A ACACIA < Ue 410/71 
CO IS


R 7 


luoyyo M/voaMoi ns. iso**©** 


UldUUIInlAw yiUWUI taU-UI U1IUJU ty UlUlcUl 


18.7 


1fl7flflA AACQfl71fl We 7^7A7) 

i ur uuo nrtoyo t 1 u ns.^ jf w 


COIq 


fi2 


107098 AA5QQ914 He 94143 

lUfU&O rtrtJJ»JfclH no.tt IHQ 


CCTe 
CO 19 


4.1 


AC\7(W) AAC.QQA77 He OATMXQ 
1 Ut Uj J rtrtDw IHY . ns.o/ *t 1
SB
Col S
CO IS


A 
fi7
1071S1 AA621169 He RRA7


ESTs 
dkfzp5odc1o21 protein


7 0 

7.4 


1 07252 uoy?f 1 n3.£>9Z5 


cctv* 
colS 


7 0 


1072S5 T34327 nS.oU12u 


UDP-N-acetyl-alpha-D-galactosamine:po


0.0 


107299 T4UJ27 nS.oUbbl 


lung resistance-related protein


A 4 


107324 TolooS Hs.276422 


DKFZP585G 1 122 protein 


r.o 


107372 Uoop25 Hs.8297 


ribonuclease 6 precursor


4.7


107373 U85773 Hs.154695


phosphomannomutase 2


4.8 


107481 W58247 Hs^79766 


Homo sapiens kinesin superfamily motor


6.3 


107531 Y13936 Hs.17883 


protein phosphatase 1G (formerly 2C); ma 


8.3 


107859 AA024835 Hs.47584 


potassium voltage-gated channel; delayed 


7.3 


107890 AA026030 Hs.61311 


ESTs; Weakly similar to CALPAIN 2; LA 


7.3 


107908 AA026894 Hs.42826 


ESTs 


4.9 


108039 AA041341 Hs.46670 


ESTs 


5.4 


108040 AA041551 Hs.159971 


ESTs 


8.4 


108102 AA046424 Hs.49433 


ESTs; Weakly similar to HYPOTHETICA


6.6 


108217 AA058686 Hs.62568 


ESTs 


7.7 


108255 AA063157 Hs.172608 


ESTs 


4 


108358 AA071514 Hs.1634 


ESTs 


4 


108609 AA100694 Hs.69499 


Human DNA sequence from BAC 15E1 o


5.5 


108647 AA112396 Hs.44276


ESTs; Moderately similar to HOMEOBO


14.3 


108676 AA115562 Hs.274417 


Homo sapiens mRNA; cDNA DKFZp564


5.2 


108687 AA120785 Hs.54347 


ESTs 


5.6 


108695 AA121315 Hs.70823 


KIAA1077 protein 


10.5 


108733 AA126422 


zn84fU1 Stratagene lung carcinoma 9372 


4.4 
108774 AA128125 Hs.71040 ESTs; Moderately similar to CELL GROW 4.6

108828 AA131584 Hs.273344 DKFZP564O0463 protein 5.5

108872 AA134063 Hs.111680 ESTs 7.2 

108884 AA134958 Hs.293591 ESTs 11.3 

108893 AA135894 Hs.194691 retinoic acid induced 3 8.9

109008 AA156360 Hs.87128 ESTs 14.7 

109010 AA156460 Hs.44229 dual specificity phosphatase 12 4.9

109011 AA156542 Hs.72127 ESTs 4.6 
109042 AA159525 Hs.71779 Homo sapiens DNA from chromosome 19 7.2

109086 AA166695 Hs.270737 tumor necrosis factor (ligand) superfamily 4

109090 AA167006 Hs.70499 ESTs 5.9

109101 AA167708 Hs.52184 ESTs 4.2 

109112 AA169379 Hs.257924 ESTs 4 

109160 AA179387 Hs.301997 DKFZP434N126 protein 4 

109166 AA179845 Hs.73625 RAB6 interacting; kinesin-like (rabkinesin 13.6

109178 AA181600 Hs.283707 ESTs 11.8 

109179 AA181902 Hs.192789 ESTs; Weakly similar to !!!! ALU SUBFA 5.4
109261 AA195255 Hs.61779 ESTs 6.7
109270 AA195515 Hs.3585 ESTs; Weakly similar to alternatively spli 4.9

109277 AA196332 Hs.86043 ESTs 5.4

109313 AA206800 Hs.86276 ESTs; Moderately similar to zinc finger p 5.5

109415 AA227219 Hs.110826 trinucleotide repeat containing 9 20.1

109454 AA232255 Hs.295232 ESTs 4.7 

109467 AA232904 Hs.63187 ESTs 6.8 

109481 AA233342 Hs.289069 ESTs; Weakly similar to WD40 protein C 10.6

109508 AA233892 Hs.55902 ESTs; Weakly similar to !!!! ALU SUBFA 8

109514 AA234087 Hs.262346 ESTs; Weakly similar to ORF2: function 8.2 

109572 F02027 Hs.171937 ESTs 4.8 

109632 F04165 Hs.235873 ESTs; Weakly similar to K11C4.2 [C.eleg 5.2

109644 F04477 Hs.291531 ESTs; Moderately similar to GLYCERAL 6.6

109703 F09684 Hs.24792 ESTs; Weakly similar to ORF YOR283W 7.1

109726 F10009 Hs.9196 ESTs 5

109747 F10161 Hs.22969 ESTs 4.7

109799 F10770 Hs.180378 Homo sapiens clone 669 unknown mRNA 4.5

109814 F10979 Hs.153106 Homo sapiens clone 23728 mRNA sequen 8.7

110189 H20543 Hs.6278 DKFZP586B1621 protein 16.6

110240 H25577 Hs.176588 ESTs; Weakly similar to CYTOCHROME 6.2

110280 H29285 Hs.32468 ESTs 4.5 

110520 H56965 Hs.4082 yr09i06^1 Soares fetal liver spleen 1NFL 5.7 

110561 H59617 Hs.5199 ESTs; Weakly similar to UBIQUITIN-CO 19.5

110707 H95079 Hs.15617 ESTs; Weakly similar to !!!! ALU SUBFA 6.2

110734 H98714 Hs.24131 ESTs 30.2 

110770 N22262 Hs.131705 ESTs 5.8 

110780 N23174 Hs.22891 solute carrier family 7 (cationic amino aci 8.2

110787 N24716 Hs.12244 ESTs; Weakly similar to C44B9.1 [C.eleg 6.7

110794 N25262 Hs.27931 ESTs 5.9 

110799 N26101 Hs.323401 Human ring zinc-finger protein (ZNF127- 4 

110818 N29454 Hs.27552 ESTs; Weakly similar to putative p150 [H 4.3

110839 N30856 Hs.30246 solute carrier family 19 (thiamine transpo 12.8 

110844 N31952 Hs.167531 Homo sapiens mRNA full length insert cD 10.1

110854 N32919 Hs.27931 ESTs 4.7 

110856 N33063 ESTs; Weakly similar to S164 [H.sapiens 45

110860 N33438 Hs.170065 ESTs 125 

110897 N39148 Hs.6880 DKFZP434D156 protein 4

110915 N46252 Hs.29724 ESTs 23.2

110935 N48787 Hs.305979 protease inhibitor 1 (anti-elastase); alpha- 4.8

110970 N51374 Hs.96870 Homo sapiens mRNA full length insert cD 5.4

111006 N53375 Hs.166146 Homer; neuronal immediate early gene; 3 4.7

111008 N53388 Hs.7222 ESTs 133

111018 N54067 Hs.3628 mitogen-activated protein kinase kinase ki 5.7

111084 N59543 Hs.15456 PDZ domain containing 1 8.3 

111100 N62522 Hs.20450 ESTs 143 

111125 N63823 Hs.269115 ESTs 7.9

111132 N64378 Hs.83293 ESTs; Highly similar to unknown function 4.4 

111139 N64683 Hs390943 ESTs 6

111164 N66857 Hs.14808 ESTs; Weakly similar to !!!! ALU CLASS 4.1

111172 N67102 HS51851 Homo sapiens mRNA; cDNA DKFZp586 55 
111178 N67227 Hs.24633 ESTs

111179 N67239 Hs.10760 ESTs 

111181 N67278 Hs.171802 ESTs; Weakly similar to hypothetical pro 

111184 N67437 Hs.243901 Homo sapiens mRNA; cDNA DKFZp564

111221 N68869 Hs.15119 ESTs

111223 N68921 Hs.297939 ESTs; Weakly similar to neogenin [H.sap

111229 N69113 Hs.110855 ESTs 

111241 N69514 Hs.288880 ESTs; Weakly similar to CGW2 protein [ 

111268 N70481 Hs.26118 Homo sapiens clone 24766 mRNA sequen

111295 N73275 Hs.21275 ESTs; Weakly similar to ubiquitin-conjug

111299 N73808 Hs.24936 ESTs 

111336 N79565 Hs.29894 ESTs 

111357 N91023 Hs.87128 ESTs 

111370 N92915 Hs.94631 brefeldin A-inhibited guanine nucleotide-e

111806 R33468 Hs.279008 ESTs

111825 R35885 Hs.286148 stromal antigen 1 

111836 R36228 Hs.25119 ESTs 

111890 R38678 Hs.12365 ESTs 

111923 R39995 Hs.25925 Homo sapiens clone 23860 mRNA sequen 

111942 R40576 Hs.21590 ESTs

111987 R42036 Hs.6763 KIAA0942 protein 

112101 R44793 Hs.296341 adenylyl cyclase-associated protein 2

112134 R46025 Hs.7413 ESTs 

112197 R49482 Hs.5637 ESTs 

112244 R51309 Hs.70823 KIAA1077 protein

112253 R51618 Homo sapiens mRNA; cDNA DKFZp566

112305 R54822 Hs.26244 ESTs 

112449 R63802 Hs.124186 ring finger protein 2 

112483 R66534 Hs.285885 ESTs 

112519 R68631 Hs.11861 ESTs

112610 R79392 Hs.23643 ESTs 

112693 R88741 Hs.91065 ESTs; Moderately similar to proliferation 

112751 R93507 Hs.8207 ESTs 

112801 R97486 Hs.157160 protein kinase; DNA-activated; catalytic p 

112869 T03313 Hs.4747 dyskeratosis congenita 1; dyskerin

112871 T03352 Hs.12285 ESTs 

112908 T10065 Hs.3530 TLS-associated serine-arginine protein 

112966 T17119 Hs.102548 glucocorticoid receptor DNA binding fact 

112971 T17185 Hs.83883 ESTs 

112995 T23528 Hs.7155 ESTs; Weakly similar to TYKJ protein (M
113047 T25867 Hs.7549 ESTs

113075 T34660 Hs.6986 ESTs; Weakly similar to !!!! ALU SUBFA
113117 T47819 Hs.159153 ESTs

113206 T58044 Hs.241471 ESTs; Moderately similar to !!!! ALU SU
113248 T63857 yc16e1.s1 Stratagene lung (#93721) Homo

113260 T64896 Hs.287420 ESTs

113277 T65797 Hs.11774 protein (peptidyl-prolyl cis/trans isomeras 

113278 T65802 Hs.11135 yd 1M0.s1 Stratagene lung (#937210) Ho 
113440 T86121 Hs.191445 ESTs 

113523 T90037 Hs.95549 ESTs
113604 T92735 Hs.296083 ESTs 

113702 T97307 ESTs; Moderately similar to !!!! ALU SU

113783 W19222 Hs.7041 ESTs; Weakly similar to !!!! ALU SUBFA
113794 W37382 Hs.11090 ESTs
113808 W44735 Hs.9286 ESTs
113811 W44928 Hs.6994 ESTs 

113822 W47350 Hs.17466 retinoic acid receptor responder (tazaroten

113823 W47388 Hs.55099 rab6 GTPase activating protein (GAP and
113836 W56792 Hs.12040 ESTs; Weakly similar to KIAA0881 prote

113857 W65477 Hs.5297 Homo sapiens mRNA; cDNA DKFZp564
113886 W72471 Hs.23920 ESTs 
113895 W73738 Hs.12921 ESTs 

113923 W80763 Hs.3849 ESTs; Weakly similar to FK506-binding p
113931 W81205 Hs.3496 ESTs
113950 W85765 Hs.30504 Homo sapiens mRNA; cDNA DKFZp434
113970 W86748 Hs.8109 ESTs 
114051 W94942 Hs.177534 dual specificity phosphatase 10 
114057 W96222 Hs.34192 ESTs

114086 Z38266 Hs.288649 Homo sapiens PAC clone DJ0777O23 fro

114098 Z38347 Hs.118338 ESTs; Weakly similar to similar to S. cere

114109 Z38435 Hs.184108 ribosomal protein L21 

114124 Z38595 Hs.125019 ESTs; Highly similar to KIAA0886 prote

114138 Z38763 Hs.15740 amyloid beta (A4) precursor protein-bindi 

114149 Z38814 Hs.27195 ESTs 

114162 Z38909 Hs.22265 ESTs 

114177 Z39062 Hs.23740 ESTs

114196 Z39211 Hs.150926 fucose-1-phosphate guanylyltransferase

114208 Z39301 Hs.7859 ESTs 

114250 Z39897 Hs.13297 ESTs 

114251 Z39898 Hs^1948 ESTs 

114292 Z40715 Hs.184641 delta-6 fatty acid desaturase

114297 Z40758 Hs.173091 DKFZP434K151 protein

114334 Z41342 Hs.22941 ESTs 

114460 AA024604 Hs.26102 ESTs 

114471 AA028074 Hs.104613 ESTs 

114480 AA032243 Hs.151678 UDP-N-acetyl-alpha-D-galactosamine:po

114518 AA046407 Hs.108469 suppressor of var1 (S.cerevisiae) 3-like 1

114542 AA055768 Hs.293380 ESTs

114549 AA056484 Hs.292833 ESTs 

114652 AA101416 Hs.107149 ESTs; Weakly similar to PTB-ASSOCIAT

114673 AA113303 Hs.95583 transmembrane 4 superfamily member (to

114698 AA126951 Hs.110857 ESTs; Highly similar to putative DNA-dir

114767 AA148885 Hs.154443 minichromosome maintenance deficient (S

114799 AA159323 Hs.109929 ESTs

114804 AA160363 Hs.269956 ESTs 

114811 AA161161 Hs.95907 multiple inositol polyphosphate phosphata 

114821 AA165313 Hs.55468 ESTs

114852 AA235035 Hs.38260 ESTs; Moderately similar to ubiquitin spe

114901 AA236276 Hs.196437 ESTs; Weakly similar to R26660 J; parti 

114902 AA236359 Hs.39504 ESTs 
114940 AA243012 Hs.75928 ESTs 

114965 AA250737 Hs.72472 ESTs

115047 AA252627 Hs.82916 homeobox B5

115054 AA252863 Hs.87729 ESTs 

115061 AA253217 Hs.41271 ESTs 

115082 AA255557 Hs.198269 NADH dehydrogenase (ubiquinone) 1 alp

115116 AA256486 Hs.62275 ESTs

115140 AA258030 Hs.279938 ESTs; Weakly similar to supported by GE 

115205 AA262470 Hs.284216 ESTs 

115206 AA262491 Hs.186572 ESTs 

115239 AA278650 Hs.73291 ESTs; Weakly similar to similar to the bet 

115242 AA278755 Hs.283732 ESTs

115249 AA278961 Hs.71124 ESTs 

115259 AA279071 Hs.13453 splicing factor 3b; subunit 1; 155kD

115285 AA279799 Hs.293736 ESTs 

115291 AA279943 Hs.122579 ESTs 

115357 AA281793 Hs.72988 ESTs

115377 AA282247 Hs.193063 ESTs 

115400 AA283198 Hs.89113 ESTs 

115439 AA284561 Hs.193090 ESTs 

115471 AA287138 Hs.59346 ESTs; Weakly similar to ASPARTYL-TR 

115506 AA292537 Hs.45207 Human DNA sequence from clone 620E1

115522 AA331393 Hs.47378 ESTs

115572 AA398392 Hs.59594 ESTs; Weakly similar to F33G12.3 gene p

115587 AA399264 Hs.283037 ESTs; Highly similar to HSPC039 protein 

115600 AA400247 Hs.42173 ESTs 
115612 AA400948 Hs.71243 ESTs; Weakly similar to zinc finger prote
115646 AA404352 Hs.305971 ESTs
115652 AA405098 Hs.38178 ESTs

115657 AA405620 Hs.55158 ESTs; Weakly similar to weak similarity t 

115658 AA405625 Hs.183056 Human DNA sequence from clone 34B21 
115675 AA406546 Hs.82065 Homo sapiens mRNA; cDNA DKFZp564

115721 AA417102 Hs.50960 ESTs
115763 AA421560 ESTs
115764 AA421562 Hs.91011 anterior gradient 2 (Xenopus laevis) homo

115835 AA428576 Hs.41371 ESTs 

115844 AA430124 Hs.7773 ESTs

115875 AA433943 Hs.43946 ESTs; Weakly similar to Weak similarity

115888 AA435839 Hs.76591 KIAA0887 protein

115922 AA441911 Hs.71869 ESTs; Weakly similar to KIAA0926 prote

115941 AA443602 Hs.46679 ESTs 

115947 AA443793 Hs.94761 ESTs 

115948 AA443793 Hs.43445 poly(A)-specific ribonuclease (deadenylat
115951 AA443918 Hs.301048 cofilin 1 (non-muscle)

115967 AA446887 Hs.42911 ESTs 

115984 AA447687 Hs.91109 ESTs 

116009 AA449448 Hs.44238 ESTs

116024 AA451748 Hs.83883 Human DNA sequence from clone 718J7

116028 AA452112 Hs.42644 thioredoxin-like

116050 AA453656 Hs.88417 ESTs 

116097 AA456099 Hs.176376 ESTs 

116108 AA457566 Hs.38777 ESTs

116121 AA459254 Hs.48855 ESTs 

116127 AA459703 Hs.279884 v-myc avian myelocytomatosis viral onco

116129 AA459956 Hs.49163 ESTs; Highly similar to putative ribonucle 

116142 AA460649 Hs.39457 ESTs 

116204 AA465701 Hs.108646 ESTs 

116221 AA478397 Hs.50180 ESTs 

116222 AA478415 Hs.89986 ESTs

116238 AA479362 Hs.47144 DKFZP586N0819 protein

116246 AA479961 Hs.250646 ESTs; Highly similar to ubiquitin-conjuga 

116249 AA480886 Hs.86693 ESTs 

116250 AA480975 Hs.44829 ESTs
116254 AA481146 Hs.41086 ESTs; Weakly similar to OXYSTEROL-B

116256 AA481256 Hs.88201 ESTs; Weakly similar to lysophospholipa

116264 AA482594 Hs.272239 Homo sapiens mRNA; cDNA DKFZp586

116265 AA482595 Hs.55189 ESTs; Weakly similar to F25B5.3 [C.eleg
116282 AA486550 Hs.204501 ESTs; Weakly similar to Wiskott-Aldrich 

116298 AA489046 Hs.94109 ESTs

116300 AA489194 Hs.159471 ESTs; Weakly similar to snRNP protein B 

116327 AA490959 Hs.28005 Homo sapiens mRNA; cDNA DKFZp564 

116334 AA491457 Hs.48948 ESTs 

116337 AA496127 Hs.44070 ESTs 

116351 AA504116 Hs.82501 Homo sapiens mRNA; cDNA DKFZp434

116357 AA504806 Hs.90797 Homo sapiens clone 23620 mRNA sequen

116415 AA609204 Hs.27973 KIAA0874 protein

116443 AA620313 Hs.190488 ESTs; Weakly similar to KERATIN; TYP 

116470 C13992 Hs.83484 ESTs 

116480 C14088 glyceraldehyde-3-phosphate dehydrogena

116578 D51272 Hs.75337 nucleolar phosphoprotein p130 

116579 D51276 Hs.81915 leukemia-associated phosphoprotein p18
116626 F02028 Hs.81907 ESTs 

116647 F03089 Hs.15395 ESTs; Weakly similar to ARGINYL-TRN

116674 F04816 Hs.92127 ESTs

116680 F08813 Hs.273629 LINE retrotransposable element 1 

116700 F09983 Hs.317589 ESTs 

116724 F13665 Hs.65641 ESTs 

116726 F13681 Hs.53913 ESTs 

116732 F13779 Hs.165909 ESTs

116734 F13789 Hs.93796 DKFZP586D2223 protein 

116760 H11054 Hs.155342 protein kinase C; delta 

116780 H22566 Hs.30098 ESTs 

116786 H25836 Hs.301527 tumor necrosis factor (ligand) superfamily

116787 H28581 Hs.15641 ESTs

116790 H29532 Hs.101174 microtubule-associated protein tau

116803 H47357 Hs.109701 ESTs; Moderately similar to weak similar 

116877 H68116 Hs.168732 ESTs 

116921 H72948 Hs.821 biglycan

117216 N20083 Hs.42792 ESTs

117232 N20579 Hs.61153 ESTs 

117284 N22162 Hs.183779 ESTs; Weakly similar to cDNA EST yk33
117344
117367 
117392 
117394 
117412 
117498 
117557 
117634 
117639 
117754 
117852 
117879 
117924 
117950 
117992 
118138
118215 
118229 
118265 
118336 
118363 
118429 
118470 
118472 
118475 



118528 
118542 
118600 
118695 
118698 
118901 
118952 
118976 
118986 
118989 
119027 
119042 
119075 
119260 
119271 
119298 
119302 
119341 
119495 
119580 
119602 
119620 
119676 
119717 
119729 
119805 
119859 
119867 
119873 
119899 
119940 
119943 
119970 
120131 
120150 
120206 
120241 
120255 
120314 
120325 
120352 



N24046 
N24954 
N26175 
N26257 
N26722 
N31726 
N33920 
N36421 
N36923 
N47469 
N49408 
N50050 
N51056 
N51394 
N52000 
N57773 
N62195 
N62339 
N62827 
N63604 
N64168 
N66158 
N66769 
N66818
N66845 
N67149 
N67889 
N68010 
N69222
N71781 
N72113 
N90719 
N92956
N93629 
N94362 
N94439 
N99256 
R05316 
R36451 
T15916 
T16387 
T23820 
T25725 
T62571 
W35390 
W42451 
W46286
W47620 
W60473 
W69134 
W69747 
W73788 
W80702 
W80852 
W81129 
W84767 
W86779 
W86835 
W87812 



Hs.210706 ESTs 
Hs.42502 ESTs 
Hs.93405 ESTs 
Hs^9871 KIAA0727 protein 
Hs.42645 ESTs 

Hs.44268 ESTs; Highly similar to myelin gene expr
Hs.44532 diubiquitin

Hs.13323 ESTs; Weakly similar to SODIUM- AND

Hs.44833 ESTs

Hs.59757 ESTs

Hs.136102 KIAA0853 protein

Hs.303025 ESTs; Weakly similar to keratin; 67K typ

Hs.38891 ESTs

Hs.75478 KIAA0956 protein

Hs.172089 Homo sapiens mRNA; cDNA DKFZp586

Hs.93560 ESTs; Weakly similar to trg [R.norvegicu

Hs.77910 3-hydroxy-3-methylglutaryl-Coenzyme A

Hs.166254 heat shock 90kD protein 1; alpha 

Hs.48645 EST 

Hs.47166 ESTs 

Hs.48938 ESTs

Hs.74649 ESTs 

Hs.291033 ESTs 

Hs.42179 ESTs 

ESTs; Weakly similar to !!!! ALU CLASS
Hs.50115 ESTs 
Hs.49397 ESTs 
Hs.49427 ESTs 

ESTs 

Hs.50081 Homo sapiens mRNA full length insert cD
Hs.50187 ESTs 
Hs.94445 ESTs 

ESTs; Highly similar to CGI-90 protein [H 
Hs.93391 ESTs 
Hs.125830 ESTs 
Hs.45105 ESTs 
Hs.114611 ESTs 
Hs.5472 ESTs
Hs.287820 fibronectin 1

Hs.102950 ESTs; Highly similar to coat protein gamm
Hs.65328 ESTs
Hs.155478 cyclin T2
ESTs

microtubule-associated protein 7
ESTs

high-mobility group protein 2-like 1
ESTs; Weakly similar to ZK1058.5 (Cele



Z39549 
Z40805 
Z41815 
AA169752 
AA194168 
AA195651 
AA211400 



Hs.146388 
Hs.55533 
Hs.92260 
Hs.233694 
Hs.56009 
Hs.57787 
Hs.57987 
Hs.94806 
Hs.43213 
Hs.58461 
Hs.250696 
Hs.44865 
Hs.58698 
Hs.272531 
Hs.14158 
Hs.93581 
Hs.75887 
Hs.153746 
Hs.91668 
Hs.65946 
Hs.5672 
Hs.221040 
Hs.104106 
Hs.193172 



ESTs 
ESTs 

KIAA1062 protein

ESTs 

ESTs 

KDEL (Lys-Asp-Glu-Leu) endoplasmic re 
Homo sapiens mRNA; cDNA DKFZp586 
ESTs 

DKFZP586B0319 protein 
copine III

Homo sapiens mRNA; cDNA DKFZp586

coatomer protein complex; subunit alpha 

ESTs 

ESTs 

ESTs 

ESTs; Weakly similar to Similarity to Yea

KIAA1038 protein

ESTs 

ESTs 

120428 AA236822 Hs.173694 KIAA1097 protein 5.6
120524 AA261852 Hs.192905 ESTs 5.6
120528 AA262107 Hs.104413 ESTs 4.5
120571 AA280738 Hs.34892 ESTs 4.9
120649 AA287115 Hs.192843 ESTs 4.5
120655 AA287347 Hs.238205 ESTs 6.7
120668 AA287833 Hs.292913 ESTs 8.3
120712 AA292654 Hs.102506 eukaryotic translation initiation factor 2 a) 4.6
120713 AA292655 Hs.96557 ESTs 10.6
120724 AA293470 Hs.100747 ESTs 5.4
120873 AA358015 EST 7.1
120885 AA365515 Hs.301872 ESTs; Moderately similar to !!!! ALU SU 4.6
120919 AA381125 Hs.301444 ESTs 8.2
120948 AA397822 Hs.104650 ESTs; Highly similar to similar to mago n 8.6
120969 AA398116 Hs.129206 casein kinase 1; gamma 3 10.5
120977 AA398155 Hs.97600 ESTs 10.9
121103 AA398936 Hs.97697 EST 7.4
121291 AA401753 Hs.8186 lung cancer candidate 5.3
121320 AA403008 Hs.301927 T-cell receptor; alpha (V;D;J;C) 13.5
121463 AA411745 Hs.239681 ESTs; Weakly similar to KIAA0554 prote 8.9
121596 AA416740 Hs.174104 ESTs 22.6
121723 AA419622 Hs.104800 ESTs; Weakly similar to Mouse 19.5 mRN 8
121748 AA421171 Hs.234545 ESTs 5.6
122125 AA434411 Hs.98806 ESTs 5.3
122522 AA449444 Hs.98969 ESTs 4
122655 AA454756 Hs.97837 ESTs 4
122704 AA456326 Hs.99445 ESTs 6.2
122782 AA459894 Hs.99472 ESTs 5.3
122856 AA463740 Hs.75367 Src-like-adapter 13.1
122882 AA465381 Hs.108812 ESTs; Weakly similar to B0041.5 [C.eleg 5.5
122928 AA476578 Hs.101840 ESTs 6.3
122974 AA478625 Hs.194215 ESTs 6.
122997 AA479295 Hs.106290 Kelch motif containing protein 12.5
123016 AA480103 Hs.323231 ESTs; Weakly similar to alternatively spli 4.4
123107 AA486071 Hs.104207 ESTs 8.3
123111 AA486273 Hs.191721 ESTs 4.2
123114 AA486407 Hs.129928 ESTs; Moderately similar to KIAA0454 p 5.2
123136 AA487449 Hs.194024 ESTs 4.2
123137 AA487468 Hs.100686 ESTs; Weakly similar to secreted cement 14.6
123169 AA488692 ESTs; Weakly similar to Gag-Pol polypro 4.5
123176 AA489020 Hs.69233 ESTs 5.2
123338 AA504249 Hs.187585 ESTs 4
123436 AA598714 Hs.223014 protease; serine; 15 7.3
123442 AA598803 Hs.111496 ESTs 5.9
123449 AA598899 Hs.112493 Homo sapiens mRNA; cDNA DKFZp564 4.1
123494 AA599786 Hs.112110 ESTs 4*
123503 AA600121 Hs.293156 ESTs 12.8






ESTs; Weakly similar to !!!! ALU SUBFA


7 Q 


123619 AA609200 ESTs 93 1
123673 AA609471 Hs.158549 ESTs 6.6
123729 AA609778 Hs.278672 membrane component; chromosome 11; s 4.7
123819 AA620636 Hs.112264 ESTs 4
123960 AA621785 Hs.287733 methylmalonate-semialdehyde dehydroge 7.6
124000 D57317 Hs.74861 activated RNA polymerase II transcription 4.4
124006 D60302 Hs.270016 ESTs 20.6
124012 D80240 Hs.241471 HUM5G11A Human fetal brain (TFujiwa 6.7
124021 F02859 Hs.13974 ESTs 4.7
124049 F10523 Hs.74519 primase; polypeptide 2A (58kD) 4.7
124059 F13673 Hs.283713 ESTs 7.7
124243 H66710 Hs.133525 ESTs 5.5
124308 H93575 Hs.241507 Homo sapiens mRNA; cDNA DKFZp564 11.4
124314 H94877 Hs.215766 GTP-binding protein 13.7
124315 H94892 Hs.288757 v-ral simian leukemia viral oncogene hom 14
124350 N21359 Hs.101282 Homo sapiens mRNA; cDNA DKFZp434 8.6
124352 N21626 Hs.102406 ESTs 7.2
124357 N22401 yw37g07.s1 Morton Fetal Cochlea Homo 52
124390 N29325 Hs.7535 ESTs; Highly similar to COBW-like place 7.9
24438 N40188 Hs.11090 ESTs

24447 N48000 Homo sapiens mRNA; cDNA DKFZp586

24457 N50114 Hs.266175 ESTs

124539 N53172 Hs.146409 cell division cycle 42 (GTP-binding prote

526 N74604 Hs.11090 ESTs

24632 N79515 Hs.306117 interleukin 13 receptor; alpha 1

24644 N91279 Hs.109654 ESTs; Moderately similar to outer membr

24676 R01037 Hs.181013 phosphoglycerate mutase 1 (brain)

24677 R01073 ESTs; Weakly similar to !!!! ALU CLASS
24724 R12405 Hs.112423 Homo sapiens mRNA; cDNA DKFZp586
24773 R40923 Hs.106604 ESTs

24777 R41933 ESTs

24792 R44357 Hs.48712 ESTs; Weakly similar to cDNA EST EMB

24857 R63652 Hs.137190 ESTs

24911 R88992 Hs.180612 ESTs

24955 T10598 Hs.324841 ESTs; Weakly similar to !!!! ALU SUBFA

24958 T11134 Hs.431 murine leukemia viral (bmi-1) oncogene h

25038 T78089 Hs.270134 ESTs

25092 T92544 Hs.137548 CD84 antigen (leukocyte antigen)

25132 W15495 Hs.129781 chromosome 21 open reading frame 5 

25144 W37999 Hs.24336 ESTs 

25154 W38419 ESTs 

25243 W86423 Hs.105413 ESTs 

25279 W93640 Hs.4779 ESTs; Moderately similar to similar to AD 

25299 Z39436 Hs.102720 ESTs 

25303 Z39821 Hs.288193 ESTs 

25304 Z39833 Hs.124940 GTP-binding protein

25474 AA151216 Hs.75103 tyrosine 3-monooxygenase/tryptophan 5-m 

25509 AA044232 Hs.288967 ESTs 

25580 AA126504 Hs.267812 sorting nexin 4 

25582 AA507383 Hs.74649 cytochrome c oxidase subunit Vic 

25670 AI432621 Hs.82685 CD47 antigen (Rh-related antigen; integri

25698 AA748483 Hs.191356 general transcription factor IIH; polypepti

25745 AI283493 Hs.75722 ribophorin II

25852 H09290 Hs.76550 Homo sapiens mRNA; cDNA DKFZp564

25972 AA434562 Hs.35406 ESTs

26160 N90960 Hs.265398 ESTs; Weakly similar to transformatiorK

26257 N99638 Hs.124084 tumor necrosis factor receptor superfamily

26337 AI066486 Hs.40500 similar to S. cerevisiae RER1

26405 U46278 Hs.122489 ESTs 

26537 W40262 Hs.146310 ESTs; Weakly similar to putative p150 [H 

90 W78968 Hs.181307 H3 histone; family 3A 

26712 AA205862 Hs.7942 ESTs 

26721 T72569 Hs.125359 Thy-1 cell surface antigen 

26764 AI334393 Hs.102178 ESTs

26804 AI203334 Hs.160628 ESTs

26819 AA305536 Hs.279607 ESTs

26877 AI052047 Hs.26102 ESTs

26991 R31652 Hs.821 biglycan

27479 AA513722 Hs.179729 collagen; type X; alpha 1 (Schmid metaph

27514 AA826926 Hs.204214 ESTs

27663 W07286 Hs.10340 ESTs; Weakly similar to weak similarity t

27677 AA916752 Hs.264190 ESTs; Highly similar to MEM3 [M.muscu

27814 AA761755 Hs.136713 ESTs; Weakly similar to V4-1 [H.sapiens

27997 AI281549 Hs.311054 ESTs

28092 AA904617 Hs.166229 ESTs

28218 H02682 Hs.292154 ESTs; Moderately similar to recombinatio

28466 D59653 Hs.241471 EST

28482 U83908 Hs.296251 programmed cell death 4

28517 AA280617 Hs.100861 ESTs; Weakly similar to p60 katanin [H.s

28530 AA504343 Hs.183475 Homo sapiens clone 25061 mRNA sequen

28559 AA226801 Hs.101448 metastasis associated 1

28574 AA412048 Hs.38260 keratin 8

95 U31875 Hs.152677 short-chain alcohol dehydrogenase family

28610 L38608 Hs.10247 activated leucocyte cell adhesion molecule

29 AA399187 Hs.102708 DKFZP434A043 protein

28649 AA142853 Hs.103106 Homo sapiens mRNA for G7b protein (G 
4.9

4.7 

4.4 

12.6 

4.1 

14.8 

6.7 

4.8 

5.3 

6.6 

5.8 

12.2 

102 

6.8 

8 

5.4 
4.1 
11.5 
4 

9.4

62 

25.9 

4.1 

16.4 

9.5 
128651 AA446990 Hs.103135 ESTs 6.1
128653 R48943 Hs.10315 solute carrier family 7 (cationic amino aci 4.4
128656 AA458542 Hs.10326 coatomer protein complex; subunit epsilon 14.3
128717 T30617 Hs.104222 Homo sapiens mRNA; cDNA DKFZp566 245
128727 M64174 Hs.50651 Janus kinase 1 (a protein tyrosine kinase) 7.3
128764 N49308 Hs.104938 ESTs; Weakly similar to alpha 1(XVIII) c 9.2
128781 X85372 Hs.105465 small nuclear ribonucleoprotein polypepti 5.4
128793 W93562 Hs.105749 KIAA0553 protein 4.6
128835 W15528 Hs.106390 Homo sapiens mRNA; cDNA DKFZp586 4
128845 AA455638 Hs.10649 basement membrane-induced gene 6.9
128871 AA400271 Hs.106778 Homo sapiens mRNA for putative Ca2+4 4.5
128922 AA252023 Hs.9589 ESTs; Weakly similar to HRIHFB2157 [H 6.4
128925 D61676 Hs.21851 Homo sapiens mRNA; cDNA DKFZp586 6.4
128938 AA410325 Hs.107260 ESTs 7
128946 N29353 Hs.107318 kynurenine 3-monooxygenase (kynurenin 5.2
128948 AA485655 Hs.223025 proteasome (prosome; macropain) subunit 13.1
128955 F10290 Hs.185807 Homo sapiens clone 24758 mRNA sequen 5.8
129005 AA460049 Hs.13323 ESTs; Weakly similar to SODIUM- AND 12.6
129009 AA131421 Hs.75607 ESTs 9.8
129017 H13108 Hs.107968 ESTs 13.9
129057 X62466 Hs.276770 CDW52 antigen (CAMPATH-1 antigen) 10.7
129075 AA129465 Hs.83765 ESTs 4.7
129095 L12350 Hs.108623 thrombospondin 2 4.4
129124 AA234530 Hs.108802 N-ethylmaleimide-sensitive factor 20.7
129160 AA131252 Hs.109007 ESTs 5.9
129164 AA282183 Hs.109045 ESTs 5.8
129180 R40556 Hs.318401 ESTs; Highly similar to HSPC039 protein 7.6
129224 X89109 Hs.109606 coronin; actin-binding protein; 1A 12
129229 AA211941 Hs.109643 polyadenylate binding protein-interacting 7.9
129240 W24360 Hs.237868 interleukin 7 receptor 5.3
129241 AA435665 Hs.109706 ESTs; Moderately similar to HN1 (M.mus 8.4
129243 H88033 Hs.109727 KIAA0733 protein 7.8
129247 AA151574 Hs.109733 pUirv&ke transcription factor 6.4
129259 AA090695 Hs.181385 ESTs 6.2
129270 Z35227 Hs.109918 ras homolog gene family; member H 5.4
129281 AA026318 Hs.289101 glucose regulated protein; 58kD 4.4
129300 C20976 Hs.110165 ESTs; Highly similar to ribosomal protein 5.7
129316 N93155 Hs.285976 calmodulin 1 (phosphorylase kinase; delta 7.7
129319 AA037467 Hs.30340 ESTs 6
129351 AA167268 Hs.62349 Human ras inhibitor mRNA; 3' end 9.3
129366 H18027 Hs.184697 plexind 18.2
129383 W92984 Hs.288224 ESTs 5.9
129388 AA151621 Hs.110964 ESTs 4.1
129391 T80814 Hs.11101 discs; large (Drosophila) homolog 3 (neur 105
129404 AA172056 Hs.317584 ESTs 5.3
129406 N23707 Hs.111138 KIAA0712 gene product 4
129426 AA412087 Hs.111323 EST; Highly similar to protein inhibitor o 8
129453 AA421213 Hs.111632 Lsm3 protein 55
129513 C00225 Hs.306163 ESTs; Weakly similar to fos39554_1 [H.s 5.5
129519 AA298786 Hs.112242 ESTs 6.8
129606 R21443 Hs.166254 heat shock 90kD protein 1; alpha 5
129622 AA278243 Hs.323949 ESTs 6.8
129626 AA447410 Hs.111334 ESTs; Weakly similar to !!!! ALU SUBFA 5.1
129627 AA258308 Hs.71968 Homo sapiens mRNA; cDNA DKFZp564 5.3
129628 U26727 Hs.1174 cyclin-dependent kinase inhibitor 2A (mel 8.2
129642 R50008 Hs.11806 7-dehydrocholesterol reductase 4.3
129663 AA442768 Hs.11866 translocase of inner mitochondrial membr 4.4
129665 M88458 Hs.118778 KDEL (Lys-Asp-Glu-Leu) endoplasmic re 4
129691 X06700 Hs.119571 collagen; type III; alpha 1 (Ehlers-Danlos 6
129783 AA454618 Hs.12479 associated molecule with the SH3 domain 6.4
129800 AA252436 Hs.12540 lysophospholipase I 7.7
129836 AA452161 Hs.206521 YME1 (S.cerevisiae)-like 1 5
129850 N20593 Hs.288932 GDP dissociation inhibitor 2 6.9
129869 AA102520 Hs.13015 ESTs; Weakly similar to heat shock prote 5
129896 AA043021 Hs.13225 UDP-Gal:betaGlcNAc beta 1;4-galactosy 6.6
129982 M87789 immunoglobulin gamma 3 (Gm marker) 4
129985 AA450045 Hs.140452 cargo selection protein (mannose 6 phosp 5.8






WO 02/059377 



130029 AA236412 
130033 M90696 
130036 AA195260 
130069 AA055896 
130077 T24055 
130080 X14850 
130096 AA223874 
130114 AA234717 
130125 M36803 
130135 M61764 
130170 AA610070 
130189 D43947 
130208 AA620556 
130211 D50840 
130235 X14046 
130276 S75295 
130280 L13738 

130313 AA620323

130314 D86967 
130328 AA135673 
130356 X84373 
130367 Z38501 
130378 T47333 
130384 X66364 
130393 D13630 
130399 AA449417 
130407 N29888 
130414 M21121 
130417 U58522 
130421 D21260 
130441 U35835 
130455 X17059 

130498 L38951 

130499 AA416723 
130511 L32137 
130553 AA430032 
130558 H96654 
130568 AA232535 
130583 W24957 
130585 H66211 
130604 X03635 
130614 AA132007
130619 AA477739 
130622 AA235247 
130625 F03969 
130627 L23808 
130629 M60346 
130635 M87503 
130639 D59711
130677 H17861 
130681 D82808 
130693 AA487202 
130703 N63295 
130706 AA488843 
130712 AA292066

130714 X92896 

130715 T98227 
130744 AA203527 
130747 AA471293 
130751 AA435633 
130796 R39390 
130800 AA223386 
130855 AA425439 
130859 AA287327 
130866 M58028 
130880 D14678 
130891 D31891 



Hs.236510 
Hs.181301 
Hs.125849 
Hs.146428 
Hs.91379 
Hs.147097 
Hs.197955 
Hs.14992 
Hs.1504 
Hs.21635 
Hs.151469 
Hs.151761 
Hs.15250 
Hs.23703 
Hs.153053 
Hs.169149 
Hs.153937
Hs.154320 
Hs.154332 
Hs.154668 
Hs.155017 
Hs.8768 
Hs.155168 
Hs.166071 
Hs.155291 
Hs.155356 
Hs.155410 
Hs.241392 
Hs.155485 
Hs.178710 
Hs.155637 
Hs.155956 
Hs.180446 
Hs.158286 
Hs.1584 
Hs.252587 
Hs.15984 
Hs.16085 
Hs.293907 
Hs.16331 
Hs.1657 
Hs.16697 
Hs.12532 
Hs.16846 
Hs.260720 
Hs.1695 
Hs.1697 
Hs.1706 
Hs.17132 
Hs.17767 
Hs.17820 
Hs.17962 
Hs.18103 
Hs.201673 
Hs.279762 
Hs.18212 
Hs.171952 
Hs.18747 
Hs.6879 
Hs.18879 
Hs.19525 
Hs.19574 
Hs.143323 
Hs.20476 
Hs^055 
Hs.20d30 
Hs.20991 



ESTs; Moderately similar to PFT27 [M.m
cathepsin S

ESTs; Moderately similar to ALU SU

collagen; type V; alpha 1

ribosomal protein L26

H2A histone family; member X

KIAA0704 protein

ESTs

hemopexin

tubulin; gamma 1

calcium/calmodulin-dependent serine pro
KIAA0100 gene product
peroxisomal D3;D2-enoyl-CoA isomerase
UDP-glucose ceramide glucosyltransferas
CD37 antigen

karyopherin alpha 1 (importin alpha 5)
activated p21cdc42Hs kinase
ubiquitin-activating enzyme E1C (homolo
KIAA0212 gene product
KIAA0391 gene product



iiumcw ivwk^wi r 

ESTs; Weakly similar to !!!! ALU SUBFA

TATA box binding protein (TBP)-associa

cyclin-dependent kinase 5

KIAA0005 gene product

Homo sapiens mRNA for putative glucosy

ESTs

small inducible cytokine A5 (RANTES)
huntingtin-interacting protein 2
clathrin; heavy polypeptide-like 2
protein kinase; DNA-activated; catalytic p
N-acetyltransferase 1 (arylamine N-acetyl
karyopherin (importin) beta 1
Homo sapiens mRNA for KIAA0446 prot
cartilage oligomeric matrix protein (pseud
pituitary tumor-transforming 1
ESTs; Weakly similar to gene pp21 protei
ESTs; Highly similar to CGM3 protein [H
ESTs; Moderately similar to similar to C.e
ESTs



ESTs
ESTs

ESTs; Weakly similar to cytochrome P45
matrix metalloproteinase 2 (gelatinase A;
matrix metalloproteinase 12 (macrophage
ATPase; H+ transporting; lysosomal (vacu
interferon-stimulated transcription factor
ESTs
ESTs

Rho-associated; coiled-coil containing pro

ESTs

ESTs

cornichon-like
adenylate cyclase 7

DNA segment on chromosome X (unique)
occludin

POP7 (processing of precursor S. cerevis
ESTs

Homo sapiens clone 23965 mRNA sequen
ESTs

ESTs; Weakly similar to katanin p80 subu

putative DNA/chromatin binding motif

ceroid-lipofuscinosis; neuronal 2; late infa

ubiquitin-activating enzyme E1 (A1S9T a

kinesin-like 2

SET domain; bifurcated; 1 

130905 
130913 
130919 
130921 
130944 
130974 
130987 
130999 
131010 
131046 
131091 
131153 
131185 
131200 
131206 
131210 
131227 
131244 
131245 
131257 
131319 
131339 
131388 
131410 
131472 
131475 
131501 
131514 
131524 
131528 
131544 
131557 
131562 
131564 
131586 
131587 
131589 
131615 
131664 
131679 
131684 
131687 
131689 
131693 
131710 
131716 
131742 
131762 
131781 
131795 
131809 
131814 
131838 
131877 
131885 
131891 
131925 
131930 
131941 
131965 
131970 
131971 
131974 
131977 
131994 
131997 
132017 



AA056489 
W03592 
AA291710 
AA074596 
M97935 
X57985 
R45698 
N48963 
AA435748 
X02530 
T35341 
H11760 
M25753 
AA609427 
AA044078
AA430047 
AA429472 
D38076
AA620599 
AA256042 
U25997 
AA463450
R34531 
H84658 
AA608962 
Z39053 
AA121127 
X02152 
N39152 
D60856 
N33236 
D30946 
U90551 
AA491465 
AA235385 
M15182 
U52100 
D14533 
AA136126 
AA136660 
U26174 
L11066 
AA599653 
W60913 
AA233225 
D49738 
D31352
H46831 
AA460450 
N32724 
176517 
AA437226 
AA091932 



AA044095 
AA158258 
AA248470 
AA205460 
D62657
W90146 
D86960
R70167 
AA410424 
F09788 
AA479515 
D82399 
W67251 



Hs.129998 
Hs.21198 
Hs.21276 
Hs.194688 
Hs.21486 
Hs.2178 
Hs.21893 
Hs.21992 
Hs.169341 
Hs.2248 
Hs.22880 
Hs.23606 
Hs.23960 
Hs.293732 
Hs.24210 
Hs.95549 
Hs.236522 
Hs.24763 
Hs.24766 
Hs.24908
Hs.25590 
Hs.25812 
Hs.92200 
Hs.279836 
Hs.27258 
Hs.27263 
Hs.8207 
Hs.2795 
Hs.301804 
Hs.28309 
Hs.28555 
Hs.28707 
Hs.28777 
Hs.28792 
Hs£6966 
Hs.183868 
Hs.29191
Hs.192803 
Hs.30327 
Hs.30579 
Hs.3066 
Hs.3069 
Hs.30896 
Hs.110796
Hs.30985 
Hs.31053 
Hs.31433 
Hs.107767 
Hs.31989 
Hs.32317 
Hs.3260 
Hs.157 
Hs.180628 
Hs.156346 
Hs.3402 
Hs.30376 
Hs.183180 
Hs.69476 
Hs.35086 
Hs.35962 
Hs.3610 
Hs.154938 
Hs.268122 
Hs.3622 
Hs.279882 
Hs.136644 
Hs.267659 



ESTs 

translocase of outer mitochondrial membr 
collagen; type IV; alpha 3 (Goodpasture a 
bromodomain adjacent to zinc finger dom
signal transducer and activator of transcrip
H2B histone family; member Q
ESTs; Weakly similar to cAMP inducible
KIAA0689 protein

ESTs; Weakly similar to phosphatidic acid
small inducible cytokine subfamily B (Cy
ESTs; Highly similar to dipeptidyl peptid
ESTs
cyclin B1

ESTs; Moderately similar to !!!! ALU SU

ESTs

ESTs

DKFZP434P106 protein
RAN binding protein 1
DKFZP564E1962 protein
ESTs

stanniocalcin

Nijmegen breakage syndrome 1 (nibrin)

KIAA0480 gene product

ESTs

calcyclin binding protein
ESTs

H3 histone; family 3A
lactate dehydrogenase A
ESTs

UDP-glucose dehydrogenase

ESTs; Weakly similar to B0511.8 [C.eleg

signal sequence receptor; gamma (transloc

H2A histone family; member L

ESTs

ESTs; Moderately similar to alternatively
glucuronidase; beta



xeroderma pigmentosum; complementatio
mitogen-activated protein kinase-activated
ESTs

granzyme K (serine protease; granzyme 3
heat shock 70kD protein 9B (mortalin-2)
transcription factor-like 5 (basic helix-loo
ESTs; Weakly similar to cDNA EST yk45
MRS1 protein

cytoskeleton-associated protein 1
ESTs

ESTs; Moderately similar to CaM-KII inh
DKFZP586G1722 protein
Sox-like transcriptional factor
presenilin 1 (Alzheimer disease 3)
interleukin 10 receptor; alpha
dynamin-like protein
topoisomerase (DNA) II alpha (170kD)
ESTs

heterogeneous nuclear protein similar to r
ESTs; Weakly similar to RING finger pro
ESTs

ubiquitin-specific protease 1
ESTs

KIAA0205 gene product
ESTs

Homo sapiens mRNA; cDNA DKFZp586
procollagen-proline; 2-oxoglutarate 4-diox
Human DNA sequence from clone 703H1
Homo sapiens clone 23714 mRNA sequen
Homo sapiens vav 3 oncogene (VAV3) m 




8.7

205 

9 

5.3 

18.8 

13.4 

8.5 

7.2 

5.2 

10.1 

6.3 

7.3 

6.2 

4.3 

5.5 

7.1 

5.6 

5.5 

6.7 

5.8 

8.9 

6.5 

9.2 

12.1 

18.1 

7.5 

5.5 

5.1 

4.3 

8.4 

5.6 

8.7 

18.8 

11.8 

4.7 

52 

4.4 

4.6 

4.3 

9.4 

9.7 

6.2 

8.3 

9 

5.2 
6.6 
11 
4.9 
9.2 
45 
5.4 
4 

6.7 
5 

11.1 

5.6 

4.5 

14.3 

6.2 

6.3 

4.2 

4.3 

4.6 

6.4 

12 

10 

4.7 
132021 T68246 Hs.306079
132065 D82226 Hs.211594 
132085 D44466 Hs.3887 
132089 AA131971 Hs.39122 
132109 AA599801 Hs.40098
132143 AA257056 Hs.7972 
132149 T10822 Hs.324743 
132153 N90141 Hs.41066 
132160 AA281770 Hs.295923 
132164 U84573 Hs.41270 
132180 AA405569 Hs.418
132183 L19183 Hs.199695 
132225 AA128980 
132227 AA412620 Hs.4248 
132235 F09058 Hs.42656 
132256 AA608856 Hs.431 
132298 N41849 Hs.7120 
132314 AA285290 Hs.44499 
132325 N37065 Hs.44856 
132384 AA479933 Hs.46967 
132387 R70914 Hs.281434 
132393 W85888 Hs.47334 

132406 F09979 Hs.4774 

132407 AA431459 Hs.47783 
132413 AA132969 Hs.260116 
132446 AA426218 Hs.48764 
132465 AA047896 Hs.49169 
132482 AA429478 Hs.238126 
132492 T03749 Hs.4990 
132528 AA283006 Hs.50758 
132540 AA488987 Hs.5097
132543 AA417152 Hs.5101 
132580 137042 Hs.283738 
132586 AA412452 Hs.52515 
132608 AA199588 Hs.5321 

132616 AA386264 Hs.283558 

132617 AA171913 Hs.5338 
132618 AA253330 Hs.279916

132640 U33821 Hs.5437 
132668 AA453614 Hs.5460 
132694 M60830 Hs.5509 
132700 N47109 Hs.5521 
132724 AA417962 Hs.55498 
132738 W42674 Hs.264636 
132742 AA490862 Hs.292812
132744 X54326 Hs.55921 
132795 H99152 Hs.57079 
132807 AA331777 Hs.57301 
132811 U25435 Hs.57419 
132817 AB004884 Hs.57553 
132840 N23817 Hs.5807 
132845 D62588 Hs.5813 
132847 T48195 Hs.58189 
132856 W79865 Hs.58367 
132869 N26855 Hs.203961 
132874 AA425776 Hs.58609 
132880 AA444369 Hs.177537 
132894 D82422 Hs.5944 
132900 N56451 Hs£978 

132903 AA235404 Hs.5985 

132904 X83618 Hs.59889 
13290$ M142857 Hs.234896 
132914 AA496037 Hs.60293 
132918 AA252605 Hs.6051 
132936 AB002305 Hs.6111 
132951 U04209 Hs.61418 
132957 AA234791 Hs.61469 



chaperonin containing TCP1; subunit 5 (e
proteasome (prosome; macropain) 26S sub
proteasome (prosome; macropain) 26S sub
ESTs
ESTs

KIAA0871 protein
ESTs

ESTs; Moderately similar to ELONGATI

seven in absentia (Drosophila) homolog 1

procollagen-lysine; 2-oxoglutarate 5-dioxy

fibroblast activation protein; alpha; sepras

hypothetical protein

ESTs

ESTs

ESTs

murine leukemia viral (bmi-1) oncogene h
Homo sapiens cytokine receptor related p
small EDRK-rich factor 2
ESTs

Human DNA sequence from clone 167A1

heat shock 70kD protein 1

ESTs; Moderately similar to !!!! ALU SU

ESTs

ESTs

KIAA1104 protein

ESTs

ESTs

ESTs; Highly similar to CGI-49 protein [H
KIAA1089 protein

chromosome-associated polypeptide C
synaptogyrin 2

protein regulator of cytokinesis 1

casein kinase 1; alpha 1

DKFZP434N024 protein

ARP3 (actin-related protein 3; yeast) hom

isocitrate dehydrogenase 2 (NADP+); mit

carbonic anhydrase XII

adaptor-related protein complex 1; gamma

Tax1 (human T-cell leukemia virus type I

KIAA0776 protein

ecotropic viral integration site 2B

ESTs

geranylgeranyl diphosphate synthase 1
ESTs; Moderately similar to neuronal thre
ESTs; Weakly similar to C43H8.1 [C.eleg
glutamyl-prolyl-tRNA synthetase
ESTs

mutL (E. coli) homolog 1 (colon cancer; n
transcriptional repressor
tousled-like kinase 2

Homo sapiens clone 23675 mRNA sequen
ESTs

eukaryotic translation initiation factor 3; s

glypican 4

ESTs

ESTs

ESTs

ESTs

LIM domain only 7

Homo sapiens clone 25186 mRNA sequen
3-hydroxy-3-methylglutaryl-Coenzyme A
ESTs; Highly similar to geminin [H.sapie
ESTs

KIAA0616 protein
KIAA0307 gene product
microfibrillar-associated protein 1
Human gene from PAC 753P9; chromoso 
132959 AA028103 Hs.61472 ESTs; Weakly similar to unknown [S.cere 18.9
132968 N77151 Hs.61638 myosin X 5.8
132984 H80409 Hs.62112 zinc finger protein 207 4.3
132990 AA458761 Hs.18387 transcription factor AP-2 alpha (activating 4.2
132994 AA505133 Hs.279905 solute carrier family 2 (facilitated glucose 26.4
132998 Y00062 Hs.170121 protein tyrosine phosphatase; receptor typ 4.4
133002 AF005082 Hs.42915 ARP2 (actin-related protein 2; yeast) hom 4.7
133005 C21400 Hs.278605 KIAA0970 protein 6.6
133015 AA047036 Hs.246315 ESTs 7.9
133016 W81298 Hs.6289 growth factor receptor-bound protein 2 5.2
133039 X62055 Hs.63489 protein tyrosine phosphatase; non-recepto 4
133050 S67325 Hs.63788 propionyl Coenzyme A carboxylase; beta 5.2
133056 AA071387 Hs.6396 jumping translocation breakpoint 5
133062 R33663 Hs.64056 ESTs 5.4
133083 N70633 Hs.6456 chaperonin containing TCP1; subunit 2 (b 6
133091 AA122147 Hs.64691 KIAA0483 protein 5
133093 AA598749 Hs.285996 ESTs 5.6
133124 AA156049 Hs.267923 ESTs 4.1
133126 D16469 Hs.6551 ATPase; H+ transporting; lysosomal (vacu 6.2
133196 R37367 Hs.6727 Ras-GTPase activating protein SH3 doma 5.1
133214 Y10659 Hs.285115 interleukin 13 receptor; alpha 1 6.2
133225 Z41415 Hs.6823 ESTs; Weakly similar to intrinsic factor-B 8.3
133228 N90029 Hs.6831 Homo sapiens clone 1400 unknown prote 4.7
133239 AA059405 Hs.179882 Homo sapiens clone 24655 mRNA sequen 5.5
133240 D31161 Hs.242894 ESTs 9
133257 AF006086 Hs.6895 actin related protein 2/3 complex; subunit 7.7
133264 W72187 Hs.69192 ESTs; Weakly similar to cDNA EST yk37 6.7
133274 AA488886 Hs.6949 ESTs 4.2
133281 AA421079 Hs.69594 ESTs; Weakly similar to Sox-like transcri 4.9
133283 AA410507 Hs.6968 ESTs 4.3
133287 L15702 Hs.69771 B-factor; properdin 9.3
133294 R79723 Hs.69997 zinc finger protein 238 30.4
133297 AA600057 Hs.70266 KIAA0905 protein 10.4
133318 AA256168 Hs.152316 ESTs 8.5
133362 H06195 Hs.7194 ESTs; Highly similar to CGI-59 protein [H 14
133370 AA156897 Hs.72157 DKFZP5641 1922 protein 5
133391 X57579 Hs.727 inhibin; beta A (activin A; activin AB alp 13.9
133395 AA491296 Hs.72805 ESTs 4.3
133422 N79516 Hs.73287 ESTs; Weakly similar to eyelid (D.melano 4.5
133431 AA255438 Hs.7358 Homo sapiens mRNA; cDNA DKFZp566 8
133435 T23983 Hs.323966 ESTs 5
133449 AA094989 Hs.7381 voltage-dependent anion channel 3 8.7
133468 X03068 Hs.73931 major histocompatibility complex; class II 5
133484 X78710 Hs.211581 metal-regulatory transcription factor 1 5.3
133506 AA316868 Hs.74346 ESTs; Weakly similar to 140G11.h [D.me 6.8
133517 X52947 Hs.74471 gap junction protein; alpha 1; 43kD (conn 5.7
133551 D63480 Hs.278634 KIAA0146 protein 4.8
133569 AA313977 Hs.172772 transcription elongation factor B (SIII); po 9.5
133572 W94333 Hs.279915 translocase of inner mitochondrial membr 5
133577 F03717 Hs.75063 human immunodeficiency virus type I enh 7.4
133589 L37368 Hs.75104 RNA-binding protein S1; serine-rich dom 5
133608 D13315 Hs.75207 glyoxalase I 4.2
133617 AA148318 Hs.75249 KIAA0069 protein 4.5
133627 U09587 Hs.75280 glycyl-tRNA synthetase 10
133633 D21262 Hs.75337 nucleolar phosphoprotein p130 4.5
133634 U24166 Hs.234279 microtubule-associated protein; RP/EB fa 155
133640 D83004 Hs.75355 ubiquitin-conjugating enzyme E2N (homo 9.1
133644 D89077 Hs.75367 Src-like-adapter 6.4
133649 AA479139 Hs.75393 acid phosphatase 1; soluble 4.8
133652 AA287383 Hs.7540 ESTs 4.2
133674 AA458946 Hs.75497 ESTs 4.3
133700 K01396 Hs.297681 protease inhibitor 1 (anti-elastase); alpha- 6.3
133705 N21648 Hs.75659 MpV17 transgene; murine homolog; glom 4.6
133716 Y00282 Hs.75722 ribophorin II 7 5
133720 L27841 Hs.75737 pericentriolar material 1 9.4
133752 U49278 Hs.75875 ubiquitin-conjugating enzyme E2 variant 4.5
133765 D21255 Hs.75929 cadherin 11 (OB-cadherin; osteoblast) 6.4
133772 W73693 Hs.76038
133774 Z23090 Hs.76067
133776 J03473 Hs.177766
133784 AA214305 Hs.301064
133814 M33882 Hs.76391
133829 AA453783 Hs.76550
133834 AA147510 Hs.288660
133839 M59815 Hs.170250
133842 U73477 Hs.285013
133845 T68510 Hs.76704
133859 U86782 Hs.178761
133867 D43948 Hs.76989
133868 U58090 Hs.183874
133871 AA454597 Hs.182793
133893 X01060 Hs.77356
133914 N32811 Hs.77542
133918 W72783 Hs.58382
133944 AA045870 Hs.7780
133946 AA156565 Hs.173878
133963 L34587 Hs.184693
133980 D00760 Hs.250811
133990 C02374 Hs.7822
133999 M28213 Hs.78305
134030 J03077 Hs.78575
134032 Z81326 Hs.78589
134045 S82470 Hs.78768
134046 D28473 Hs.172801
134064 D87685 Hs.78893
134070 H98621 Hs.78946
134087 U51166 Hs.173824
134090 M22382 Hs.79037
134098 X06323 Hs.79086
134110 U41060 Hs.79136
134132 U32519 Hs.220689
134168 AA398908 Hs.181634
134170 M63138 Hs.79572
134208 U88871 Hs.79993
134258 L28010 Hs.808
134288 AA430008 Hs.8117
134310 AA313414 Hs.8148
134326 U16306 Hs.81800
134329 D38551 Hs.81848
134331 AA452020 Hs.111222
134351 R82074 Hs.82109
134357 L43575 Hs.82171
134363 M37033 Hs.82212
134367 X54199 Hs.82285
134374 D62633 Hs.8236
134375 AA412720 Hs.82389
134376 X02874 Hs.82396
134381 U56637 Hs.184270
134388 M15841 Hs.82575
134395 L09717 Hs.8262
134399 H99801 Hs.82689
134401 AA243746 Hs.211577
134405 J04177 Hs.82772
134415 AA329274 Hs.82911
134417 D87969 Hs.82921
134419 L08044 Hs.82961
134421 AA122386 Hs.82985
134423 W96151 Hs.83006
134438 AA449984 Hs.246857
134448 T25732 Hs.83419
134453 X70683 Hs.83484
134470 X54942 Hs.83758
134487 R38185 Hs.83954
134495 D63477 Hs.84087



isopentenyl-diphosphate delta isomerase
heat shock 27kD protein 1
ADP-ribosyltransferase (NAD+; poly (AD
ESTs

myxovirus (influenza) resistance 1; homol
Homo sapiens mRNA; cDNA DKFZp564
serine protease; umbilical endothelium
complement component 4A
putative human HLA class II associated p
ESTs

26S proteasome-associated pad1 homolog

KIAA0097 gene product

cullin 4A

ESTs

transferrin receptor (p90; CD71)
ESTs

ESTs; Weakly similar to C13F10.5 [C.ele
Homo sapiens mRNA; cDNA DKFZp564
4-nitrophenylphosphatase domain and non
transcription elongation factor B (SIII); po
proteasome (prosome; macropain) subunit
Homo sapiens mRNA; cDNA DKFZp564
RAB2; member RAS oncogene family
prosaposin (variant Gaucher disease and v
protease inhibitor 12 (neuroserpin)
BB1

isoleucine-tRNA synthetase
KIAA0244 protein
cullin 3

thymine-DNA glycosylase
heat shock 60kD protein 1 (chaperonin)
ribosomal protein; mitochondrial; L3
LIV-1 protein; estrogen regulated
Ras-GTPase-activating protein SH3-doma
Human Chromosome 16 BAC clone CIT9



heterogeneous nuclear ribonucleoprotein F
ESTs

Homo sapiens clone 24856 mRNA sequen
chondroitin sulfate proteoglycan 2 (versic
RAD21 (S. pombe) homolog
ESTs; Weakly similar to CGM28 protein
syndecan 1

Human clone 191B7 placenta expressed m
CD53 antigen

phosphoribosylglycinamide formyltransfe
ESTs

ESTs; Highly similar to CGM 18 protein
2';5'-oligoadenylate synthetase 1
capping protein (actin filament) muscle Z-
small nuclear ribonucleoprotein polypepti
lysosomal-associated membrane protein 2
tumor rejection antigen (gp96) 1
kinectin 1 (kinesin receptor)
collagen; type XI; alpha 1
protein tyrosine phosphatase type IVA; m
solute carrier family 35 (CMP-sialic acid t
trefoil factor 3 (intestinal)
collagen; type V; alpha 2
ESTs; Highly similar to CGH39 protein
ESTs; Highly similar to proteine kinase JN
KIAA0252 protein

SRY (sex determining region Y)-box 4
CDC28 protein kinase 2
Homo sapiens unknown mRNA
KIAA0143 protein
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7.1 

S3 

13.7
6.4
11.9
4.7
4.5

9.4 

4.4 

6.6 

8.6 

9.3 

6.3 

4.3 

6.9 

7.4 

6.1 

8.6 

6.1 

4.4 

6.6 

5.3 

4.8 

152 

7.2 

6.4 

4 

5.7 

6.9 

4.5 

112 

15.3 

4.1 

4.2 

5.9 

5.8 

4.4
4.6
5.1 
20.3 
5 

16.1
134498
134506 
134529 
134570 
134582 
134600 
134623 
134654 
134655 
134675 
134711 
134714 
134722 
134776 
134806 
134810 
134840 
134843 
134853 
134866 
134868 
134885 
134982 
134989 
134992 
134993 
135010 
135015 
135029 
135032 
135037 
135059 
135071 
135083 
135117 
135144 
135154 
135218 
135237 
135243 
135335 
135349 
135367
135389 
135400 
135411 
300019 
300021 
300022 
300089 
300107 
300254 
300328 
300549 
300711 
300921 
301124 
301165 
301576 
301604 
301704 
301782 
301884 
301936 
302002 
302032 
302067 



M63180 

U45328 

H24460 

U66615 

AA234966 

R68884 

X74496 

W23625 



Hs.84131 

Hs.84285 

Hs.848 

Hs.172280 

Hs.86041 

Hs.86347 

Hs.86976 

Hs.8739 



AA454070 Hs.123090 
AA250745 Hs.87773 



X04011 
U89922 
W47183 
J05582 
Z49099 
M27394 
U51477 
H60595 
D82348 
U84011 
Z39762 
N27670 
N46086 



Hs.88974 

Hs.890 

Hs.284226 

Hs.89603 

Hs.89718 

Hs.89751 

Hs.89981 

Hs.90061 

Hs.90280 

Hs.904 

Hs.90419 

Hs.9071 

Hs.92308 



Hs.173685 
Hs.278589 



AA236324 Hs.92381 
H05625 Hs.5831 
AA282343 Hs.301005 
D59675 Hs.92927
U54999 Hs.278338 
AA224180 
AA243497 
U77948 
AA598449 Hs.93832 
L08069 Hs.94 
AA495950 Hs.94262 
W52493 Hs.94694 
AA044842 Hs.95260 
AA126433 Hs.267812 
D31157 Hs.324277 
AA454930 Hs.9691 
AA215333 Hs.97101 
H20989 Hs.198281 
D83174 Hs.9930 
AA480109 Hs.9963 
U05237 Hs.99872 



Hs.99915 
Hs.99947 



M23263 
L10333 
M97935 
M97935 
M97935 

AI199738 Hs.208275
AI694585 Hs.270464 
AW079607 Hs.188417 
AW015860 Hs.224623 
AA699328 Hs.298119 
AI492179 Hs.166244 
AW293224 Hs.232165 
T79326 Hs.298262 
N85789 Hs.190186 
AI682905 Hs.270431 
AA373124 Hs.24809 
AA526313 Hs.293691 
N99399 Hs.143046
AA312082 Hs.105445 
NMJHM694HS.114924 
AF013956 Hs.5637 
NM_001992 Hs.128087
Hs.222399 



threonyl-tRNA synthetase
ubiquitin-conjugating enzyme E2I (homol
FK506-binding protein 4 (59kD)
SWI/SNF related; matrix associated; actin
CGG triplet repeat binding protein 1
ESTs; Weakly similar to predicted using G
prolyl endopeptidase
ESTs; Weakly similar to ORF YGR200c I
ESTs

protein kinase; cAMP-dependent; catalyti
cytochrome b-245; beta polypeptide (chro
lymphotoxin beta (TNF superfamily; mem
ESTs; Weakly similar to neural F box pro
mucin 1; transmembrane
spermine synthase

membrane-spanning 4-domains; subfamily
diacylglycerol kinase; zeta (104kD)
progesterone binding protein
5-aminoimidazole-4-carboxamide ribonuc
amylo-1;6-glucosidase; 4-alpha-glucanotr
KIAA0882 protein

progesterone membrane binding protein
ESTs

Homo sapiens mRNA; chromosome 1 spe
ESTs

purine-rich element binding protein B
ESTs

LGN protein

ESTs; Moderately similar to 17-beta-hydr

Human DNA sequence from clone 30M3

general transcription factor II; i

Homo sapiens clone 24483 unknown mRN

heat shock protein; DNAJ-like 2

ESTs

Homo sapiens clone 24837 mRNA sequen
Homo sapiens mRNA; cDNA DKFZp586
sorting nexin 4

ESTs; Weakly similar to growth factor-res
ESTs

putative G protein-coupled receptor
pyruvate kinase; muscle
collagen-binding protein 2 (colligin 2)
TYRO protein tyrosine kinase binding pro
fetal Alzheimer antigen
androgen receptor (dihydrotestosterone re
reticulon 1

AFFX control: STAT1

AFFX control: STAT1

AFFX control: STAT1

ESTs; Weakly similar to !!!! ALU CLASS

ESTs; Weakly similar to !!!! ALU CLASS

ESTs; Weakly similar to ZnT-3 [H.sapien

ESTs

ESTs

ESTs; Weakly similar to cDNA EST yk40
ESTs

ESTs; Weakly similar to AI88J8.1 [H.sapi
ESTs; Weakly similar to PTERIN-4-ALP
ESTs; Weakly similar to !!!! ALU SUBFA
ESTs; Weakly similar to C17G10.1 [C.ele
ESTs

EST cluster (not in UniGene) with exon h
GDNF family receptor alpha 1
EST cluster (not in UniGene) with exon h
chromobox homolog 4 (Drosophila Pc cla
EST cluster (not in UniGene) with exon h
ESTs; Weakly similar to protein-tyrosine
302145 NM_003613 Hs.151407
302236 AI128606 Hs.6557
302276 NM_004448 Hs.323910
302290 AL117607 Hs.175563
302326 NM_004271 Hs.184018
302342 AB023141 Hs.190386
302372 AL117406 Hs.200102
302422 AB021227 Hs.3743
302431 AF129530 Hs.226434
302501 AF022726 Hs.251446
302505 AL049650 Hs.247874
302533 L36149 Hs.248116
302638 AA463798 Hs.102696
302656 AW293005 Hs.70704
302792 AA343696 Hs.46821
302820 X04588 Hs.85844
302838 U66049 Hs.82171
302892 N58545 Hs.42346
302977 AW263124 Hs.315111
302989 N46406 Hs.84700
303007 AA478876 Hs.317714
303052 AF140242 Hs.279926
303131 AW081061 Hs.103180
303132 AI929819 Hs.4055
303153 U09759 Hs.246857
303387 AA908797 Hs.180799
303499 AI815990 Hs.293515
303502 AA488528
303576 T07216 Hs.301226
303620 AA397546 Hs.119151
303634 AI953377 Hs.28444
303642 AW299459 Hs.111977
303654 AA436942 Hs.288529
303733 AW502496 Hs.15220
303780 AW24014 Hs.18995
303792 C75094 Hs.199839
303842 AI337304 Hs.126268
303951 AW475081 Hs.172928
304465 AA421948
304507 AA456426
304591 AA505702
304601 AA507875
304659 AA533185
305040 AA630582 Hs.169476
305134 AA653159 Hs.179661
305415 AA725116 Hs.78465
305453 AA738110
305898 AA872838
305913 AA876109
305950 AA884479
306004 AA889992 Hs.2186
306009 AA894560 Hs.283370
306060 AA906161 Hs.76277
306398 AA970548 Hs.297681
306505 AA987722 Hs.172928
306576 AA995761 Hs.276092
307117 AI184111 Hs.76067
307138 AI185516 Hs.172928
307187 AI190870 Hs.276417
307542 AI280859 Hs.62954
307554 AI281603 Hs.172928
307806 AI351739 Hs.276726
308079 AW72733 Hs.270208
308307 AI581398 Hs.172928
308511 AI687580 Hs.169476
308615 AI738593 Hs.101774
308677 AI761173



EST cluster (not in UniGene) with exon h
zinc finger protein 161
EST cluster (not in UniGene) with exon h
Homo sapiens mRNA; cDNA DKFZp564
EST cluster (not in UniGene) with exon h
KIAA0924 protein

Homo sapiens mRNA; cDNA DKFZp434
matrix metalloproteinase 24 (membrane-in
EST cluster (not in UniGene) with exon h
EST cluster (not in UniGene) with exon h
multiple UniGene matches
chemokine (C motif) XC receptor 1
ESTs; Weakly similar to C11D2.4 [C.eleg
ESTs

ESTs; Weakly similar to putative [H.sapie

EST cluster (not in UniGene) with exon h

EST cluster (not in UniGene) with exon h

histone deacetylase 3

EST cluster (not in UniGene) with exon h

EST cluster (not in UniGene) with exon h

pallid (mouse) homolog; pallidin

EST cluster (not in UniGene) with exon h

actin-like 6

ESTs

mitogen-activated protein kinase 9

ESTs

ESTs

EST cluster (not in UniGene) with exon h
EST cluster (not in UniGene) with exon h
ESTs

ESTs; Weakly similar to predicted using G
EST cluster (not in UniGene) with exon h
ESTs

ESTs; Weakly similar to zinc finger prote

ESTs; Moderately similar to KIAA0456 p

ESTs; Highly similar to NG22 [H.sapiens

ESTs; Weakly similar to similar to PD2 d

collagen; type I; alpha 1

EST singleton (not in UniGene) with exon

EST

EST singleton (not in UniGene) with exon
EST singleton (not in UniGene) with exon
EST singleton (not in UniGene) with exon
glyceraldehyde-3-phosphate dehydrogena
EST singleton (not in UniGene) with exon
EST singleton (not in UniGene) with exon
EST singleton (not in UniGene) with exon
keratin 8

EST singleton (not in UniGene) with exon
EST singleton (not in UniGene) with exon
EST singleton (not in UniGene) with exon
EST singleton (not in UniGene) with exon
EST singleton (not in UniGene) with exon
EST singleton (not in UniGene) with exon
EST singleton (not in UniGene) with exon
EST singleton (not in UniGene) with exon
heat shock 27kD protein 1
collagen; type I; alpha 1
EST singleton (not in UniGene) with exon
EST singleton (not in UniGene) with exon
EST singleton (not in UniGene) with exon
EST singleton (not in UniGene) with exon
ESTs

collagen; type I; alpha 1
EST singleton (not in UniGene) with exon
EST singleton (not in UniGene) with exon
EST singleton (not in UniGene) with exon
ESTs
ESTs


7.1 


313258 AW068358 Hs.183918 


ESTs 


13.7 


313328 AW449211 Hs.105445 


ESTs 


27.9 


313352 AW292127 Hs.144758 


ESTs 


9.8 


313417 AA741151 Hs.137323 


ESTs 


8.2 


313455 AW081702 Hs.98571 


ESTs 


6.9 






WO 02/059377 PCT/US02/02242 



313590 AA804410 Hs.291677
313663 AI953261 Hs.169813
313567 U69201 Hs.13684
313749 AW450376 Hs.119004
313832 AW271022 Hs.133294
313881 AA535580 Hs.16331
313915 AI959390 Hs.163443
313955 AI858884 Hs.270647
313974 AI310151 Hs.173524
314097 AA648744 Hs.269493
314129 AA228366 Hs.115122
314359 AA205569 Hs.194193
314384 AA535840 Hs.162203
314394 AI380563 Hs.130816
314462 AA347951 Hs.326413
314465 AA602917 Hs.156974
314470 AI934422 Hs.30661
314488 AA358265 Hs.182890
314506 AA833655 Hs.206868
314510 AI204418 Hs.190080
314558 AI873274 Hs.190721
314661 AA436432 Hs.324239
314691 AW207206 Hs.136319
314754 AW026761 Hs.134374
314775 AI149880 Hs.188809
314943 AI476797 Hs.184572
314961 AW008061 Hs.231994
314963 AI689617 Hs.200934
315006 AI538613 Hs.298241
315010 AA531082 Hs.240049
315019 AA532807 Hs.105822
315033 AI493046 Hs.146133
315036 AA534953 Hs.163297
315037 AW205863 Hs.133988
315051 AW292425 Hs.163484
315054 AI968598 Hs.78768
315073 AW452948 Hs.257631
315080 AA744550 Hs.136345
315083 AI221325 Hs.205442
315088 AA557351 Hs.152448
315175 AI025842 Hs.152530
315195 AA972756 Hs.44898
315296 AA876905 Hs.125286
315303 AW194364 Hs.128022
315352 AA604799 Hs.136528
315364 AA643602 Hs.155485
315368 AW291563 Hs.104696
315390 AI801565 Hs.200113
315408 AW273261 Hs.216292
315458 AA872000 Hs.116104
315472 AA828850 Hs.165469
315478 AA665612 Hs.120874
315498 AA628539 Hs.116252
315527 AI791138 Hs.116768
315530 AI200852 Hs.127760
315562 AA737415 Hs.152826
315634 AA837085 Hs.220585
315647 AA648983 Hs.212911
315652 AI521489 Hs.3053
315676 AW002565 Hs.124660
315680 AA814309 Hs.123583
315735 AI831760 Hs.155111
315741 AA812168 Hs.122559
315769 AA744875 Hs.189413
315978 AA830893 Hs.119769
315984 AI015862 Hs.131793
316042 AW297979 Hs.170698



EST cluster (not in UniGene)

ESTs

ESTs; Weakly similar to choline kinase b 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs; Weakly similar to alternatively spli

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

EST cluster (not in UniGene) 

ESTs 

ESTs 

ESTs 

cell division cycle 2; G1 to S and G2 to M 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs; Weakly similar to gene MAC25 pr 

EST 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs; Moderately similar to MULTIFUN 

ESTs 

ESTs 

ESTs 

ESTs; Weakly similar to FIG-1 PROTEIN 
ESTs; Moderately similar to !!!! ALU SU
ESTs; Highly similar to serine protease [H 
ESTs 

ESTs; Weakly similar to alternatively spli 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs; Moderately similar to !!!! ALU SU

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs
5.3

20.7
6.1

12 

8.3 

6.1 

12.7 

7.6 

13.9 

4.4 

5.1 

4.7 

11.9 

28.8 

16.1 

25.7 

123 

4.6 

4.8 

4.4 

5 

7.6 

4.9 

5.2 

4.8 

4.4 

22.4 

5.9 

8.8 

15 

6.3 

9.2 

8.1 

13.4 

5.4 

4.4 

10.4 

5 

14.7
Hs.190489
Hs.158549 



316136 AA830808 Hs.124366
316177 AI908272 Hs.293102
316313 AA741300 Hs.202599
316405 AA757900 Hs.270823
316480 AI749921 Hs.205377
316564 AI743571 Hs.168799
316714 AA809792 Hs.123307
316715 AI440266 Hs.170673
316828 AA828116 Hs.173076
316869 AI954880 Hs.134604
316905 AW138241 Hs.210846
316943 AW014875 Hs.137007
316949 AA856749 Hs.124620
317008 AW051597 Hs.143707
317028 AA962623 Hs.189144
317067 AI805392 Hs.325335
317069 AI732892
317210 AA490718
317298 AI922374
317658 AW139077 Hs.202217
317674 AW294909 Hs.132208
317685 AI798630 Hs.149997
317836 AA983913 Hs.128929
317881 AI827248 Hs.224398
317902 AI828602 Hs.211265
317916 AI565071 Hs.159983
318042 AW294522 Hs.149991
318053 AI074465 Hs.133469
318064 AW296888 Hs.170939
318070 AI024594 Hs.248942
318073 AW167087 Hs.131562
318146 AI040125 Hs.150521
318186 AW016773 Hs.3709
318481 AI291584 Hs.145921
318566 AI335361 Hs.226376
318617 AW247252 Hs.75514
318662 AI285898 Hs.294014
318691 AW192139 Hs.181307
318740 NM_002543 Hs.77729
318744 AI793124 Hs.144479
318948 AA317274 Hs.13996
319163 F15257 Hs27
319478 R06841 Hs.270307
319545 R83716 Hs.14355
319668 NM_002731 Hs.87773
319763 AA460775 Hs.6295
319913 AA179304 Hs.271586
319936 W22152
319951 AA307665
319962 H06350
319977 AA632632
320074 AA321166 Hs.278233
320092 AF022799 Hs.113292
320107 AA836461 Hs.291712
320133 D63271
320167 AA984373
320187 T99949
320211 AL039402 Hs.125783
320401 U90449 Hs.152717
320458 AI884396 Hs.24131
320488 R31386 Hs.191791
320521 N31464 Hs.24743
320661 AA864846 Hs.115175
320691 R61576 Hs.313951
320699 R63161 Hs.118249
320727 U96044 Hs.181125
320993 AL050145 Hs.225986



Hs.282929 

Hs.14559 

Hs.135056 



Hs.90790 
Hs.303428 



ESTs 

EST cluster (not in UniGene)

ESTs 

ESTs 

ESTs 

ESTs; Weakly similar to !!!! ALU SUBFA

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs; Weakly similar to RENAL SODIU 

ESTs 

ESTs 

EST cluster (not in UniGene)

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs; Weakly similar to HYPOTHETICA
ESTs 

nucleoside phosphatase 
ESTs 

H3 histone; family 3A

EST cluster (not in UniGene)

ESTs 

ESTs 

glycine dehydrogenase (decarboxylating;

EST cluster (not in UniGene)

ESTs 

EST cluster (not in UniGene)
ESTs

ESTs; Moderately similar to !!!! ALU SU
EST cluster (not in UniGene)
ESTs
ESTs

EST cluster (not in UniGene)
EST cluster (not in UniGene)
calpain 9 (nCL-4)
EST cluster (not in UniGene)
EST cluster (not in UniGene)
EST cluster (not in UniGene)
EST cluster (not in UniGene)
DEME-6 protein

nucleoside diphosphate kinase type 6 (inh
ESTs 

EST cluster (not in UniGene)
ESTs

EST cluster (not in UniGene)

hypothetical protein

EST cluster (not in UniGene)

EST cluster (not in UniGene)

Homo sapiens mRNA; cDNA DKFZp586
321012 AA737314 Hs.194324 EST cluster (not in UniGene)

321050 AW393497 EST cluster (not in UniGene)
321051 AF134149 Hs.240395 EST cluster (not in UniGene)
321171 AI769410 Hs.221461 ESTs
321192 AA295304 Hs.297939 ESTs; Weakly similar to neogenin [H.sap
321354 AA078493 EST cluster (not in UniGene)
321387 H68014 Hs.141278 ESTs; Weakly similar to !!!! ALU SUBFA
321412 AW366305 Hs.22891 EST cluster (not in UniGene)
321489 AW392474 Hs.172759 ESTs; Moderately similar to !!!! ALU SU
321539 N98619 Hs.42915 ARP2 (actin-related protein 2; yeast) hom
321593 H84762 Hs.253197 ESTs
321666 D28390 Hs.272897 EST cluster (not in UniGene)
321891 AW157424 Hs.165954 ESTs
321910 H67065 Hs.271530 ESTs; Weakly similar to !!!! ALU SUBFA
321953 AW068268 Hs.292833 ESTs; Weakly similar to !!!! ALU CLASS
321978 N77342 Hs.21851 EST cluster (not in UniGene)
322017 AA310039 Hs.9192 ESTs
322026 AA233527 Hs.283675 low density lipoprotein receptor (familial
322035 AL137517 Hs.306201 EST cluster (not in UniGene)
322171 AF085968 Hs.48474 EST cluster (not in UniGene)
322175 AF085975 EST cluster (not in UniGene)
322236 AL134970 Hs.104222 follistatin-like 1
322303 W07459 Hs.157601 EST cluster (not in UniGene)
322735 AA086123 Hs.297856 EST cluster (not in UniGene)
322777 AA679082 Hs.269947 ESTs
322818 AW043782 Hs.293616 ESTs
322882 AW248508 Hs.279727 DiGeorge syndrome critical region gene 2
322975 C16391 EST cluster (not in UniGene)
322991 C18965 Hs.159473 ESTs
323011 AA580288 EST cluster (not in UniGene)
323091 AW014094 Hs.210761 ESTs
323107 AI301107 Hs.150790 ESTs
323136 AL120351 Hs.30177 EST cluster (not in UniGene)
323168 AL120862 Hs.124165 ESTs
323195 AI064982 Hs.117950 multifunctional polypeptide similar to SA
323201 AL049370 Hs.13350 Homo sapiens mRNA; cDNA DKFZp586
323203 AA203135 Hs.130186 ESTs
323243 W44372 Hs.110771 EST cluster (not in UniGene)
323244 T70731 Hs.193620 EST cluster (not in UniGene)
323328 AA228078 Hs.255096 EST cluster (not in UniGene)
323332 AI829520 Hs.227513 ESTs
323333 AA228883 Hs.208558 EST cluster (not in UniGene)
323570 AL038623 Hs.208752 ESTs; Weakly similar to !!!! ALU SUBFA
323604 AI751438 Hs.41271 ESTs; Weakly similar to !!!! ALU SUBFA
323685 AA344205 Hs.289088 EST cluster (not in UniGene)
323753 AA327102 Hs.70266 EST cluster (not in UniGene)
323817 AA410943 EST cluster (not in UniGene)
323845 AI684674 Hs.41127 ESTs; Weakly similar to waclaw [D.melan
323930 AA570698 Hs.8173 ESTs
323997 AA844907 Hs.274454 EST cluster (not in UniGene)
324047 AA378201 Hs.271340 EST cluster (not in UniGene)
324261 AL044891 Hs.269350 EST cluster (not in UniGene)
324302 AA543008 Hs.292471 ESTs; Weakly similar to !!!! ALU SUBFA
324338 AL138357 Hs.145078 ESTs
324344 AW502000 Hs.46677 EST cluster (not in UniGene)
324432 AA464510 Hs.152812 EST cluster (not in UniGene)
324495 AW501411 Hs.122489 ESTs; Weakly similar to !!!! ALU CLASS
324497 AW152624 Hs.136340 ESTs
324598 AA502659 Hs.163986 ESTs
324603 AW016378 Hs.292934 ESTs
324620 AA448021 Hs.94109 EST cluster (not in UniGene)
324727 AI610425 Hs.19597 ESTs
324774 AI031771 Hs.132586 ESTs
324783 AA640770 Hs.200994 EST cluster (not in UniGene)
324824 AI826999 Hs.224624 ESTs
324826 AA704806 Hs.143842 ESTs
324902 D31323 Hs.271492 ESTs
324961 AA613792 

324987 T06882 Hs.172634 

324988 T06997 Hs.121028 
325146 AI064690 Hs.171176 
325622 

326213 
326474 
326816 
326817 
327110 
327196 
327283 
327313 
327450 
328059 
328304 
328492 
328857 
329357 
329373 
329655 



M23263 

AA449749 

H14624 

X03363 

D50692 

TIGR:HT544 

M13755 Hs.833 



329960 
330084 
330384 
330385 
330387 
330388 
330409 
330460 
330486 
330494 
330500 
330510 
330513 
330541 
330542 
330547 
330551 
330562 
330573 
330673 
330711 
330814 
330850 
330874 
330884 
330912 
330924 
330997 
331014 
331024 
331046 
331135 
331145 
331148 
331222 
331230 
331306 
331327 
331337 
331341 
331344 
331362 
331363 
331376 
331384 



M29696 

M34423 

M75099 

M81057 

U22970 

U23942 

U32989 

U39840

U49082 

U62800 

D57823 



Hs.237868 
Hs.79222 
Hs.227729 
Hs.180884 
Hs.265827 
Hs.226213 
Hs.183671 
Hs.299867 
Hs.76460 
Hs.83393 
Hs.321403 



AA164687 Hs.177576 
AA015730 Hs.265398 
AA075298 Hs.322710 
AA127474 Hs.191157
AA133457 Hs.102548 
AA195936 Hs.82719 
AA232136 Hs.159737 
H55762 Hs.9302 



H98597 
N32919 
N66563 
R61398 
R72427 
R73816 
T98531 
W69807 



Hs.30340 
Hs.27931 
Hs.191358 
Hs.4197 
Hs.129873 
Hs.17385 
Hs.173904 
Hs.16537 



AA252079 Hs.63931 
AA281076 Hs.109221 
AA287662 Hs.50495 
AA303125 Hs.23240
AA357927 Hs.126550
AA417956 Hs.40782 
AA421562 Hs.91011 
AA443802 Hs.41007 
AA456001 Hs.93847 



EST cluster (not in UniGene) 
ESTs 

EST cluster (not in UniGene)
ESTs

CH.14_hs gi|5867000

CH.17_hs gi|5867224

CH.19_hs gi|5867405

CH.20_hs gi|6552458

CH.20_hs gi|6552458

CH.21_hs gi|6117842

CH.01_hs gi|5867446

CH.01_hs gi|5867478

CH.01_hs gi|5867501

CH.02_hs gi|5867766

CH.06_hs gi|6117819

CH.07_hs gi|6004478

CH.07_hs gi|5868455

CH.07_hs gi|6381927

CH.X_hs gi|5868842

CH.X_hs gi|6682537

CH.14_p2 gi|6446516

CH.15_p2 gi|6563505

CH.16_p2 gi|5091594

CH.19_p2gpi5302

androgen receptor (dihydrotestosterone re 

ESTs; Highly similar to secreted apoptosi 

ESTs; Highly similar to secreted apoptosi 

HER2 receptor tyrosine kinase (c-erbB-2; 

c-myc binding protein 

Hs.73946 

interferon-stimulated protein; 15 kDa
interleukin 7 receptor
galactosidase; beta 1
FK506-binding protein 2 (13kD)
carboxypeptidase B1 (tissue)
multiple UniGene matches
cytochrome P450; 51 (lanosterol 14-alpha
tryptophan 2;3-dioxygenase
hepatocyte nuclear factor 3; alpha
transporter protein
cystatin E/M

Sec23 (S. cerevisiae) homolog A
mannosyl (alphas ^-glycoprotein beta-1
ESTs; Weakly similar to transformation-r
ESTs 

ESTs; Weakly similar to !!!! ALU SUBFA
ESTs 

general transcription factor IIA; 1 (37kD a 

Homo sapiens mRNA; cDNA DKFZp434

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs; Weakly similar to CYTOCHROME 

ESTs 

ESTs 

hypothetical protein; similar to (U06944) 
dachshund (Drosophila) homolog
ESTs 
ESTs 

ESTs; Weakly similar to !!!! ALU SUBFA

ESTs 

ESTs 

anterior gradient 2 (Xenopus laevis) homo
ESTs; Weakly similar to cDNA EST yk47
ESTs 

331478 N26608
331526 N49957
331533 N51517 
331681 W85712 
331686 W88502 

331750 AA284372 

331751 AA284840 
331760 AA292721 
331763 AA312861 
331825 AA411144 
331890 AA432166 
331952 AM54756 
332015 AA487910 
332043 AA490831 
332060 AA504779 
332071 AA598594 
332093 AA608794 
332139 AA620669 
332219 N22508 
332225 N33213 

332246 N57927 

332247 N58172 
332260 N70088 
332269 N91279 
332336 T96130 
332340 W15495 
332347 W60326 
332362 W93640 
332467 AA489630 
332499 M12036 
332513 AA018182 
332526 AA281753 
332532 N63192 
332565 AA234896 
332607 R41791 
332640 AA417152 
332694 AA262768 
332702 H93968 
332705 T59161 
332749 AA479968 
332927 

332929 

332930 

332955 

332958 

332961 

332983 

333009 

333010 

333013 

333108 

333139 

333254 

333305 

333343 

333388 

333455 

333459 

333517 

333585 

333679 

333743 

333758 

333767 

333768 

333769 

333795 



Hs.40639 
Hs.46624 
Hs.47282 
Hs.119571 
Hs.182258 
Hs.111471 
Hs.143818 
Hs.154434 
Hs.96704 
Hs.292862 
Hs.3577 
Hs.97837 
Hs.208800 
Hs.125056 
Hs.191402 
Hs.205293 
Hs.112592 
Hs.112879
Hs.139315 
Hs.100425 
Hs.120777 
Hs.109370 
Hs.138467 
Hs.109654 
Hs.137551 
Hs.129781 
Hs.288684 
Hs.4779 
Hs.119004 
Hs.323910 
Hs.154424 
Hs.77515
Hs.1892 
Hs.25272 
Hs.36566 
Hs.6101 
Hs.243901
Hs.75725
Hs.76293 
Hs.88251 



ESTs 
ESTs 
ESTs 

collagen; type III; alpha 1 (Ehlers-Danlos

ESTs 

ESTs 

ESTs 

ESTs; Weakly similar to unknown [H.sap 

ESTs 

ESTs 

succinate dehydrogenase complex; subuni
ESTs 

ESTs; Weakly similar to !!!! ALU CLASS

ESTs 

ESTs 

ESTs 

ESTs 

EST 

ESTs 

ESTs 

ESTs; Weakly similar to RNA POLYME 

ESTs 

ESTs 

ESTs; Moderately similar to outer membr 
ESTs 

chromosome 21 open reading frame 5 
ESTs 

ESTs; Moderately similar to similar to AD 

KIAA0665 gene product

Human tyrosine kinase-type receptor (HE

deiodinase; iodothyronine; type II

inositol 1;4;5-triphosphate receptor; type

EST; Highly similar to PHENYLETHAN

E1A binding protein p300

LIM domain kinase 1

protein regulator of cytokinesis 1

KIAA1067 protein

transgelin 2

thymosin; beta 10 



CH22_FGENES.38J

CH22_FGENES.38J

CH22_FGENES.38_4

CH22_FGENES.48J2

CH22_FGENES.48J5

CH22_FGENES.48_18

CH22_FGENES.54_5

CH22_FGENES.61J

O^GENES^U

Cr^GENES.eU

CH22_FGENESJ9J4

CH22_FGENES.63J6

CH22_FGENES.11BJ

CH22_FGENES.137J

CH22_FGENES.139J2

CH22_FGENES.144J

CH22_FGENES.157J

CH22_FGENES.157J

CH22_FGENES.173_2

CH22_FGENES.203_4

CH22_FGENES.247_5

CH22_FGENES564J

CH22_FGENES.268J

CH22_FGENES.27L6

CH22_FGENES.271J

CH22J : GENES^7U

CH22_FGENES.275J
333796

333892 

333904 

333905 

333921 

333968 

334102 

334222 

334223 

334264 

334343 

334360 

334784 

334789 

334794 

334889 

335004 

335115 

335287 

335342 

335491 

335495 

335498 

335544 

335610 

335653 

335682 

335687 

335755 

335782 

335791 

335809 

335822 

335823 

335824 

335825 

335895 

335917 

335920 

336035 

336042 

336093 

336096 

336150 

336152 

336416 

336444 

336449 

336471 

336512 

336558 

336560 

336676 

336959 

337968 

338008 

338057 

338410 

338451 

338588 

338665 



338832 
333980 
339352 
339373 



CH22_FGENES.275J

CH22_FGENES.292_14

CH22_FGENES.294J

CH22_FGENES.294_3

CH22_FGENES.296_12

CH22_FGENES.307_4

CH22_FGENES.327_60

CH22_FGENES.360_3

CH22_FGENES.350_4

CH22_FGENES.367_15

CH22_FGENES.375_25

CH22_FGENES.378_5

CH22_FGENES.432_9

CH22_FGENES.432_14

CH22_FGENES.434_2

CH22_FGENES.452_3

CH22_FGENES.47;L8

CH22_FGENES.496J2

CH22_FGENES.526_11

CH22_FGENES.536_1

CH22_FGENES.570J23

CH22_FGENES.570_28

CH22_FGENES.571_7

CH22_FGENES.576_5

CH22_FGENES.583_4

CH22_FGENES.590_4

CH22_FGENES.595_2

CH22_FGENES.596_2

CH22_FGENES.604_4

CH22_FGENES.609_4

CH22_FGENES.611_7

CH22_FGENES.617_6

CH22_FGENES.619_7

CH22_FGENES.619_8

CH22_FGENES.619J1

CH22_FGENES.619J2

CH22_FGENES.635_3

CH22_FGENES.636_13

CH22_FGENES.636J6

CH22_FGENES.678_6

CH22_FGENES.679_4

CH22_FGENES.691_2

CH22_FGENES.691J>

CH22_FGENES.706J

CH22_FGENES.706_9

CH22_FGENES.823_38

CH22_FGENES.827J0

CH22_FGENES.829_6

CH22_FGENES.829_30

CH22_FGENES.834_7

CH22_FGENES.842JJ

CH22_FGENES.842J

CH22_FGENES.434

CH22_FGENES.367-13

CH22_EM:AC005500.GENSCAN.103-2

CH22_EM:AC005500.GENSCAN.127-9

CH22_EM:AC005500.GENSCAN.160-1

CH22_EM:AC005500.GENSCAN.341-6

CH22_EM:AC005500.GENSCAN.359-3

CH22_EM:AC005500.GENSCAN.432-1

CH22_EM:AC005500.GENSCAN.464-2

CH22_EM:AC005500.GENSCAN.475-3

CH22_DJ24607.GENSCAN.6-9

CH22_DA59H18.GENSCAN^

CH22_BA354M2.GENSCAN.29-7

CH22_BA232E17.GENSCAN.149



6.8 

4.4 

6.5 

9.3 

9.6 

155 

7.1 

6.7 

33.5 

18.5 

6.1 

6.1 

4.8 

5.1 

7 

12.4 

7.9 

18.8 

4.5 

5.3 

24 

7 

1£2 

8.4 

125 

6.7 

12.1 

13.9 

11.5 

175 

27.3 

19.2 

19.1 

4.5 

40.2 

34.3 

10.2 

6 

8.8 

5.9 

5.8 

11.6 

7.6 

6.3 

10.5 

5 

4.8 

13.6 

6.9 

21.4 

8.2 

9 

9.4 

19 

13.4 

15.2 

13.9 

8 

11.6 

10.3 

4.8 

6.7 

4.8 

5.1 

6.9 

4.3

Table 13A shows the accession numbers for those pkeys lacking UniGene IDs in Table 13.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.



Pkey: Unique Eos probeset identifier number

CAT number: Gene cluster number
Accession: Genbank accession numbers
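
For readers who want to work with Table 13A programmatically, the three-column layout described above (Pkey, CAT number, Accession) can be read with a short script. The following is only a minimal sketch, not part of the original disclosure: the names `ClusterRow` and `parse_table_13a` are hypothetical, and it assumes that each row begins with a numeric Pkey followed by the CAT number, and that any line not beginning with a number continues the accession list of the preceding row.

```python
from dataclasses import dataclass, field


@dataclass
class ClusterRow:
    """One Table 13A row: probeset key, gene cluster number, accessions."""
    pkey: int
    cat_number: str
    accessions: list = field(default_factory=list)


def parse_table_13a(lines):
    """Group whitespace-separated table lines into ClusterRow records.

    Assumption: a new row starts with an all-digit Pkey and has at least
    three tokens; any other non-empty line continues the previous row.
    """
    rows = []
    for raw in lines:
        tokens = raw.split()
        if not tokens:
            continue  # skip blank lines left over from page layout
        if len(tokens) >= 3 and tokens[0].isdigit():
            rows.append(ClusterRow(int(tokens[0]), tokens[1], tokens[2:]))
        elif rows:
            # continuation line: more accessions for the current cluster
            rows[-1].accessions.extend(tokens)
    return rows


sample = [
    "Pkey CAT number Accession",
    "123533 genbank_AA608751 AA608751",
    "116480 genbank_C14088 C14088",
    "129982 221_267 Z14221 AW381862 M97920",
    "AW403808 Z14200 AA383972",
]
rows = parse_table_13a(sample)
for row in rows:
    print(row.pkey, row.cat_number, len(row.accessions))
```

Because continuation lines are recognized only by the absence of a leading number, stray page headers or extraction debris inside a long accession list would be absorbed into that list, so the input should be cleaned first.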



Pkey CAT number Accession 



123619 371681_1 AA602964 AA609200

103207 30835_-4 X72790

103349 11052_-2 X89059

110856 19346_14 AA992380 M33063 N21418 H79958 R21911 H79957

123169 44573.2 ^ 7 ^ 8 *S 
25 r»M^13«flAM2^AI5e0251 AW874068 AL134043 AW235363 AA663345 AW008282 AA488964 

STiSm m^Z^mMmSSvZmmim AI763273 AWI73586 AW150329 AJ653832 

F00531 I H8348J I W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 F04005 
123533 genbank_AA608751 AA608751
116480 genbank_C14088 C14088
132225 genbank_AA128980 AA128980
125154 genbank_W38419 W38419

102919 25180 - 2 ffiS^wSKffi^ 

AC) AA075752 AW176066 AA287222 M195818 T20243 R87945 AA295539 AA402533 AA232419 AA224515AW4015raM331367 

40 Kl40AW^ 

AM54395 EawSwS A1557324 AA121576 H39128 R77161 AA019688 AA380987 A^1jOAA34B^7 
^M^aSmAA^ 

45 SSnSKSi AA455763 R66853 AA294935 H85911 AA310414 H93436 N87014 

AA379497 AA3118lT AA017206AA001137 AA017420 AA012990 AW163775 AA021397 AA295513 AA355248 M374921 

50 ^^MS^FWlOMmTAOmSZXi AL039209 AA093339 AA333961 N86416 C03325 C03831 , AA194750 

aS^35946^TO4M92 AM39400 AA330784 A1908970 N89223 AW361938 AI940063 AA1 34323 AA371741 F01267 
A^^^70 W^l19ra AA346098 AWJ72969 AA337549 AA327342 H93855 AW74079 F291 18 ^ 
SmM^37^C0^^m5 C03145 C05199 AA346024 AA190506 AW361659 AI909845 AW374374 AW374382 
« m^^m^^^nmmmrmMS^^ C02843AA375948 AW374414 AA213946 AI525039 

55 H1CT^1007 W112044 1 M1344MH4W35 AW177018 AA429768 AA336673 AA112875 H46393 AA191267 059131 

AW4M037 aS^^^ H30301 AA300705 H43788 AW364149 

S2T3aS36CMW1A°^& 

AA366197 AwSS2AW406461 C03092R0308H56488AA336320 AW4065O1 AA354102 AAM^ 
fin ATO76830 ^^3446FOia9AVTO81388T94036 AA379643 H43842AI524063 M621727M379()99AA371417^R6M11 

60 K)N84mFM1^ 

^21M6 AI192549 aSm H25967 AA918121 AK26060 H20221 AA812572 H42178 AA887222 H96000 C03180 F0O946 






C03986 AI318091 AI860172 AA582179 AI633388 AA557193 R68075 F24105 AW518239 W56622 AI625219 AI925243
AW468046 AI921828 AA339164 AI144391 AA643334 AA459631 AA873247 AW373432 AA604384 H27600 AI680458 AA159956 
AA610836 AA364298 AW373435 AA604225 W73754 AW087924 AA599776 N89227 AI630871 AI633128 AW514329 AA010455 
AA563928 AI571596 AI128394 W73707 AI423575 AA583809 AA657988 AI950837 AA169782 AA600009 AI885540 AA771884
AI978829 AA505408 AA533937 AA481469 AA610869 AA775241 AW273870 AW070909 AI905695 AA480115 AA574051
AI389185 AA773167 AA331375 AA001437 AA194324 AA194300 AA558632 AI038538 AA411329 AA781570 AI833176
AA935520 AW074197 AA583063 AW073099 AW001198 AA948025 AA587857 AA191540 AI460085 AA193244 AI538037
AA515572 AA758587 AI149311 AA508610 AA206409 AA534004 AA994600 AA827543 AI916349 AW245129 AW517804 D25663
AA781385 AA284536 AI819422 H16040 K27531 AA456564 AA845555 AI423598 AA012908 AA883439 AA716311 AA558858
AA320508 AA725731 AA834202 AA935997 AA724815 AA769353 AA594803 F00827 AI342442 AI003519 AKJ02503 AI347597 
AI040946 AA197162 AA987883 AA292865 AW001944 AI640711 AW244044 AA456784 F30588 AA290829 H24754 AI978683 
AA483686 AA583939 AA121382 AA833831 AA477102 AA977322 AA666379 F35456 AA993537 AI749610 AA226934 AA716204 
AW513025 AA628543 AA583705 F25702 AI368748 AI124097 AI880086 AA477513 AI758834 AI690753 AA477746 F37761 
AA642243 AA159957 AA250844 AA459406 AA427566 F25054 AI569314 AA961665 AI922050 AI759000 AA555236 AA514432 
AA293474 AA001129 AA826789 AA641390 AA134405 F35585 AA477416 AW193359 AI361315 AA284988 F36340 AI361322 
F26969 AA991922 AA021267 F26973 AI361314 F35891 AI918509 AA250964 AA190992 AA577139 AA855535 AA134324 
AW192842 AI224046 F18975 AA779626 AA856894 AW269997 AW014614 H95554 F31378 AA374868 F26343 AA654007
AI830942 AA113195 F26432 W56652 AA464690 AA055263 AA340654 AA031448 AA976399 AA972526 AA063476 R83921
T16240 AA533290 N91545 H44053 AA883451 AA513761 AA086477 H09249 F20482 F26737 AA054148 AA857063 AA017259
AA179789 AA088908 H43704 AA194320 F35950 AI880127 F20441 F32878 AA962483 H39094 H56489 H44621 F19390 
AJ302232 F25162 AA826965 AA086052 AA917410 AA454513 R59554 AA196755 AA086369 AA079530 H28106 AA243301 
A1025737 AA101239 AA088887 AI214910 AA974886 F16089 F26054 AA515092 F33436 F32829 M78061 AA235645 F19715 
F37529 AI811549 AA665180 AA708200 F01124 F32382 AA346220 AA627361 F30741 F30010 F28543 AA211715 F20245 
AA331222 F25634 F21996 W28215 F2191 1 R65793 A1192566 H20130 H84491 AA719223 AA557435 F16967 F26989 F30353 
AA857159 AA291918 F28234 F20840 F25176 F22437 T27904 AA480355 F19528 R87926 H142B6 F27532 AW337864 F2841 1 
H13692 F25651 AA975454 F24229 F29657 F18024 AA464779 F17588 F34954 AA947328 F18063 AA657777 AA459644 N91455 
F27850 F29608 F27206 F18418 F31459 F18564 F33496 F16376 F29740 F29843 F29904 F29866 F19135 AW276602T40337 
F24835 F34672 F26474 A1926215 AA464185 F18217 N26193 AL043256 T41 197 F33055 F00388 F295O0 F34191 F33297 
A1937207 F22724 F15909 F26232 F18889 AA318627 F29085 AA872104 F17509 F23373 F15660 F17552 F17412 F16863 
F34033 F21515 F17364 F18383 F16546 F17561 F17260 AA292000 F15723 T47438 F16798 F18046 F1 8319 F17978 F17566 
F3423Q F33258 F20860 F17998 AI695701 



129982 221_267 Z14221 AW381862 M97920 AW401444 Z66542 M29470 AW406502 X61011 M34024 AA327072 Z14166 Z14167 Z14165 



AW403808 Z14200 AA383972 Z14205 Z14201 M18513 Z14202 AW403684 X14584 AF062221 U43760 X65892 X65883 X62107 
Z80847 X65885 X65893 AF062142 X65891 X17675 Z47274 Z47277 Z47276 X65888 Z47275 X62109 AF082140 L01278 
AF062134 AF062139 X81723 Z8084O X81733 XB1743 X81744 X81732 Z80843 AW402942 AW403516 X65919 AF062190 
AF062177 AF062222 AF0621 15 Z47240 AF062263 AF062261 AF062223 AF06221 1 Z47238 AW401714 AW404008 AW404991 
AF062280 M99595 Z47214 Z47232 Z47218 M26995 AF062184 X65895 L38433 X81731 Z11946 Z47226 AF062205 AF174012 
L01276 AF062168 AF062136 X81755 X81748 AF174019 AF062285 L06924 L1 1699 AW402665 L09070 128049 L08090 
AW407843 Z14171 AW402944 L08083 AW405627 L33035 L26907 M17750 L28052 M17751 AJ239360 U19885 L14821 X56526 
AB014341 L12087 L12098 U68231 L12184 AF062242 AF022000 U64499 U00570 AJ268604 Y15773 X64239 X62969 U00506 
X73605 X99360 U00577 D83677 AB021539 AF035796 Z33899 U00588 AJ239353 AF174062 Z33901 X98899 AF174058 X63080 
D83676 AB021529 Z18318 U00488 L01412 X81746 U21262 U21272 U00560 AF174060 U00547 U00561 Z18321 S73957 
M26435 AF1 15130 AF1 151 17 Z92896 U21254 Z92895 AF115112 AF062290 T28938 Y09386 AF174067 U27189 Y09384 
U77373 AF174057 M17749 X69692 AF174038 U64478 U64486 AF174098 X99365 AF174077 AF174083 AF174089 AF015133 
AF127792 AJ008171 U00510 U58144 AF004323 AF1 15109 AF004324 AF015130 AF090414 AF090418 AB021536 AB021519 
Z96957 AF021986 Z92898 Z98956 L34164 AF062251 AF052524 U00549 U00541 D83687 AF052522 AF087420 AF052520 
AF022011 L43085 U00536 L19915 L43083 AF090420 AB021530 AB021534 Z96955 Z96954 Z33903 X62964 AF103282 U00509 
AF062298 AF062289 AF062307 AW408326 L33037 L04337 L04328 X81742 L04336 L03385 AF032360 S56184 AF062191 
AF191092 AI906954 /*103184 L04343 AW364860 AF001424 AF103163 Z98717 AF103143 AJ008250 L04323 AF103321 
AF103335 L04342 L03818 L03B17 AW404978 AW403690 UB6801 ^103150 AJ010334 AF035027 AJ007327 AF103115 
AF017458 AJ008207 AJ008183 AJ008196 AJ008241 AJ008208AF103210 AF068668 AF068670 AF068671 AF068664 
AF068669 AF0S8666 AF06B665 AF068672 AW371244 AW403670 AW408074 AW404575 AW362153 AW403803 AW406702 
AW351514 D78345 T29140 J00231 NMJ002179 AW405146 AA301091 X04646 H64660 AW402990 AW408534 T93007 
A1857980 AW368899 AI905833 AW406586 AA482084 A1872299 AA715266 AW404328 AI831674 AI709348 AA6031 12 
AW514864 AA485775 H64492 AW404789 AA487630 M715498 AA295885 T27613 T981 13 



118600 genbank_N69222 N69222
118952 genbank_N92966 N92966
120873 genbank_AA358015 AA358015
113702 genbank_T97307 T97307
115763 genbank_AA421560 AA421560
124357 genbank_N22401 N22401
108733 504187_1 AA121022 AA126422
101544 entrez_M31169 M31169
124447 genbank_N48000 N48000
124677 genbank_R01073 R01073
124777 genbank_R41933 R41933
119302 genbank_T25725 T25725
103680 entrez_Z93784 Z93784
135029 H58818_at H58818
112253 genbank_R51818 R51818
Pkey CAT number Accession
322175 46877_1 AF085975 H53458 H53459
323011 139750_1 AA580288 AA315655 AA133031 AA377748
322975 1510563_1 C16391 C16413
317210 211994_1 AW881145 AA490718 M85637 AA304575 T06067 AA331991
323817 233566_1 AA410943 AW948953 AA334202 AA332882
309583 1046029_-2 AW170035
324961 376239_1 AA613792 AW182329 T05304 AW858385
303502 325188_1 BE174240 M488528 AL042253
320133 447553_1 BE151746 BE336853 D63271 T94955 AA774994
311935 174129_1 AA216387 T63548 AA228676
321050 502195_1 C05928 AW393497
319977 345248_1 AA534222 AA632632 T81234
312772 4380_7 AW962299 AA310349 AW962294 H63791 H63751
321354 116028_-2 AA078493

336512 CH22_3941FG_834_7_LINK_DJ
336558 CH22_3992FG_842_3_LINK_DJ
336560 CH22_3994FG_842_5_LINK_DJ
329367 cjehs
329373 cjehs
336676 CH22_4154FG_43 4.
338008 CH22_6490FG_LINK_EM:AC00
338057 CH22_6558FG__UNK_EM:AC00
329655 c14_p2
336959 CH22.4764FG.367 13
329899 c15_p2
329960 c16_p2
338410 CH2ZJ067FG__UNK-EM:AC00
338451 CH22J124FG__UNK_EM*CO0
338588 CH22_7331FG_UNK_EM:AC00
338665 CH22_7438FG_UNK_EM:AC00
338689 CH22_7464FG_UNK_EM:AC00
308677 AI761173
338832 CH22_7678FG__UNKJX)246D7
338980 CH2^.7859FG_UNK-OA59H18
333009 CH22_233FG_61JJJNK_EMA
333010 CH2^234FGJUJJNK_EM:A
333013 CH2a_237FG.61.5.UNK»EMA
308981 AI873242
308995 AI880172
333108 CH22_336FG_79J4_UNKJM:
333139 CH22_368FG_83J6_UNK_EM:
333254 CH22.495FGJ18.UINICEM:
333305 CH22_550FG_137_2JJNK_EM:
333343 CH22_589FGJ39J2JUNK_EM
333388 CH22_634FGJ44_3_.UNK.EM:
326213 c17_hs
333456 CH2ZJ06FGJ57_5_LINK_EM:
333459 CH22.709FGJ57_8_UNK.EM:
333517 CH22_773FGJ73_2JJNK_EM:
333585 CH22_846FG_203_4_LINK_EM:
333679 CH22_941FGJ47_6 UNK.EM:
326474 c19_hs
333743 CH2^_1009FG_264_1JJNK_EM
333758 CH22_1024FG_268_UINK.EM
333767 CH22J034FGJ27UJJNK.EM
333768 CH2*_1035FG_271J LIMK_EM
333769 CH22_1036FG_271_8_LINK_EM
333795 CH22_1063FG_275_1_LINK_EM
333796 CH22J065FG_275 3JJNKJEM
335004 CH22_2326FG_472 J JJHK.BA
333892 CH22_1163FG_292_14JUNK_E
335115 CH22_2447FG_496J.UNIUM
333904 CH22_1176FG_294JLUNK_EM
333905 CH22J177FGJ94JJJNK.EM
333921 CH22J194FG 296J2J.INK.E
333968 CH22_1245FG_307_4_LINK_EM
326059 c_6_hs
335287 CH22_2629FG_526JUtNK_E
326816 c20_hs
326817 c20_hs
335342 CH22J689FG_536_1_LINK_EM
335491 CH22J843FGJ70_23_UNK_E
335495 CH22J847FG_570_28JJNK_E
335498 CH22_285QFG_571_7JJNK_EM
328304 cJJs
305453 AA738110
335544 CH22JB99FG_576_5_UNK_EM
335610 CH22_2969FG.583_4_UNK.EM
335653 CH22_3013FG.590_4_LINK_EM
335682 CH22_3043FG_595J_UNK_EM
335687 CH2a.3048FGJ96J_LINK.EM
328492 c_7_hs
335755 CH22J3122FG_604_4_LINK_EM
335782 CH22J151FG_609JJJNK.EM
335791 CH22_3160FG_611_7_LINK_EM
335809 CH2*J181FG_617J_UNK_EM
335822 CH22_3195FG_619_7_LINK_EM
335823 CH22J196FGJ19JJJNK.EM
335824 CH22_3197FG_619_11J.INK_E
335825 CH22J198FGJ19J2JL1NK.E
335895 CH22J272FG_635J_UNK_EM
335917 CH22_3294FG_636J3_LINK_E
335920 CH22_3297FG_636_16_UNK_E
305898 AA872838
305913 AA876109
305950 AA884479
328857 c_7_hs
330084 c19_p2
337968 CH22J419FG__UNK_EM:AC00
309177 AI951118
309198 AI955915
309226 AI969897
339352 CH22J317FGJ.UNK.BA354I1
309279 AI990102
339373 CH22_8348FG__LINK3A232E1
325622 c14_hs
334102 CH22_1380FG_327_60_UNK_E
332927 CH22_148FG_38JJJNK_C2QH
332929 CH22_150FG_38_3_UNK_C20H
332930 CH22_151FG_38JJJNK-C20H
332955 CH22_179FG_48_12_UNKJEM:
332958 CH22_182FG_48_15_LINK_EM:
332961 CH22_185FG_48_18_UNKJEM:
332983 CH2*jM7FG_54_5_UNK_EM:A
334222 CH22_1506FG_360_3JJNK_EM
334223 CH22_1507FG_360_4_LINK_EM
334264 CH22_1551FG_367_15_LINK_E
327110 c21_hs
334343 CH22J636FGJ75_25_UNK_E
334360 CH22L1654FGJ78.5_UNK.EM
327196 cj_hs
327283 c_1_hs
327313 cJJ*
304465 AA421948
304507 AA456426
327450 cj_hs
304591 AA505702
304601 AA507875
304659 AA533185
334784 CH22J096FGJ32J_UNK_EM
334789 CH22_2101FG_432_14_LINK_E
334794 CH22J106FG_434_2J_INK_EM
336035 CH22_3420FG_678_5JJNKJ)J
336042 CH22_3427FG J79_4_LiNK_DJ
336093 CH22_3481FGJ91J_UNKJ)J
336096 CH22J484FGJ91 JJJNKJOI
334889 CH22_2206FGJ52J_UN1CEM
336150 CH22_354QFG_706_6_UNK_DA
336152 CH22_3543FG_706_9_UNK_DA
336416 CH2£_3833FG_823_38_UNK_B
336444 CH22J864FG_827J0_UNK_D
336449 CH2^3870FGJ29 JJJNKJXJ
336471 CH22.3894FG_829J0.UNiU)
TABLE 13B

Table 13B shows the genomic positioning for those pkeys lacking unigene ID's and
accession numbers in Table 13. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also
listed.

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position



332955 Dunham, I. et al. Plus
332958 Dunham, I. et al. Plus
332961 Dunham, I. et al. Plus
333139 Dunham, I. et al. Plus
333254 Dunham, I. et al. Plus
333305 Dunham, I. et al. Plus
333386 Dunham, I. et al. Plus
333517 Dunham, I. et al. Plus
333585 Dunham, I. et al. Plus
333679 Dunham, I. et al. Plus
333767 Dunham, I. et al. Plus
333768 Dunham, I. et al. Plus
333769 Dunham, I. et al. Plus
333795 Dunham, I. et al. Plus
333796 Dunham, I. et al. Plus
333892 Dunham, I. et al. Plus
333921 Dunham, I. et al. Plus
333968 Dunham, I. et al. Plus
334102 Dunham, I. et al. Plus
334264 Dunham, I. et al. Plus
334343 Dunham, I. et al. Plus
334794 Dunham, I. et al. Plus
334889 Dunham, I. et al. Plus
335287 Dunham, I. et al. Plus
335491 Dunham, I. et al. Plus
335495 Dunham, I. et al. Plus
335498 Dunham, I. et al. Plus
335653 Dunham, I. et al. Plus
335687 Dunham, I. et al. Plus
335809 Dunham, I. et al. Plus
335822 Dunham, I. et al. Plus
335823 Dunham, I. et al. Plus
335824 Dunham, I. et al. Plus
335825 Dunham, I. et al. Plus
336035 Dunham, I. et al. Plus
336093 Dunham, I. et al. Plus
336096 Dunham, I. et al. Plus
336444 Dunham, I. et al. Plus
336959 Dunham, I. et al. Plus
338008 Dunham, I. et al. Plus
338057 Dunham, I. et al. Plus
338410 Dunham, I. et al. Plus
338588 Dunham, I. et al. Plus
338665 Dunham, I. et al. Plus
338832 Dunham, I. et al. Plus
338980 Dunham, I. et al. Plus
339352 Dunham, I. et al. Plus



2516164-2516310 
2521424-2521555 
336949W369571 
2521424-2521555
4630388-4630645 
4913749-4913805 
5570729-5570925 
6234778-6234894 
7068795-7068896 
7694407-7694623 
7695440-7695697 



7807688-7807795 

7808253-7808319 

8156825-8157001

8380325-8380441 

8681004-8681241 

9995140-9996373 

13234447-13234544 

13655828-13656307 

16374312-16374458 

19286024-19286515 

22299047-22299299 

24128651-24128827 

24140688-24140872 

24172082-24172161 

25329710-25329802 

25445952-25446064 

26310772-26310909 

26364087-26364196 

26365925-26366004 

26376660-26376942 

26378175-26378268 

29016748-29017410

29556922-29557002 

29578878-29579047 

3419058*34190718 

13233040-13233126 

7697068-7697236 

8526397-8526522 

19292807-19292916 

22896767-22896920 

24472654-24472853 

27775128-27775290 

29896789-29896874 

33544784-33545121
332929 Dunham, I. et al. Minus
332930 Dunham, I. et al. Minus
332983 Dunham, I. et al. Minus
333009 Dunham, I. et al. Minus
333010 Dunham, I. et al. Minus
333013 Dunham, I. et al. Minus
333108 Dunham, I. et al. Minus
333343 Dunham, I. et al. Minus
333456 Dunham, I. et al. Minus
333459 Dunham, I. et al. Minus
333743 Dunham, I. et al. Minus
333758 Dunham, I. et al. Minus
333904 Dunham, I. et al. Minus
333905 Dunham, I. et al. Minus
334222 Dunham, I. et al. Minus
334223 Dunham, I. et al. Minus
334360 Dunham, I. et al. Minus
334784 Dunham, I. et al. Minus
334789 Dunham, I. et al. Minus
335004 Dunham, I. et al. Minus
335115 Dunham, I. et al. Minus
335342 Dunham, I. et al. Minus
335544 Dunham, I. et al. Minus
335610 Dunham, I. et al. Minus
335682 Dunham, I. et al. Minus
335755 Dunham, I. et al. Minus
335782 Dunham, I. et al. Minus
335791 Dunham, I. et al. Minus
335895 Dunham, I. et al. Minus
335917 Dunham, I. et al. Minus
335920 Dunham, I. et al. Minus
336042 Dunham, I. et al. Minus
336150 Dunham, I. et al. Minus
336152 Dunham, I. et al. Minus
336416 Dunham, I. et al. Minus
336449 Dunham, I. et al. Minus
336471 Dunham, I. et al. Minus
336512 Dunham, I. et al. Minus
336558 Dunham, I. et al. Minus
336560 Dunham, I. et al. Minus
336676 Dunham, I. et al. Minus
337968 Dunham, I. et al. Minus
338451 Dunham, I. et al. Minus
338689 Dunham, I. et al. Minus
339373 Dunham, I. et al. Minus
325622 5867000 Plus
329655 6448516 Minus
329899 6563505 Minus
329960 5091594 Minus
326213 5867224 Minus
326474 5867405 Plus
330084 6015302 Minus
326816 6552458 Plus
326817 6552458 Plus
327110 6117842 Plus
327196 5867446 Plus
327283 5867478 Minus
327313 5867501 Minus
327450 5867766 Minus
328059 6117819 Plus
328492 5868455 Minus
328304 6004478 Minus
328857 6381927 Minus
329367 5868842 Minus
329373 6682537 Minus



2020758-2020664 

2022565-2022497 

2631933-2631797 

2766043-2765856 

2766207-2766119 

2772278-2772039 

3240494-3240389 

4692886-4692753 

2631933-2631797 

5144548-5144344 

7573218-7573060 

7666413-7666091 

8217374-8217261 

8217796-8217670 

12732417-12732289 

12734365-12734269 

13728850-13728751 

16294548-16294360 

16306095-16305996 

20581911-20581794 

21388250-21388146 

22597448-22597284 

24650505-24650403 

25068943-25068841 

25421215-25421093 

25763806-25763747 

25908578-25908440 

25948563-25948411 

26975307-26975239 

27028481-27028377 

27034927-27034811 

29041694-29041500 

30150423-30150256 

30156053-30155870 

34047408-34047311 

34204707-34204577 

34215091-34214978 

34278373-34278275

34375825-34375698

34376814-34376596 

2022565-2022497 

7095797-7095680 

20174286-20174193 

24893073-24892972 

33860127-33860047 

69994-70075 

35565-35843 

111058-111783 

1031-1162 

60751-60927 

16995-18101 

57019-59337 

198354-198436 

199909-200001 

94608-94785 

180921-181333 

567-562 

69734-89833 

4792M8076 

3705247204 

4609446241 

3884-3952 

80557-81051 

8720147587 

3895039301 
TABLE 14: Table 2 from BRCA 001-5 US

Table 14, a subset of Table 13, depicts a preferred group of genes highly upregulated in breast
cancer cells.

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of tumor to normal breast tissue
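
As an illustration of the R1 definition only, the following minimal Python sketch computes a tumor-to-normal ratio per probeset and keeps the probesets at or above a cutoff. The probeset names, expression values, the one-decimal rounding, and the cutoff of 15.0 are hypothetical assumptions chosen for the example; they are not taken from the tables, and the text does not state how expression was measured or which threshold was applied.

```python
def expression_ratio(numerator: float, denominator: float) -> float:
    """Return numerator / denominator, rejecting non-positive denominators."""
    if denominator <= 0:
        raise ValueError("denominator expression must be positive")
    return numerator / denominator


def select_by_ratio(expression: dict, cutoff: float) -> dict:
    """Map each probeset to its tumor/normal ratio, keeping ratios >= cutoff."""
    selected = {}
    for pkey, levels in expression.items():
        # Rounding to one decimal is an assumption that mirrors the table layout.
        ratio = round(expression_ratio(levels["tumor"], levels["normal"]), 1)
        if ratio >= cutoff:
            selected[pkey] = ratio
    return selected


# Hypothetical expression levels in arbitrary units; not data from Table 14.
example_expression = {
    "probeset_A": {"tumor": 318.0, "normal": 20.0},
    "probeset_B": {"tumor": 45.0, "normal": 30.0},
}

upregulated = select_by_ratio(example_expression, cutoff=15.0)
print(upregulated)
```

For a table of downregulated genes, where R1 is the ratio of normal tissue to tumor, the same helper could be called with the two arguments swapped.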



Pkey ExAccn UnigeneID Unigene Title R1

100038 M97935 AFFX control: STAT1 16.7
100114 D00596 Hs.82962 thymidylate synthetase 15.9
100975 J02923 Hs.76506 lymphocyte cytosolic protein 1 (L-plastin) 30.1
101031 J05070 Hs.151738 matrix metalloproteinase 9 (gelatinase B; 9 37.2
101104 L07615 Hs.169266 Human neuropeptide Y receptor Y1 (NPYY 18.3
101143 L12723 Hs.90093 heat shock 70kD protein 4 17.4
101332 L47276 Hs.156346 Homo sapiens (cell line HL-6) alpha topois 18.9
101378 M13755 Hs.833 interferon-stimulated protein; 15 kDa 18.1
101809 M86849 Hs.323733 Homo sapiens connexin 26 (GJB2) mRNA 22.5
102618 U65932 Hs.81071 extracellular matrix protein 1 23.2
102721 U79241 Hs.118666 Human clone 23759 mRNA; partial cds 15
102817 U90904 Hs.83724 Human clone 23773 mRNA sequence 15.2
102907 X06985 Hs.202833 heme oxygenase (decycling) 1 22.7
102985 X17644 Hs.2707 G1 to S phase transition 1 20.6
103060 X57766 Hs.155324 matrix metalloproteinase 11 (stromelysin 3 17.8
103180 X69433 Hs.5337 isocitrate dehydrogenase 2 (NADP+); mito 18.9
103206 X72755 Hs.77367 monokine induced by gamma interferon 15.1
103821 AA157623 Hs.198793 KIAA0750 gene product 23.3
104115 AA428090 Hs.26102 ESTs 28.7
104667 AA007234 Hs.30098 ESTs 16.6
105186 AA191512 Hs.28005 Homo sapiens mRNA; cDNA DKFZp564G 19.3
106103 AA421104 Hs.12094 ESTs 15.4
107151 AA621169 Hs.8687 ESTs 19
109415 AA227219 Hs.110826 trinucleotide repeat containing 9 20.1
110189 H20543 Hs.6278 DKFZP586B1621 protein 16.6
110561 H59617 Hs.5199 ESTs; Weakly similar to UBIQUITIN-CON 195
110734 H98714 Hs.24131 ESTs 302
110915 N46252 Hs.29724 ESTs 23.2
111179 N67239 Hs.10760 ESTs 37
111357 N91023 Hs.87128 ESTs 15
112134 R46025 Hs.7413 ESTs 17.4
113970 W86748 Hs.8109 ESTs 15
114124 Z38595 Hs.125019 ESTs; Highly similar to KIAA0886 protein 22
114292 Z40715 Hs.184641 delta-6 fatty acid desaturase 19.4
114901 AA236276 Hs.196437 ESTs; Weakly similar to R26660_1; partial 16.9
114965 AA250737 Hs.72472 ESTs 35.1
115652 AA405098 Hs.38178 ESTs 16.1
115875 AA433943 Hs.43946 ESTs; Weakly similar to Weak similarity t 33.5
116790 H29532 Hs.101174 microtubule-associated protein tau 222
116921 H72948 Hs.821 biglycan 20.7
117412 N26722 Hs.42645 ESTs 18.1
120241 Z41815 Hs.65946 ESTs 15.6
120325 AA195651 Hs.104106 ESTs 152
121598 AA416740 Hs.174104 ESTs 22.6
123619 AA609200 ESTs 23.1
124006 D60302 Hs.270016 ESTs 20.6
125852 H09290 Hs.76550 Homo sapiens mRNA; cDNA DKFZp564B 25.9
126160 N90960 Hs.265398 ESTs; Weakly similar to transformation-rel 16.4
127677 AA916752 Hs.264190 ESTs; Highly similar to MEM3 [M.muscul 17.3
128595 U31875 Hs.152677 short-chain alcohol dehydrogenase family m 27.1
128717 T30617 Hs.104222 Homo sapiens mRNA; cDNA DKFZp566L 24.5
129124 AA234530 Hs.108802 N-ethylmaleimide-sensitive factor 20.7
129366 H18027 Hs.184697 plexin C1 182
130455 X17059 Hs.155956 N-acetyltransferase 1 (arylamine N-acetylt 26.4
130604 X03635 Hs.1657 estrogen receptor 1 m
130913 W03592 Hs.21198 translocase of outer mitochondrial membra 20.9
130944 M97935 Hs.21486 signal transducer and activator of transcript 18.8
131472 AA608962 Hs.27258 calcyclin binding protein 18.1
131562 U90551 Hs.28777 H2A histone family; member L 18.8
132180 AA405569 Hs.418 fibroblast activation protein; alpha; seprase 15.4
132406 F09979 Hs.4774 ESTs 15
132465 AA047896 Hs.49169 ESTs 15.4
132994 AA505133 Hs.279905 solute carrier family 2 (facilitated glucose t 26.4
133294 R79723 Hs.69997 zinc finger protein 238 30.4
133634 U24166 Hs.234279 microtubule-associated protein; RP/EB fam 152
134374 D62633 Hs.8236 ESTs 152
134405 J04177 Hs.82772 collagen; type XI; alpha 1 15.3
134470 X54942 Hs.83758 CDC28 protein kinase 2 20.3
134495 D63477 Hs.84087 KIAA0143 protein 16.1
134714 U89922 Hs.890 lymphotoxin beta (TNF superfamily; memb 35.7
135237 AA454930 Hs.9691 ESTs 19.5
301884 AA312082 Hs.105445 GDNF family receptor alpha 1 20.7
302276 NM_004448 Hs.323910 EST cluster (not in UniGene) with exon hit 21.6
302290 AL117607 Hs.175563 Homo sapiens mRNA; cDNA DKFZp564N 41.4
309177 AI951118 EST singleton (not in UniGene) with exon 24.3
309583 AW170035 EST 645
310438 AW022192 Hs.200197 ESTs 39.1
311166 AI821294 Hs.118599 ESTs 24.1
312153 AA759250 Hs.153028 cytochrome b-561 27.1
313915 AI969390 Hs.163443 ESTs 27.1
314506 AA833655 Hs.206868 ESTs 27.8
314558 AI873274 Hs.190721 ESTs 225
314691 AW207206 Hs.136319 ESTs 21.4
314943 AI476797 Hs.184572 cell division cycle 2; G1 to S and G2 to M 18.4
315196 AA972756 Hs.44898 ESTs 28.8
316177 AI908272 Hs.293102 EST cluster (not in UniGene) 32.6
318073 AW167087 Hs.131562 ESTs 15.7
318662 AI285898 Hs.294014 ESTs 16.3
318740 NM_002543 Hs.77729 EST cluster (not in UniGene) 21.3
318744 AI793124 Hs.144479 ESTs 35
319668 NM_002731 Hs.87773 EST cluster (not in UniGene) 25.4
320074 AA321166 Hs.278233 EST cluster (not in UniGene) 16.7
320211 AL039402 Hs.125783 DEME-6 protein 24.3
320727 U96044 Hs.181125 EST cluster (not in UniGene) 15.3
322818 AW043782 Hs.293616 ESTs 21
322882 AW248508 Hs.279727 DiGeorge syndrome critical region gene 2 15.3
324261 AUM4891 Hs.269350 EST cluster (not in UniGene) 50.1
324432 AA464510 Hs.152812 EST cluster (not in UniGene) 16.7
324603 AW016378 Hs.292934 ESTs 23.1
324620 AA448021 Hs.94109 EST cluster (not in UniGene) 212
324988 T06997 Hs.121028 EST cluster (not in UniGene) 245
330388 X03363 HER2 receptor tyrosine kinase (c-erbB-2; E 17.7
330486 M13755 Hs.833 interferon-stimulated protein; 15 kDa 67
330814 AA015730 Hs.265398 ESTs; Weakly similar to transformation-rel 44.1
331145 R72427 Hs.129873 ESTs; Weakly similar to CYTOCHROME 41.9
331306 AA252079 Hs.63931 dachshund (Drosophila) homolog 15.1
331890 AA432166 Hs^577 succinate dehydrogenase complex; subunit 243
332526 AA281753 Hs.77515 inositol 1;4;5-triphosphate receptor; type 3 19
332532 N63192 Hs.1892 EST; Highly similar to PHENYLETHANO 15.3
332694 AA262768 Hs.243901 KIAA1067 protein 152
332958 CH22_FGENES.48_15 17.8
333769 CH22_FGENES.271_8 m
333968 CH22_FGENES.307_4 15.9
334223 CH22_FGENES.360_4 335
334264 CH22_FGENES.367_15 185
335791 CH22_FGENES.611_7 27.3
336512 CH22_FGENES.834_7 21.4

338008 CH22^EMAC005500.GENSCAN.127^ 15.2 
TABLE 14A

Table 14A shows the accession numbers for those pkeys lacking unigeneID's for Table 14.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.



Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey CAT number Accession

309583 1046029_-2 AW170035
336512 CH22_3941FG_834_7_LINK_DJ
338008 CH22_6490FG_LINK_EM:AC00
333769 CH22_1036FG_271_8_LINK_EM
333968 CH22_1245FG_307_4_LINK_EM
335791 CH22_3160FG_611_7_LINK_EM
309177 AI951118
332958 CH22_182FG_48_15_LINK_EM:
334223 CH22_1507FG_360_4_LINK_EM
334264 CH22_1551FG_367_15_LINK_E
123619 371681_1 AA602964 AA609200
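
The Pkey / CAT number / Accession layout described above lends itself to a simple whitespace-delimited parse. The sketch below is illustrative only: the function name `parse_cluster_rows` is hypothetical, the sample rows assume the CAT numbers read `1046029_-2` and `371681_1`, and rows that carry a single identifier after the Pkey are stored with an empty accession list, since the table does not say whether that identifier is a cluster label or an accession.

```python
def parse_cluster_rows(lines):
    """Parse 'Pkey CAT_number Accession...' rows into a dictionary.

    Returns {pkey: (second_field, [accessions])}. The second field is kept
    verbatim; any remaining fields are treated as Genbank accessions.
    """
    clusters = {}
    for line in lines:
        fields = line.split()
        if len(fields) < 2:
            continue  # skip blank or incomplete rows
        pkey = int(fields[0])
        clusters[pkey] = (fields[1], fields[2:])
    return clusters


# Sample rows in the layout of the table above (readings are assumptions).
sample_rows = [
    "309583 1046029_-2 AW170035",
    "123619 371681_1 AA602964 AA609200",
    "309177 AI951118",
]

clusters = parse_cluster_rows(sample_rows)
for pkey, (cat_number, accessions) in clusters.items():
    print(pkey, cat_number, accessions)
```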
TABLE 14B

Table 14B shows the genomic positioning for those pkeys lacking unigene ID's and
accession numbers in Table 14. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also
listed.

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position



332958 Dunham, I. et al. Plus 2516164-2516310
333769 Dunham, I. et al. Plus 7696625-7696707
333968 Dunham, I. et al. Plus 8681004-8681241
334264 Dunham, I. et al. Plus 13234447-13234544
338008 Dunham, I. et al. Plus 7697068-7697236
334223 Dunham, I. et al. Minus 12734365-12734269
335791 Dunham, I. et al. Minus 25948563-25948411
336512 Dunham, I. et al. Minus 34278373-34278275
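
The rows above can be read programmatically. The following is a minimal sketch, assuming each row is whitespace-delimited with the strand and the position range as its last two fields, and assuming the coordinates are inclusive; neither assumption is stated in the table. The names `ExonRow`, `parse_exon_row`, and `exon_length` are hypothetical. Note that minus-strand rows list the higher coordinate first, so the length is taken from the absolute difference.

```python
from typing import NamedTuple


class ExonRow(NamedTuple):
    pkey: int
    ref: str
    strand: str
    start: int
    end: int


def parse_exon_row(line: str) -> ExonRow:
    """Split a 'Pkey Ref Strand start-end' row into its fields."""
    parts = line.split()
    pkey = int(parts[0])
    strand = parts[-2]
    start_text, end_text = parts[-1].split("-")
    ref = " ".join(parts[1:-2])  # the reference may contain spaces
    return ExonRow(pkey, ref, strand, int(start_text), int(end_text))


def exon_length(row: ExonRow) -> int:
    """Length in nucleotides, assuming inclusive coordinates."""
    return abs(row.end - row.start) + 1


plus_row = parse_exon_row("332958 Dunham, I. et al. Plus 2516164-2516310")
minus_row = parse_exon_row("334223 Dunham, I. et al. Minus 12734365-12734269")
print(plus_row.strand, exon_length(plus_row))
print(minus_row.strand, exon_length(minus_row))
```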
TABLE 15: Table 3 from BRCA 001-5 US

Table 15 shows genes downregulated in breast cancer cells.

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of normal breast tissue to tumor



Pkey ExAccn UnigeneID Unigene Title R1

100115 D00632 Hs.172153 glutathione peroxidase 3 (plasma) 1.7
100499 TIGR:HT1428 Hs.283108 Globin, Beta 1.5
100502 TIGR:HT1496 Hs.169228 Adrenal-Specific Protein Pg2 2.3
100815 TIGR:HT4268 Hs.9739 L-Glycerol-3-Phosphate:NAD+ Oxidoreduct 1.7
101125 L10373 Hs.82749 transmembrane 4 superfamily member 2 1.5
101367 M12963 alcohol dehydrogenase 1 (class I); alpha po 2.9
101397 M15856 Hs.180878 lipoprotein lipase 1.6
101883 M98399 Hs.75613 CD36 antigen (collagen type I receptor; thr 1.6
102227 U25138 Hs.93841 potassium large conductance calcium-activ 1.6
102857 X00129 Hs.76461 retinol-binding protein 4; interstitial 3
103211 X73079 Hs.288579 polymeric immunoglobulin receptor 1.8
103496 Y09267 Hs.132821 flavin containing monooxygenase 2 1.5
103562 Z2mQ Hs.2815 POU domain; class 6; transcription factor 1 1.8
104672 AA007629 glycerol-3-phosphate dehydrogenase 1 (sol 24
105083 AA146619 Hs.18791 ESTs; Weakly similar to CALCIUM-BIND 1.7
105138 AA164519 Hs.15248 ESTs 1.5
106075 AA417915 Hs.25930 ESTs 1.5
106870 AA487576 Hs.26530 serum deprivation response (phosphatidyls 1.6
107099 AA609645 Hs.211568 eukaryotic translation initiation factor 4 gam 2.7
107616 AA004901 Hs.261164 ESTs 1.6
107997 AA037388 Hs.82223 Human DNA sequence from clone 141H5 o 1.7
108604 AA099820 Hs.49696 ESTs 24
111130 N64265 Hs.19515 yz44M2.s1 Morton Fetal Cochlea Homo sa 1.7
111837 R36447 Hs.24453 ESTs 1.6
112538 R70255 ESTs 1.9
112608 R97970 Hs.281022 EST 1.5
113086 T40652 Hs.209100 DKFZP434C171 protein 1.9
115740 AA418033 Hs.283559 ESTs 1.6
115949 AA443800 Hs.43125 ESTs 2
115965 AA446661 Hs.173233 ESTs 2.2
117224 N20300 Hs.218707 ESTs 1.7
117513 N32174 Hs.44317 SRY (sex-determining region Y)-box 10 1.7
119059 R15436 Hs.77889 Friedreich ataxia region gene X123 1.7
119175 R71792 Hs.301002 ESTs; Weakly similar to cell death activato 2.8
119359 T71021 Hs.285681 ESTs; Highly similar to WS basic helix-loo 1.9
119798 W73386 Hs.249129 ESTs 3
120889 AA365784 Hs.97044 ESTs 1.6
121381 AA405747 Hs.97984 ESTs; Weakly similar to WASP-family pro 1.8
121750 AA421184 Hs.97549 ESTs 1.5
122127 AA434447 Hs.106771 ESTs 25
122348 AA443695 Hs.293410 ESTs 2.1
122485 AA448300 Hs.160318 phospholemman 1.5
123443 AA598841 Hs.167382 natriuretic peptide receptor A/guanylate cy 1.8
123505 AA600135 ESTs; Moderately similar to !!!! ALU SUB 1.5
125284 W94688 Hs.103253 perilipin 1.7
126300 D81972 HUM427D08B Human fetal brain (TFujw 1.8
126747 R72515 Hs.160318 phospholemman 1.6
127218 AA309765 Hs.116017 ESTs; Weakly similar to KIAA0795 protei 1.5
127357 AA452788 Hs.75432 zx39g11.r1 Soares_total_fetus_Nb2HF8_9 1.7
127638 AA634405 Hs.122608 ESTs 1.5
128213 AA972780 Hs.129134 ESTs; Weakly similar to !!!! ALU SUBFA 1.5
128351 AI092391 Hs.134886 ESTs 1.5
128842 N44757 Hs.20340 ESTs 1.6
128870 R71403 Hs.75309 eukaryotic translation elongation factor 2 1.7
129146 AA459944 Hs.108924 DKFZP586P1422 protein 1.5
129285 T62068 Hs.11006 ESTs 2.1
129331 N93465 Hs.279772 ESTs; Highly similar to CGI-38 protein [H 1.5
130085 M62402 Hs.274313 insulin-like growth factor binding protein 6 1.7
130400 M25079 Hs.263103 hemoglobin; beta 1.7
131267 AA211776 Hs.2504 myomesin 1 (skelemin) (185kD) 3.8
131277 AA131466 Hs.23767 ESTs 1.9
131282 M12272 Hs.4 alcohol dehydrogenase 3 (class I); gamma p 22
131304 AA295848 Hs.25475 aquaporin 7 1.7
131810 D49487 Hs.194236 leptin (murine obesity homolog) 15
132788 AA045503 Hs.56874 ESTs; Weakly similar to Homo sapiens p2 1.6
132931 Z41452 Hs.6090 deleted in bladder cancer chromosome regi 1.5
133120 X64559 Hs.65424 tetranectin (plasminogen-binding protein) 2
133314 U95367 Hs.70725 gamma-aminobutyric acid (GABA) A recep 1.5
133507 X74295 Hs.74369 integrin; alpha 7 1.7
133601 S95936 Hs.284176 transferrin 2.3
133702 N56898 Hs.75652 glutathione S-transferase M5 1.9
134111 N79674 Hs.8022 TU3A protein 4.6
134699 U56614 Hs.6864$ deoxyribonuclease I-like 3 1.5
134749 L10955 Hs.89485 carbonic anhydrase IV 1.6
135173 M72885 Hs.95910 Human G0S2 protein gene; complete cds 1.9
300132 AW027556 Hs.156266 ESTs 1.7
300732 AI369956 Hs.257891 ESTs 1.5
300750 AA514805 Hs.293055 ESTs 1.8
301140 AI807692 Hs.129129 ESTs 1.6
301396 AA923549 Hs.224121 ESTs 2.1
302910 N77976 Hs.251577 hemoglobin; alpha 1 1.8
303798 V00505 Hs.36977 hemoglobin; delta 1.6
303831 T04868 Hs.46780 EST cluster (not in UniGene) with exon hit 1.7
303844 U94362 Hs.58589 glycogenin 2 1.5
304182 H91086 EST singleton (not in UniGene) with exon 1.5
304622 AA516384 EST singleton (not in UniGene) with exon 1.5
304682 AA550994 EST singleton (not in UniGene) with exon 1.7
305612 AA782347 Hs.272572 EST singleton (not in UniGene) with exon 1.5
306193 AA923457 EST singleton (not in UniGene) with exon 1.5
307206 AI192534 EST singleton (not in UniGene) with exon 1.6
307377 AI222691 EST singleton (not in UniGene) with exon 1.5
308023 AI452732 Hs.251577 EST singleton (not in UniGene) with exon 1.9
308359 AI612774 Hs.79372 retinoid X receptor; beta 1.5
309838 AW296073 Hs.255504 EST 1.5
310403 AI720978 Hs.148006 ESTs; Moderately similar to alternatively s 1.8
311671 AW241947 Hs.232478 ESTs 1.6
311794 AW238092 Hs.254759 ESTs 2.1
312082 T79860 Hs.118180 ESTs 1.9
312575 H25237 Hs.306814 ESTs 2.3
313076 N49684 Hs.143040 ESTs 1.8
313283 W32480 Hs.157099 ESTs 2.2
313374 AW328672 Hs.132760 ESTs 1.9
314701 AI754634 Hs.131987 ESTs 1.7
315391 AA759098 Hs.192007 ESTs 1.8
315688 AA680055 Hs.264885 ESTs 1.5
316249 AA948612 Hs.130414 ESTs 1.6
316586 AI205077 Hs.294085 ESTs 1.7
316890 AA837079 Hs.24647 ESTs 1.5
316983 AI480204 Hs.177131 ESTs 1.5
317604 AI650625 Hs.300756 ESTs 1.6
317951 AW206520 Hs.129621 ESTs 1.5
319400 W26902 Hs.154085 ESTs 1.7
320757 H22654 Hs.6382 EST cluster (not in UniGene) 1.5
321594 AA021402 Hs.11067 ESTs 1.7
322102 H45589 EST cluster (not in UniGene) 1.5
322814 AI824495 Hs.211038 ESTs 2.2
322929 AI365585 Hs.146246 ESTs 2.3
323831 AA335715 Hs.200299 ESTs 1.7
324044 AL045752 Hs.22350 ESTs 1.8
324675 AW014734 Hs.157969 ESTs 2.2
325272 CH.11_hs gi|5866902 15
325558 CH.12_hs gi|6056302 1.6
325656 CH.14_hs gi|6056305 16
326120 CH.17_hs gi|5867194 15
326139 CH.17_hs gi|5867203 15
326855 CH.20_hs gi|6552460 15
327438 CH.02_hs gi|6004454 16
329733 CH.14_p2 gi|6065783 16
330931 F01443 Hs.284256 ESTs 46
331591 N71677 Hs.42146 ESTs 19
332159 AA621393 Hs.112984 EST 1.5
332364 W94688 Hs.103253 perilipin 2.1
332502 H21819 Hs.14896 Homo sapiens clone 24590 mRNA sequenc 15
334175 CH22_FGENES.349_10 1.5
334347 CH22_FGENES.375_31 13
334737 CH22_FGENES.424_12 18
335352 CH22_FGENES.539_5 15
335639 CH22_FGENES.584_19 16
336244 CH22_FGENES.746_2 1.5
336336 CH22_FGENES.814_8 17
336865 CH22_FGENES.305_1 16
337494 CH22_FGENES.799_12 16
337764 CH22_EM:AC000097.GENSCAN.119-1 18
337983 CH22_EM:AC005500.GENSCAN.110-1 2
338192 CH22_EM:AC005500.GENSCAN.228-1 15
339366 CH22_BA354I12.GENSCAN.34-2 15
TABLE 15A

Table 15A shows the accession numbers for those pkeys lacking UniGene IDs in Table 15.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.



CAT number: Gene cluster number
Accession: Genbank accession numbers

15 

Pkey CAT number Accession 

126300 250375J D81972 BE003132 
20 112538 504579 J AA908813 R70255 



25 322102 46708J H45589H19807AF075038H19808 H42437 

336865 CH22L_4590FG_305J_ 

338192 CH2^6755FG_JJNK_EM:AC0O 

329733 c14j>2 

326120 c17_hs 
30 326139 c17Jis 

326855 c20_hs 

335352 CH22_2699FG_539_5JJNK_EM 

335639 CH22_2999FG_584J9_LINK^ 

307206 AI192534
307377 AI222691

337494 CH22_5727FGJ99J2_ 

337764 CH22_6115FG__UNK_EMAC00 

337983 CH22»6438FG_LINKLEMACO0 

339366 CH22_8336FG_LINK_BA354I1 
40 325272 c11Jis 1 

325558 c12_hs 

325656 c14_hs 

334175 CH22J455FGJ49J0JJNK^E 
304182 H91086 
45 334347 CH22J640FG_375_3UINK_E 
327438 cJJ\S 
304622 AA516384 

334737 CH22^2049FG_424_1ZJJNK-E 
304682 AA550994 
50 336244 CH2Z.3642FG_746JUUNK.DA 
306193 AA923457 

336336 CH22J746FG__814_8JJNK_BA 



10 



Unique Eos probeset identifier number 



123505 genbanK_M600135 
104672 6735J AA34 




H45809 F33447 AA774528 AA007629 H42537 C01077 F32386 
TABLE 15B

Table 15B shows the genomic positioning for those pkeys lacking UniGene IDs and
accession numbers in Table 15. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also
listed.

Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.

Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.
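As a reading aid, the following minimal Python sketch shows one way an Nt_position entry could be interpreted. It is an illustration only, not part of the described method. The names `PredictedExon`, `parse_exon` and `exon_length` are hypothetical, the coordinates are assumed to be inclusive at both ends, and the pairing of each range with a strand is illustrative. Several ranges in the table are written high-to-low, so the parser does not assume an order.

```python
from typing import NamedTuple


class PredictedExon(NamedTuple):
    start: int   # lower genomic coordinate
    end: int     # higher genomic coordinate
    strand: str  # "Plus" or "Minus", as given in the Strand column


def parse_exon(nt_position: str, strand: str) -> PredictedExon:
    """Parse a 'first-second' range without assuming which number is larger."""
    first, second = (int(part) for part in nt_position.strip().split("-"))
    return PredictedExon(min(first, second), max(first, second), strand)


def exon_length(exon: PredictedExon) -> int:
    """Length in nucleotides, assuming both coordinates are inclusive."""
    return exon.end - exon.start + 1


plus_exon = parse_exon("13663814-13663926", "Plus")
minus_exon = parse_exon("11668659-11668597", "Minus")
print(exon_length(plus_exon), exon_length(minus_exon))  # prints: 113 63
```

Normalizing to a low/high pair keeps length arithmetic independent of strand, while the strand label is carried along unchanged.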



Pkey Ref

334347 Dunham, I. et al. Plus
334737 Dunham, I. et al. Plus
335639 Dunham, I. et al. Plus
337494 Dunham, I. et al. Plus
334175 Dunham, I. et al. Minus
335352 Dunham, I. et al. Minus
336244 Dunham, I. et al. Minus
336336 Dunham, I. et al. Minus
336865 Dunham, I. et al. Minus
337764 Dunham, I. et al. Minus
337983 Dunham, I. et al. Minus
338192 Dunham, I. et al. Minus
339366 Dunham, I. et al. Minus

325272 5866902 Minus 

325558 6056302 Plus 

325656 6056305 Minus 

329733 6065783 Plus 

326120 5867194 Plus 

326139 5867203 Minus 

326855 6552460 Minus 

327438 6004454 Minus 



Strand Nt_position



13663814-13663926 

15998517-15998685 

25173591-25173696 

33339024-33339148 

11668659-11668597 

22681512-22681384 

31402729-31402583 

33797209-33797076 

86224054622289 

4035640-4035446

7275495-7275271 

13248453-13248277 

33647431-33647293 

13247-13312 

70930-71030 

78190-78707 

163237-163450 

36116^6276 

218901-218960 

111390-111463 
TABLE 16: Table 4 from BRCA 001-5 US

Table 16, a subset of Table 15, depicts a preferred group of genes highly downregulated in
breast cancer cells.



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of normal breast tissue to tumor 



Pkey ExAccn UnigeneID Unigene Title R1

100502 TIGR:HT1496 Hs.169228 Adrenal-Specific Protein Pg2 2.3
101367 M12963 Hs.4 alcohol dehydrogenase 1 (class I); alpha 2.9
102857 X00129 Hs.76461 retinol-binding protein 4; interstitial 3
104672 AA007629 glycerol-3-phosphate dehydrogenase 1 2.4
107099 AA609645 Hs.211568 eukaryotic translation initiation factor 4 gam 2.7
108604 AA099820 Hs.49696 ESTs 2.4
115949 AA443800 Hs.43125 ESTs 2
115965 AA446661 Hs.173233 ESTs 2.2
119175 R71792 Hs.301002 ESTs; Weakly similar to cell death activator 2.8
119798 W73386 Hs.249129 ESTs 3
122127 AA434447 Hs.106771 ESTs 2.5
122348 AA443695 Hs.293410 ESTs 2.1
129285 T62068 Hs.11006 ESTs 2.1
131267 AA211776 Hs.2504 myomesin 1 (skelemin) (185kD) 3.8
131282 M12272 Hs.4 alcohol dehydrogenase 3 (class I); gamma 2.2
131810 D49487 Hs.194236 leptin (murine obesity homolog) 2.5
133120 X64559 Hs.65424 tetranectin (plasminogen-binding protein) 2
133601 S95936 Hs.284176 transferrin 2.3
134111 N79674 Hs.8022 TU3A protein 4.6
301396 AA923549 Hs.224121 ESTs 2.1
311794 AW238092 Hs.254759 ESTs 2.1
312575 H25237 Hs.306814 ESTs 2.3
313283 W32480 Hs.157099 ESTs 2.2
322814 AI824495 Hs.211038 ESTs 2.2
322929 AI365585 Hs.146246 ESTs 2.3
324675 AW014734 Hs.157969 ESTs 2.2
330931 F01443 Hs.284256 ESTs 4.6
332354 W94688 Hs.103253 perilipin 2.1
337983 CH22_EM:AC005500.GENSCAN.110-1 2
TABLE 16A

Table 16A shows the accession numbers for those pkeys lacking UniGene IDs in Table 16.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number 

CAT number: Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT number Accession 

104672 6735_7 AA349096 AI368018 F21390 F17759 R48772 AI421405 AI300352 H43971 AI378525 F33652 R47898 AI264177 F22289 N28263
AI276281 R48205 AI245302 AI190036 AI281050 AW242903 H42892 AA910870 AW473816 H25721 AW451438 F19647 F22375
H45809 F33447 AA774528 AA007629 H42537 C01077 F32386
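The Pkey, CAT number and Accession layout of Table 16A can be read as a simple lookup from probeset to cluster members. The sketch below is a hedged illustration of that reading, not the clustering procedure itself. The helper names `parse_cluster_row` and `in_cluster` are hypothetical, and the example row is abbreviated to a few of the accessions listed above.

```python
from typing import Dict, List


def parse_cluster_row(line: str) -> Dict[str, object]:
    """Split one whitespace-separated row into Pkey, CAT number and accessions."""
    pkey, cat_number, *accessions = line.split()
    return {"pkey": pkey, "cat_number": cat_number, "accessions": accessions}


def in_cluster(index: Dict[str, Dict[str, object]], pkey: str, accession: str) -> bool:
    """Return True if the accession is listed in the cluster for this Pkey."""
    row = index.get(pkey)
    return row is not None and accession in row["accessions"]


# Abbreviated row for Pkey 104672 (only a few of its accessions are shown).
row = parse_cluster_row("104672 6735_7 AA349096 F21390 F17759 R48772 AA007629")
index = {row["pkey"]: row}

# AA007629 is the exemplar accession given for Pkey 104672 in Table 16.
print(in_cluster(index, "104672", "AA007629"))
```

Keying the index on Pkey mirrors how the tables cross-reference each other: Table 16 supplies the exemplar accession, and Table 16A supplies the full cluster it was drawn from.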
TABLE 17: Table 1 from BRCA 014 P

Table 17 shows accession numbers representing 759 sequences of breast cancer genes or
fragments thereof encoding breast cancer modulating proteins. For each overexpressed gene
identified, a ratio of the relative amount of expression in breast tumors versus normal breast
tissue is shown.



Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title 

R1: Ratio of tumor to normal breast tissue 
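The R1 column is a tumor-to-normal expression ratio. The following Python sketch illustrates how such a ratio could be computed and used to select overexpressed entries. It is an assumption-laden illustration: the function names are hypothetical, the 3.0 cutoff is only inferred from the fact that the values listed in this table are 3 or higher, and the last example row is made up to show an entry being excluded.

```python
from typing import List, Tuple

# (Pkey, ExAccn, UnigeneID, Unigene Title, R1)
Row = Tuple[str, str, str, str, float]


def tumor_to_normal_ratio(tumor_signal: float, normal_signal: float) -> float:
    """R1 as defined in the legend: tumor expression divided by normal expression."""
    if normal_signal <= 0:
        raise ValueError("normal_signal must be positive")
    return tumor_signal / normal_signal


def select_overexpressed(rows: List[Row], min_ratio: float = 3.0) -> List[Row]:
    """Keep rows whose R1 meets the cutoff, highest ratio first (cutoff is an assumption)."""
    kept = [row for row in rows if row[4] >= min_ratio]
    return sorted(kept, key=lambda row: row[4], reverse=True)


rows: List[Row] = [
    ("100405", "AW291587", "Hs.82733", "nidogen 2", 3.2),
    ("103010", "X52509", "Hs.161640", "tyrosine aminotransferase", 12.4),
    ("103117", "X63578", "Hs.295449", "parvalbumin", 3.0),
    ("000000", "", "", "hypothetical low-ratio entry (not from the table)", 1.4),
]

selected = select_overexpressed(rows)
print([row[0] for row in selected])  # prints: ['103010', '100405', '103117']
```

Note that Table 16 defines R1 in the opposite direction (normal to tumor), so the same selection logic would apply there only after swapping the arguments to the ratio.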



Pkey ExAccn UnigeneID Unigene Title R1

100227 AV654694 Hs.82316 interferon-induced, hepatitis C-associat 3
100405 AW291587 Hs.82733 nidogen 2 3.2
100406 AI962060 Hs.118397 AE-binding protein 1 3.6
100420 D86983 Hs.118893 Melanoma associated gene 3.2
100911 X83300 Hs.289103 SMA4 5.2
100960 J00124 Hs.117729 keratin 14 (epidermolysis bullosa simple 4.3
101011 BE387036 Hs.1211 acid phosphatase 5, tartrate resistant 3
101183 AA442324 Hs.795 H2A histone family, member O 3.2
101194 L20971 Hs.188 phosphodiesterase 4B, cAMP-specific (dun 3
101329 U66042 Hs.82171 Homo sapiens clone 191B7 placenta expres 4.1
101378 BE563085 Hs.833 interferon-stimulated protein, 15 kDa 5.3
101474 R07566 Hs.73817 small inducible cytokine A3 (homologous 3.9
101491 M25809 Hs.64173 ATPase, H+ transporting, lysosomal (vacu 4.5
101530 M29874 Hs.1360 cytochrome P450, subfamily IIB (phenobar 9
101602 AA353776 Hs.901 CD48 antigen (B-cell membrane protein) 3.4
101663 NM_003528 Hs.2178 H2B histone family, member Q 5.6
101758 BE019494 Hs.79217 pyrroline-5-carboxylate reductase 1 3.6
101767 M81057 Hs.180884 carboxypeptidase B1 (tissue) 12
101817 M89907 Hs.152292 SWI/SNF related, matrix associated, acti 3.2
101851 BE260964 Hs.82045 midkine (neurite growth-promoting factor 4.1
101878 M97815 Hs.183650 cellular retinoic acid-binding protein 2 6.5
102209 NM_002038 Hs.265827 interferon, alpha-inducible protein (clo 3
102214 U23752 Hs.32964 SRY (sex determining region Y)-box 11 3
102297 NM_001504 Hs.198252 G protein-coupled receptor 9 3.7
102299 NM_005824 Hs.155545 37 kDa leucine-rich repeat (LRR) protein 3.7
102301 NM_005651 Hs.183671 tryptophan 2,3-dioxygenase 5.2
102305 AL043202 Hs.90073 chromosome segregation 1 (yeast homolog) 3.5
102369 U39840 Hs.299867 hepatocyte nuclear factor 3, alpha 3.9
102591 U62325 Hs.324125 amyloid beta (A4) precursor protein-bind 4
102721 H16646 Hs.118666 hypothetical protein PP591 3.5
102739 AA363025 Hs.155572 Human clone 23801 mRNA sequence 3.2
102791 AF080229 gb:Human endogenous retrovirus K clone 1 3
102804 NM_002318 Hs.83354 lysyl oxidase-like 2 3.2
102903 M73779 Hs.250505 retinoic acid receptor, alpha 3.3
103010 X52509 Hs.161640 tyrosine aminotransferase 12.4
103042 T81656 Hs.252259 ribosomal protein S3 4.5
103117 X63578 Hs.295449 parvalbumin 3
103207 X72790 gb:Human endogenous retrovirus mRNA for 5.9
103282 BE390551 Hs.77628 steroidogenic acute regulatory protein r 3.9
103284 AJ751601 Hs.8375 TNF receptor-associated factor 4 3.3
103329 X85134 Hs.72984 retinoblastoma-binding protein 5 3.1
103364 X90872 Hs.279929 gp25L2 protein 3
103385 NM_007069 Hs.37189 similar to rat HREV107 3.4
103456 AA496425 Hs.9629 papillary renal cell carcinoma (transloc 3.2
103498 Y09306 Hs.30148 homeodomain-interacting protein kinase 3 3.4
103558 BE616547 Hs.2785 keratin 17 3.7
103563 L02911 Hs.150402 Activin A receptor, type I (ACVR1) (ALK 3.2
103612 BE336654 Hs.70937 H3 histone family, member A 4.5
103825 AI571835 Hs.55468 ESTs 4
104073 AW779318 Hs.88417 ESTs 3.8
104103 AW021102 Hs.21509 ESTs 4.3
104115 AF183810 Hs.26102 opposite strand to trichorhinophalangeal 7.6
104168 AA461618 Hs.31704 ESTs, Weakly similar to KIAA0227 [H.sapi 3.6
104173 AA084273 Hs.76561 ESTs, Weakly similar to S47072 finger pr 4
104181 AF173296 Hs.283740 DC6 protein 3
104189 AB040927 Hs.301804 KIAA1494 protein 3.2
104269 AI559444 Hs.293960 ESTs 4.3
104307 AI929700 Hs.111680 endosulfine alpha 3.1
104518 H20816 Hs.112423 Homo sapiens mRNA; cDNA DKFZp586I1420 (f 3.2
104556 AV650851 Hs.96900 hypothetical protein; KIAA1830 protein 4.4
104658 AA360954 Hs.27268 Homo sapiens cDNA: FLJ21933 fis, clone H 3.2
104748 AA015879 Hs.33536 ESTs 3.2
104755 T49951 Hs.9029 DKFZP434G032 protein 4.5
104825 AA035613 Hs.141883 ESTs 6.9
104830 AW294092 Hs.21594 hypothetical protein MGC15754 11.1

104865 T79340 Hs^2575 B-ceH CUVlymphoma 6, member B (ztnc fi 3.5 

104906 BE298684 Hs.26802 protein kinase domains containing protei 6.5 

104961 H78517 Hs.33905 ESTs 3.6 

25 105038 AW503733 Hs.9414 KIAA1488 protein 4.5 

105088 H58589 Hs.35156 Homo sapiens cONA FU1 1027 fis, done PL 3.8 

105092 AA148982 Hs.29068 ESTs 3 

105093 AL137566 Hs.32405 Homo sapiens mRNA; cDNA DKFZp586G0321 (f 4.8 
105304 AW134924 Hs.190325 ESTs 8.2 

30 105397 AA814807 Hs.7395 hypothetical protein FU23182 3.1 

105409 AW505076 Hs.301855 DiGeorge syndrome critical region gene 8 4.2 

105431 AA252033 Hs.242413 hypothetical protein DKFZp434K1421 4.4 

105552 AA256750 Hs.28802 centaurln-alpha 2 protein 3.2 

105598 AA279439 Hs.279763 hypothetical protein FU 10504 3.5 

35 105650 W16741 Hs.25635 HSPC003 protein 3.7 

105688 AI299139 Hs.17517 ESTs 5 5 

105808 AI133161 Hs.286131 CGM01 protein 3.5 

105809 AW973653 Hs.20104 hypothetical protein FU00052 3.3 
105909 AA195191 Hs.5111 hypothetical protein FU20729 3.2 

40 105965 AA131657 Hs.23830 ESTs 3.3 

106135 AL1 17474 Hs.41 181 Homo sapiens mRNA; cONA DKFZp727C191 (fr 3.2 

106184 W28948 Hs.10762 ESTs 3.3 

106293 N39842 Hs.301444 K1AA1673 4.1 

106400 BE397649 Hs.94109 Homo sapiens cDNA FU13634 fis, clone PL 3.1 

45 106474 BE383668 Hs.42484 hypothetical protein FLJ10618 3.2 

106484 AA351978 Hs.4943 hepatocellular carcinoma associated prat 7.8 

106533 AL134708 Hs.145998 ESTs 3 

106614 AA648459 Hs,335951 hypothetical protein AF301 222 3.8 

106636 AW958037 Hs.286 ribosomal protein L4 3.3 

50 106661 AW499914 Hs.7579 hypothetical protein FU1Q402 3 

106743 BE613328 Hs.21938 hypothetical protein FU12492 4.2 

106844 AA485055 Hs.158213 sperm associated antigen 6 3.4 

106864 AJ311928 gb:qo89h04jc1 NCI_CGAP_Kid5 Homo sapiens 4.4 

106865 AW192535 Hs.19479 ESTs 3.6 
55 106871 AW472981 Hs.321130 hypothetical protein MGC2771 4.1 

106942 AA995351 Hs.31314 retinoblastoma-blnding protein 7 3,6 

106968 AF216751 Hs.26813 CDA14 5.3 

107105 AW963419 Hs.155223 stanniocaitin 2 3.4 

107158 N32849 Hs.31844 hypothetical protein FU12586 3.1 

60 107248 AW263124 Hs.315111 nudearreceriora>^ressc<WOAC3comp 5.9 

107265 BE379594 Hs.49136 ESTs, Moderately similar to ALU7_HUMAN A 3.9 

107630 AW961576 Hs.60178 ESTs 4.6 

107710 A1955040 Hs.265398 ESTs, Weakly similar to transformaGon-r 3 

107890 AA025386 Hs.61311 ESTs, Weakly similar to S10590 cysteine 3.1 

65 107985 T40064 Hs.71968 Homo sapiens mRNA; cDNA DKFZp564F053 (tr 4.8 

108000 A1263307 Hs.239884 H2B histone family, member L 3.3 

108217 AA058686 Hs.62588 ESTs 3.8 
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mQT77 

i uy*/ ( 


A A 4 OCX /41 

AA 13044,3 
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3.7 


infix in 
1U94 lU 


AW5Q4/32 


Un O407C k>»ultM«tuMl MMtnL CI I4 4A4 4 

ns.212/5 nypotnetical protein FU1 101 1 


4.6 


1 0QX CA 

1U9454 


AA232255 


Hs.295232 tSTs, Modeiateiy similar to A46010 X-lm 


6.4 


lU9ol4 


AA234Q87 


Hs.26234o tSTs, weakly similar to S72482 nypotneti 


4.8 


infiCQI 


R45584 


HS.23025 ESTs, Weawy simuar to ALU5_HUMAN ALU S 


3.3 


inofi^o 
lUyooZ 


a AlOCIIfl 
AA32oiJo 


lie- 01CQ77 Ki»nn(knt!/>ol nrn 4n!n O IOOC70 

ns.23D0f3 nypotnetical protein rU22b/2 


3 


109o44 


AW973964 


Hs.291 531 ESTs, Highly simitar to 1 203217A oenydro 


3 


lU9fUU 


rU9oU9 


gbrlbUoonOyz normalized infant Drain cDN 


3.2 




rUDOOO 


nS.l4/b3 tolS 


3.2 


109807 


R43646 


Hs.12422 ESTs 


3.8 




AWo 18435 


Hs.23590 solute carrier family 1 6 (monocarboxyfic 


3.3 


iU9oyo 


Al/AA4CQft 
AKOOI08O 


Hs.30488 DKFZP434F091 protein 


3.6 


1 1UU24 


AMf07*J4C0 

AW9/3152 


MS.31050 E5TS 


4.2 


110561 


AA379597 


Hs.5199 HSPC150 protein similar to ubiquitin-con 


5.1 


110675 


H89355 


Hs.249159 adrenergic, alpha-2A-, receptor 


5.3 


110707 


AI239832 


1U 4CC4T POT. lAf^^L^. J_!Ua 1^. Al 1 til IJIIAAAKI AIIIO 

Hs, 15617 ESTs, Weakly similar to ALU4 JiUMAN ALU S 


3.7 


110915 


BEQ92285 


Hs.29724 nypotnetical protein RJ13187 


3.7 


111110. 


MCXCQ1 


nS.29U94o to IS 


A 


1111 

mioo 


MGCCGI 

NboooJ 


HS.191350 toTS 


0 4 

3.1 


1 1 1 1 0ft 


AI7C7AOC 

A1767435 


HS.Zyo22 ESTS 


4.5 


1 1 1000 


AHo/OjO 


Uo OfiOQil COTe 
hSui9o94 to IS 


C A 

0.4 


444cm 
111510 


R07856 


nS.16355 ESTS 


3.2 


114 C70 


Drift Ad n 


gu.yiiyiuy.si ooares leiai uver spteen 


Q 4 

0.1 


1 neon 


AAOU2004 


MS 23260 ESTS 


3.2 


1 14Q*VJ 

111823 


R35252 


Hs^4944 ESTs, Weakly simaar to 21 09260A B cell 


3,3 


111 Q7C 

nio/o 


R38239 


ns.293240 to is, Weakly similar to putative pi50 [n 


O 4 

3.1 




A A A T 4 nil 4 

AA421081 


Uo 4OOQ0 COT* 

HS. 12388 EoTs 


3.4 


111893 


AF070526 


Hs. 1 3429 Homo sapiens clone 24787 mRNA sequence 


3.3 


4 4HOC 
112125 


AW379029 


Hs.1 18338 ESTs, Weakly similar to unnamed protein 


4.4 


4 4147A 
1121/0 


BE246743 


Hs.288529 nypotnetical protein FU22635 


7.3 


112287 


AB033064 


ii- on n r\f lyt a a a nin — i » 

Hs^34806 KIAA1 238 protein 


3.2 


112300 


H24334 


Hs^6125 ESTs 


4.4 


112303 


R54797 


go:yg87b07^l Soares infant brain 1NIB H 


3.4 


112478 


R66067 


Hs^8664 ESTs 


8.2 


4 41CC4 

1125b! 


A V7CH jno 

AJ791493 


Hs.1 29873 ESTs, Weakly simuar to A36036 cytoenrom 


5.5 


112631 


R82Q4Q 


gD.7juSbQ6.sl Soares placenta Nb2HP Homo 


3.9 


112637 


R82331 


Hs.1 64599 ESTs 


5.4 


112657 


AW844878 


Hs.19769 hypothetical protein M6C4 174 


3.2 


112678 


AI418466 


Hs.33665 ESTs 


4.7 


4 40Q47 


A AAQOJlCK 
AAUOZ400 


Hs.1 25031 choline/ethanoi«3mlnephosphotmnsferas6 


3.7 


113070 AB032977 Hs.6298 KIAA1151 protein 3.1
113095 AA828380 Hs.126733 ESTs 3.4
113117 AW813731 Hs.159153 ESTs, Moderately similar to S65657 alpha 3.4
113187 BE613410 Hs.31575 SEC63, endoplasmic reticulum translocon 3.2
113200 T57773 Hs.10263 ESTs 3.5
113206 BE262470 Hs.241471 RNB6 6.2
113374 T79925 Hs.269165 ESTs, Weakly similar to ALU1_HUMAN ALU S 3.7
113440 U54727 Hs.191445 ESTs 3
113494 T91451 Hs.86538 ESTs 3.4
113518 AW367788 Hs.323954 postmeiotic segregation increased 2-like 3.1
113571 AJ702609 Hs.15713 hypothetical protein MGC2776 3.1
113822 NM_004585 Hs.17466 retinoic acid receptor responder (tazaro 3.9
113835 AI912410 Hs.27475 Homo sapiens cDNA FLJ12749 fis, clone NT 3


113938 


W81598 


gb:zd88g02^1 &»resJetaUieartNbHH19W 


4.6 


113947 


W84768 


gb:zh53d03.s1 Soares JetaUver spleen. 


3.1 


113970 


W27249 


Hs.8109 hypothetical protein FU21080 


6.9 


114086 


AA378776 


Hs^88649 hypothetical protein MGC3077 


4.3 
114148 AW470411 Hs.288433 neurotrimin

114424 AW780192 Hs.267596 ESTs

114518 AW163267 Hs.106469 suppressor of var1 (S.cerevisiae) 3-like

114563 AI979168 Hs.82226 glycoprotein (transmembrane) nmb

114965 AI733081 Hs.72472 BMP-R1B 

114995 AA769266 Hs.193657 ESTs 

115121 AI634549 Hs.88155 ESTs 

115134 AW968073 Hs.194331 ESTs, Highly similar to A55713 inositol

115167 AA749209 Hs.43728 hypothetical protein

115253 BE149845 Hs.289038 hypothetical protein MGC4126 

115277 AA814100 Hs.86693 ESTs 

115327 N46436 Hs.109221 ESTs 

115354 AA281636 Hs.334827 ESTs 

115657 AA405620 Hs.55158 ESTs, WeaWy similar to T29520 hypotheti 

115676 AA953006 Hs.88143 ESTs 

115709 AW293849 Hs^8279 ESTs, Weakly similar to ALU7_HUMAN ALU S 

115729 AA417812 Hs^8775 ESTs 

115787 AI126772 Hs.40479 ESTs 

115830 AW970529 Hs.86434 hypothetical protein FU21 816 

115835 AA521410 Hs.41371 ESTs 

115850 NM.014937 Hs^2463 KIAA0966 protein 

115900 AK001500 Hs.165186 hypothetical protein FU13852 

115935 AA354549 Hs.41181 Homo sapiens mRNA; cONA DKFZp727C191 (fr 

115948 AL042465 Hs.43445 po!y(A)-specific ritjonuctease (deadenyia 

116092 AB041035 Hs.93847 NM_016931:Homo sapiens NADPH oxidase 4 ( 

116115 AL042355 Hs.70202 WD repeat domain 10 

116184 AW450737 Hs.128791 CGI-09 protein 

116192 AA464976 Hs.62528 ESTs, Moderately similar to A46010 X-Un 

116208 A1219083 Hs.42532 ESTs, Moderately similar to ALU8_HUMAN A 

116246 AF265555 Hs.250646 baculoviral lAP repeat-containing 6 

116443 AW962196 Hs.321264 LBP protein 32 

116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 

116726 AK001114 Hs.53913 hypofoetical protein FU10252 

1 16845 AA649530 gb:hs44f05.s1 NCLCGAP^Ahri Homo sapiens 

117026 H88256 Hs.50456 ESTs, Moderately simBar to ZN75_HUMAN Z 

117216 AI569804 1 Hs.42792 ESTs. WeaWy similar to 178885 serine/th 

117296 AL133427 Hs.42506 Homo sapiens mRNA full length insert cDN 

117403 H84455 Hs.40639 ESTs 

117691 AB040959 Hs.93836 DKFZP434N014 protein 

118229 AW968941 Hs.166254 hypothetical protein DKFZp566H 33 

118363 AI183838 Hs.48938 hypothetical protein FU21802 

118416 N66028 Hs.49105 FKBP-associated protein 

118470 AW970584 Hs.291033 ESTs 

118502 AL157488 Hs.50150 Homo sapiens mRNA; cDNA DKFZp564B182 (fr 

118695 AK000465 Hs.50081 KIAA1 199 protein 

118925 N92293 Hs.206832 ESTs, Moderately similar to ALU8_HUMAN A 

119025 BE003760 Hs.55209 Homo sapiens mRNA; cDNA DKFZp434K0514 (f 

119036 R95872 Hs.117572 chemoWne binding protein 2 

119063 R16833 Hs.53106 ESTs, Moderately similar to ALU1_HUMAN A 

119075 M10905 Hs.287820 fibronectin 1 

119620 W47620 Hs.56009 Z-tf-oDgoadenytate synthetase 3 (100 k 

119741 AF041853 Hs.43670 Wnesin family member 3A 

119747 AI970797 Hs.64859 ESTs 

119754 AL037824 Hs.194695 ras homolog gene family, member I 

119905 AW449064 Hs.1 19571 collagen, type III, alpha 1 (Ehters-Oanl 

120084 W94472 Hs.59529 ESTs, Moderately similar to ALULHUMAN A 

120241 AA825686 Hs.321176 ESTs, Weakly similar to S65824 reverse t 

120326 AA196300 Hs.21145 hypothetical protein RGQ83M05.2 

120742 AA225084 gbaic21d06.r1 NCLCGAP_Pri Homo sapiens 

120870 AA357172 Hs.292581 ESTs, Moderately similar to ALU1 JflJMAN A 

120885 AA365515 Hs.301872 hypothetical protein MGC4840 

120970 AA39B118 Hs.97579 ESTs, Weakly similar to A46010 X-finked 

121054 AW976570 Hs.97387 ESTs 

121095 AA320134 Hs.196029 Homo sapiens mRNA far WAA1657 protein, 
121103 AA398936 Hs.97697 EST 

121121 AA399371 Hs.189095 simBar to SALL1 (sal (Drosoph2a)4ike 
121337 AW885727 Hs.301570 ESTs 
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121351 AW206227 
121611 M31669 
121643 AA640987 
121770 NMJJ15902 
122125 AK000492 
122338 AA443311 
122417 AA446965 
122513 AI767879 
122544 AW973253 
122655 AA3232S6 
122805 AA526911 
122851 AW205931 
123105 M487809 
123111 AA228776 
123249 AA371307 
123273 AA491253 
123385 BE149685 
123419 T66087 
123485 AI308876 
123645 AI675944 
123819 AA580082 
124012 AA352723 
124243 H69125 
124357 N22401 
124359 N22508 
124567 AW451645 
124911 N34151 
124972 R41396 
125006 BE065136 
125042 T78906 
125184 W60326 
125243 AW970536 
125286 AF086534 
125304 AL359573 

125330 AW880562 

125331 AI422996 
125685 AI924630 
126257 N99638 
126474 AW975814 
126666 AA648886 
126872 AW450979 
127431 AW771958 
127980 AA961459 
127997 AW06B311 
128420 AA650274 
128609 NNL003616 
128946 Y13153 
128955 AA775076 
129092 D56365 
129270 AA357185 
129301 AF182277 
129385 AA172106 
129619 AA209534 
129629 AKOO0398 
129725 X56411 
130069 AI754613 
130092 X03363 
130298 AI347487 
130382 NM.003450 
130622 AI582291 
130703 R77776 
130881 AA809875 
130954 AB014544 
131095 AJ399653 
131153 H09048 
131253 R71802 
131372 AW293399 



Hs.287727 hypothetical protein FU23132 
Hs.1735 Inhibin. beta B (adivin AB beta polypep 
Hs.193767 ESTs 

Hs.278428 progestin Induced protein 
Hs.98806 hypothetical protein 
Hs.98998 ESTs 
Hs.112092 ESTs 
Hs.99214 ESTs 
Hs.292689 ESTs 

Hs.37337 Homo sapiens mRNA; cDrtt DKFZp547J047 (fr 
Hs.82772 collagen, type XI, alpha 1 
Hs.99598 hypothetical protein MGC5338 
Hs.166011 catenln(cacflierin^associated protein), d 
Hs.191721 ESTs 
Hs.125056 ESTs 

Hs.173611 Empirically selected from AFFX single pr 
Hs.17767 KIAA1554 protein 
Hs.1 1 2482 Homo sapiens unknown mRNA sequence 
Hs.103849 hypothetical protein DKFZp761D112 
Hs.188691 Homo sapiens cDNA FU 12033 lis, done HE 
Hs.112264 ESTs 
Hs.241471 RNB6 
Hs.133525 ESTs 

gb:yw37g07.s1 Morton Fetal Cochlea Homo 
Homo sapiens cONA: FU21479 fe, clone C 
Homo sapiens cDNA FU11973 lis, clone HE 
interferon Induced transmembrane protein 
hypothetical protein FU23045 
spfidng factor {CC1. 3) 

ESTs, Moderately similar to ALUIJtUMAN A 
Homo sapiens cDNA FU11750 fis, clone HE 
ESTs 

ESTs, Moderately similar to ALU1_HUMAN A 
GTP-blnding protein 
ESTs 
ESTs 

hepatocellular carcinoma associated prot 
gb:za39g11.r1 Scares fetal liver spleen 
Homo sapiens done IMAGE:713177, mRNA se 
ESTs 

gb:U!-H-BI3-ala-a-12.<>-Ul.s1 NCI_CGAP.Su 
ESTs, Moderately similar to PC4259 fern 
ESTs 

Homo sapiens mRNA full length Insert cDN 
fibronectin leucine rich transmembrane p 
survival of motor neuron protein interac 
kynurenine 3-monooxygenase (kynurenine 3 
Homo sapiens, Similar to PRO0478 protein 
poly(rC)-binding protein 2 
ras homotog gene family, member H 
cytochrome P450, subfamily IIB (phenobar 
Rag C protein 
tetraspan NET-6 protein 
hypothetical protein FU20391 



Hs.139315 
Hs.151504 
Hs.174195 
Hs.101774 
Hs.145696 
Hs.269432 
Hs.288684 
Hs.105413 
Hs.187561 
Hs.124940 
Hs.1 14574 
Hs.161378 
Hs.4943 

Hs.326714 
Hs.151999 

Hs.175437 
Hs.125644 
Hs.311054 
Hs.41296 
Hs.102456 
Hs.107316 
Hs.185807 
Hs.63525 
Hs.109918 
Hs.330780 
Hs.1 10950 
Hs.284243 
Hs.11747 
Hs.1219 
Hs.146428 
Hs.323910 
Hs.132781 
Hs.155204 
Hs.16846 
Hs.18103 
Hs.25933 
Hs.21572 
Hs.22917 
Hs.23606 
Hs.24853 
Hs,144904 



collagen, type V, alpha 1 
v-erb-b2 avian erythroblastic leukemia v 
class I cytokine receptor 
zinc finger protein 174 
ESTs, Weakly similar to 04HUD1 debrisoqu 
ESTs 
ESTs 

KIAA0644 gene product 

ESTs 

ESTs 

ESTs 

nudear receptor co-repressor 1 
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4.7 

3.1 

3.3 

3 

3 

3.2 
3.2 
4 
4 

3.8 
3 

3.6 
4.1 
3.3 
4.6 
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131507 
131587 
131739 
131795 
131970 
131985 
132093 
132122 
132159 
132333 
132406 
132482 
132543 
132624 
132700 
132725 
132799 
132847 
132857 
132936 
133130 
133142 
133167 
133225 
133274 
133275 
133287 
133376 
133462 
133740 
133831 
133976 
134666 
134710 
134731 
134776 
135230 
135303 
135400 
135411 
300089 
300233 
300254 
300256 
300378 
300973 
301111 
301341 
301548 
301884 
301936 
301976 
302001 
302067 
302094 
302099 
302145 
302235 
302276 
302290 
302372 
302378 
302384 
302385 
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AI695549 
AF017986 
BE501849 
D86960 
NM_002314 
AA400091 
M426202 
D75435 
AA192669 
AL133731 
AV660345 
BE568452 
AA326108 
AA319233 . 
NMJQ6276 
W73311 
T48195 
Y00272 
AL120659 
AI128606 
AW952412 
AW162840 
AW600291 
AA085191 
Z93241 
AW797437 
BE618768 
AW675064 
AW162919 
BE274552 
A1908165 
BE391929 
AW33797 
089377 
J05582 
AFQ64604 
•R61253 
X78592 * 
L10333 
AI199738 
AW614220 
AW183618 
AW591433 
Z45270 
AA572949 
R10799 
AA887801 
A1091631 
AA312082 
U79745 
T97905 
AB020711 
BE542706 
AW749321 
AL049670 
NMJQ3613 
AL049987 
AW057736 
AA179949 
AL1 17406 
AL109712 
A1678059 
AJ224172 
AW192334 
A1038997 
AF282265 



Hs.27769 
Hs.183868 
Hs.31386 
Hs.32317 
Hs.3610 
Hs.36566 
Hs.39421 
Hs.40403 
Hs.41154 
Hs.45032 
Hs.4774 
Hs.238126 
Hs.5101 
Hs.33829 
Hs.5521 
Hs.184167 
Hs.169407 
Hs.58189 
Hs.184572 
Hs.6111 
Hs.6557 
Hs.65874 
Hs.6641 
Hs.6823 
Hs.6949 
Hs.239934 
Hs.69771 
Hs.7232 
Hs.73875 
Hs.170160 
Hs.76578 
Hs.169946 
Ks.8752 
Hs.8889 
Hs.89404 
Hs.89603 
Hs.96757 
Hs.98265 
Hs.99915 
Hs.99947 
Hs.208275 
Hs. 189402 
Hs.55610 
Hs.298241 
Hs.235873 
Hs.207566 
Hs.191990 
Hs.208229 
Hs.203845 
Hs.105445 
Hs.114924 



ESTs, Weakly similar to MCAT_HUMAN MITOC 
glucuronidase, beta 
secreted frizzied-related protein 2 
high-mobility group 20B 
KIAA0205 gene product 
UM domain kinase 1 
ESTs 

Cbp/p300-interactlng transactivator, wit 
2k family member 1 (odo^paired Drosophi 
ESTs 

Homo sapiens mRNA; cDNA DKFZp761C1712 (f 
CGl-49 protein 

protein regulator of cytokinesis 1 
bHLH protein DEC2 
ESTs 

splicing factor, arginine/serine-rich 7 
SAC2 (suppressor of actin mutations 2. y 
eukaryotic translation initiation factor 
ceB division cycte % G1 to S and G2 to 
aryWiydrocarbon receptor nuclear transl 
zinc finger protein 161 
ESTs, Weakly similar to A40348 Eiav/Sex- 
kinesln family member 5C 
hypothetical protein RJ1O430 
hypothetical protein MGC11275 
CGI-96 protein 
B-factor, properdin 

acetyl-Coenzyme A carboxylase alpha 
fumarytacetoacetate hydrolase (fumaryiac 
RAB2, member RAS oncogene family-like 
protein Inhibitor of activated STAT3 
GATA-binding protein 3 (T-cetl receptor 
transmembrane protein 4 
serine hydroxymethyttransferase 1 (solub 
msh (Drosophita) homeo box homotog 2 
mucin 1, transmembrane 
suppresswofTy(S.cerevislae)3homolo 
KIAA1877 protein 

androgen receptor (dihydrotestosterone r 
reticulon 1 

ESTs, Weakly similar to ALUAJfUMAN llil 
ESTs 

solute carrier family 30 (zinc transport 



hypothetical protein FU22672 
ESTs 
ESTs 

two pore potassium channel KT3.3 
GDNF family receptor alpha 1 
solute carrier family 16 (rronocarboxylic 
gb:ye54c10.r1 Scares fetal liver spleen 
Hs.278346 K1AA0904 protein 
Hs.222399 CEGPI protein 
Hs.6786 ESTs 

Hs.137576 ribosoma! protein L34 pseudogene 1 
Hs.151407 cartflage intermediate layer protein, nu 
Hs.166361 Homo sapiens mRNA; cDNA DKFZp564F112 (fr 
Hs.323910 HER2 receptor tyrosine kinase (c-erb-b2, 
Hs.175563 Homo sapiens mRNA; cDNA DKFZp564N0763 {f 
Hs.200102 ATP-oinding cassette transporter MRP8 
Hs.296506 Homo sapiens mRNA fuD length Insert cDN 
Hs.202676 synaptonemal complex protein 2 
Hs.204096 CpophninB(uterogtoblnfamiyrnemDer) 
Hs.38218. ESTs 
Hs.132921 ESTs 

Hs.44836 inner centromere protein antigens (135kD 
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302892 AW176909 Hs.42346 calcineurin-binding protein calsardn-1 3.4 

302970 W05608 Hs.312679 ESTs, Weakly similar to A4901 9 dyneln he 5.1 

303271 AA652687 Hs.96151 Human DNA sequence from done RP5-1 103G7 3.7 

303289 AL121460 Hs.272673 hypothetical protein FU20508 4.1 

5 303357 AW006352 Hs.159643 ESTs, Weakly similar to T32554 hypofoeti 4.2 

303540 AA355607 Hs.309490 ESTs, WeaWy simitar to putative WHSC1 p 4.3 

303563 M367699 Hs.10082 potassium intermediate/small conductance 3.3 

303642 AW299459 gb:xs50d08.x1 NCLCGAPJ<id11 Homosapien 4.2 

303780 AM24014 Hs.18995 K1AA1304 protein 3.6 

1U 303797 AW629759 gb:hh7Qe05.y1 NCLCGAPJ5U1 Homo sapiens 4.9 

303852 R53434 Hs.90207 hypothetical protein MGC1 1138 3.7 

304328 M149951 Hs.62112 zinc finger protein 207 3 

304782 AA582081 gb:nn32h08.s1 NCI_CGAP_Gas1 Homo sapiens 4.1
305913 AA876109 gb:nx24h01.s1 NCI_CGAP_GC3 Homo sapiens 3
305917 AA876469 gb:oe48b04.s1 NCI_CGAP_Pr25 Homo sapiens 3.1
307010 AI140014 gb:qa68(09j(1 Soares_fetal_heart_NbHH19W 3.5
307041 AI144243 gb:qb85b12.x1 Soares_fetal_heart_NbHH19W 3.9
308106 AI476803 gb:tj77e12.x1 Soares_NSF_F8_9W_OT_PA_P_S 4.3
308307 AI581398 Hs.172928 collagen, type I, alpha 1 4.6
308615 AK000142 Hs.101774 hypothetical protein FLJ23045 4.4
309177 AI951118 Hs.326736 Homo sapiens breast cancer antigen NY-BR 17.3
309328 AW024348 Hs.233191 EST, Weakly similar to A27217 glucose tr 3.2
309574 AW168083 gb:xg59g04.x1 NCI_CGAP_Ut4 Homo sapiens 3.1
309583 AW170035 Hs.326736 Homo sapiens breast cancer antigen NY-BR 57.6
310064 AI199712 Hs.148486 ESTs, Weakly similar to 1917210A Pro/Arg 4.6
310098 AI685841 Hs.161354 ESTs 3.6
310438 AW022192 Hs.200197 ESTs 4.6
310683 AI939456 Hs.160870 ESTs 3.2
310727 AK000703 Hs.323822 Homo sapiens mRNA for KIAA1551 protein, 3.6
310781 AI380797 Hs.158992 ESTs 10.2
310895 AI955121 Hs.165724 N-acetylgalactosamine-4-O-sulfotransfera 3.4
310955 AI476732 Hs.263912 ESTs 10.9
311117 AI671439 Hs.196029 Homo sapiens mRNA for KIAA1657 protein, 3.1
311166 AI821005 Hs.118599 ESTs 10.8
311237 AA641098 Hs.208809 ESTs, Moderately similar to ALU1_HUMAN A 4.3
311465 AJ758660 Hs.206132 ESTs 4.4
311587 AI828254 Hs.271019 ESTs, Weakly similar to A47582 B-cell gr 5.1
311598 AW023595 Hs.232048 ESTs 5.8
311774 AA700370 Hs.14304 ESTs 3.3
311785 AI056769 Hs.133512 ESTs 3.9
311872 R12375 Hs.194600 ESTs 3.3
311889 AA767342 Hs.122483 ESTs, Weakly similar to PSF_HUMAN PTB-AS 3
311913 AI358522 Hs.270188 ESTs 3
311923 T60843 Hs.189679 ESTs 5.6
311935 AA216387 gb:nc16b02.s1 NCI_CGAP_Pr1 Homo sapiens 5.2
312019 AA373630 Hs.188750 ESTs 3
312021 AA759263 Hs.14041 ESTs 3.4
312067 T78968 Hs.14411 ESTs 3.5
312090 T80177 Hs.118064 similar to rat nuclear ubiquitous casein 3.8
312147 AI633744 Hs.195648 ESTs, Weakly similar to I38022 hypotheti 4.4
312153 BE261944 Hs.118625 hexokinase 1 5.2
312168 T92251 Hs.198882 ESTs 3.3
312182 T94344 Hs.326263 ESTs 3.3
312187 AA700439 Hs.188490 ESTs 3.4
312199 AW438602 Hs.191179 ESTs 3.9
312219 H73505 Hs.117874 ESTs 4
312226 AA315703 Hs.199993 ESTs, Weakly similar to ALU8_HUMAN !!!! 4.9
312299 AA972712 Hs.269737 ESTs 5.7
312544 AA516420 Hs.183526 ESTs, Weakly similar to I38022 hypotheti 6.3
312638 AW439195 Hs.256880 ESTs, Weakly similar to S65657 alpha-1C- 4.9
312826 AW291545 Hs.185018 ESTs 4.9
312837 AW292286 Hs.255058 ESTs 4.4
312980 AA497043 Hs.115685 ESTs 3.1
313070 AI422023 Hs.161338 ESTs 4.3
313079 N76497 Hs.1787 proteolipid protein 1 (Pelizaeus-Merzbac 3.3
313089 AF026944 Hs.293797 ESTs 5.8
313098 AW073310 Hs.163533 Homo sapiens cDNA FLJ14142 fis, clone MA 4.5
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313126 
313166 
313197 
313280 
313325 
313328 
313352 
313385 
313393 
313417 
313434 
313569 
313591 
313615 
313915 
313975 
313979 
313997 
314043 
314078 
314097 
314121 
314129 
314138 
314236 
314244 
314305 
314306 
314322 
314394 
314401 
314465 
314506 
314510 
314546 
314547 
314558 
314627 
314648 
314691 
314729 
314754 
314814 
314864 
314881 
314882 
314981 
315006 
315021 
315051 
315060 
315073 
315080 
315175 
315183 
315193 
315196 
315198 
315240 
315263 
315282 
315296 
315368 
315397 
315489 
315498 
315526 



AA746503 
A180109S 
AW979008 
AW960454 
AI420611 
AW449211 
AW150945 
A1032087 
AI674685 
AA741151 
W92070 
AI273419 
AA046309 
A1540978 
C18863 
AW175896 
A1535895 
AV657317 
AA827082 
AW129357 
M648744 
AI732083 
AA228366 
AA740616 
AA743396 
AL036450 
AI280112 
A1697901 
M907153 
AW961597 
AI660412 
AA602917 
AA833655 
AI204418 
AW007211 
AA399272 
A1873274 
AA425310 
AW979268 
AW207206 
AA457367 
AW026761 
BE350122 
AW971198 
A1095087 
AA828032 
AW972359 
AI538613 
AA533447 
AW292425 
AA551104 
AW452948 
AA744550 
AI025842 
AW136134 
AI241331 
AI367347 
AI741506 
R38772 
AW510994 
A1222165 
AA876905 
AB037745 
AA218940 
AI378817 
AA628539 
A1193043 



Hs.283313 ESTs 
Hs.151500 ESTs 
Hs.222487 ESTs 
Hs.222830 ESTs 
Hs.127832 ESTs 

Hs.105445 GDNF family receptor alpha 1 
Hs.144758 ESTs 
Hs.269819 ESTs 
Hs.200141 ESTs 
Hs.137323 ESTs 

gb2h48g05.r1 
Hs.135146 hypothetical protein RJ13984 

gb2f12ft)1.s1 SoaresJetaUiearLNbHH19W 
Hs.301997 hypothetical protein RJ13033 
Hs.163443 Homo sapiens cDNA FU1 1576 tis, done HE 
Hs.65114 keratin 18 
Hs.221024 ESTs 

Hs.288649 hypothetical protein MGC3077 
Hs.291872 ESTs 
Hs.329700 ESTs 
Hs.269493 ESTs 
Hs.187619 ESTs 
Hs.1 15122 ESTs 

gb:ny97f11.$1 NCLCGAPJ5CB1 Homo sapiens 
Hs.189023 ESTs 
Hs.103238 ESTs 

Hs.125232 Homo sapiens cDNA RJ13266 fis, done OV 
Hs.192425 ESTs 
Hs.190060 ESTs 

Hs.130816 ESTs, Moderately similar to 138022 hypot 
Hs.234557 ESTs 
Hs.156974 ESTs 

Hs.206868 Homo sapiens cDNA FU14056 fis. done HE 
Hs.190080 ESTs 

Hs.16131 hypottietcal protein FU12876 
Hs.144341 ESTs 
Hs.190721 ESTs 

Hs.155766 ESTs,Weak(ysirnnajtoA47562B-ce0gr 

gb:EST391378 MAGE resequences. MAGP Homo 
ESTs 
ESTs 
ESTs 

ESTs, Weakly similar to 178885 serine/th 
ESTs 

ESTs, Moderately simitar to S65657 alpha 
ESTs 
ESTs 

Transmembrane protease, serine 3 
ESTs 



Hs.136319 
Hs.191638 
Hs.134374 
Hs.157367 
Hs.294068 
Hs.152299 
Hs.169076 
Hs.293334 
Hs.298241 
Hs.312989 
Hs.163484 
Hs.189048 
Hs.257631 
Hs.136345 
Hs.152530 
Hs.220277 
Hs.131765 
Hs.44898 
Hs.165900 
Hs.172619 
Hs.220740 
Hs.144923 
Hs.125286 
Hs.104696 
Hs.137516 
Hs.191847 
Hs.1 16252 
Hs,128685 



ESTs 

ESTs, Moderately similar to ALUCJWMAN ! 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs, Moderately simflarto 138937 DNA/R 

Homo sapiens clone TCCCTA00151 mRNA sequ 

ESTs, Weakly similar to ALU1 JWMAN ALU S 

myelin transcription factor 1«fike 

ESTs 

ESTs 

ESTs 

WAA1324 protein 
1 



ESTs 

ESTs, Moderately similar to AIU1_HUMAN A 
ESTs, Weakly similar to T17226 hypothec 
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3.5 

3.3 

4.7 

3.4 

12.4 

4.1 

3 

5.2 
3.5 
3.7 
3 

5.6 
3.2 
26.3 
3 

4.9 
3.9 
3.1 
8.3 
6.6 
6.2 
4 

5.9 
3.1 
4 
8 

3.7 
3.3 
42 
3.3 
4.7 
8.5 
4 

3.4 

6.7 

27.4 

4.4 

4.6 

20.7 

3.6 

3.6 

4.9 

4.3 

37 

3.1 

3 

10.9 
53 
12.9 
5.8 
42 
3.7 
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8.2 
3.6 
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315530 
315562 
315634 
315647 
315707 
315772 
315850 
315858 
315878 
315977 
315978 
315995 
316012 
316042 
316052 
316072 
316074 
316100 
316133 
316177 
316186 
316244 
316303 
316313 
316364 
316580 
316697 
316715 
316868 
316869 
316886 
316897 
316943 
317069 
317194 
317360 
317404 
317452 
317501 
317674 
317803 
317834 
317850 
317881 
317902 
317916 
318042 
318223 
318327 
318332 
318418 
318558 
318625 
318634 
318740 
318744 
318781 
319191 
319478 
319510 
319551 
319745 
319834 
319840 
319977 
320074 
320167 



AW015415 Hs.127780 ESTs 8.9 

AA737415 Hs.152826 ESTs 5.5 

AA837085 Hs^20585 ESTs 6.3 

AA648983 Hs.212911 ESTs 3.6 

AM18055 Hs.161160 ESTs 5.1 

AW515373 Hs.271249 Homo sapiens cONA FU13580 fis, clone PL 3.1 

AW270550 Hs.1 16957 ESTs 3.8. 

AA737345 Hs.294Q41 ESTs 5 

AA683336 Hs.189046 ESTs 3.1 

AW865916 Hs.151206 ESTs 4.7 

AA830893 Hs.119769 ESTs 4.1 

AI217477 Hs.194591 ESTs 4.1 

AA764950 Hs.119898 ESTs 7 

AI469960 Hs.170698 ESTs 4.9 

AJ962796 Hs.136754 ESTs 4.1 

AW517524 Hs.135201 N0D2 protein 3.2 

AW975114 Hs.293273 ESTs 3.8 

AW203986 Hs.213003 ESTs 3.2 

AI187742 Hs.125562 ESTs 3.7 

A1904982 Hs.293102 ESTs. Moderately similar to ALU1JUJMAN A 30.7 

AI433540 gb:fi69g05 j(1 NCLCGAP«.Wd11 Homo sapien 3.1 

AI640761 Hs.224988 ESTs 3.5 

AA740994 Hs.209609 ESTs 3.8 

AA741300 Hs.202599 ESTs, Weakly similar to 138022 hypotheti 4.4 

AA747807 Hs.149500 ESTs 3.2 

AA938198 Hs.146123 poly(A) polymerase gamma 9.4 

AW293174 Hs.252627 ESTs 4.4 

A1440266 Hs.170673 ESTs, Weakly similar to T24832 hypotheti 3 

AI660898 Hs.195602 ESTs 3.2 

A1954880 Hs.134604 ESTs 3.2 

AA836331 Hs.134981 ESTs 4.4 

AA838114 Hs.221612 ESTs 3.7 - 

AW014875 Hs.137007 ESTs 4.6 - 

AI732892 Hs.190489 ESTs 5.9 

AW445167 Hs.126036 ESTs 4.1 

AI125252 Hs.126419 ESTs 3.5 

A1806867 Hs.126594 ESTs 5.1 

AA972965 Hs.135568 ESTs 69 

AI822034 Hs.137097 ESTs 4.6 

AW294909 Hs.132208 ESTs 4.3 

AW664964 Hs.128899 ESTs 6.1 

X56348 Hs.287270 ret proto-oncogene (mulfiple endocrine n 3.1 

AI681545 Hs.152982 hypothetical protein FU131 17 3.4 

AI827248 Hs.224398 Homo sapiens cONA RJ1 1469 fis, done HE 9.6 

AW102941 Hs.211265 ESTs 4.1 

A1565071 Hs,159983 ESTs 103 

AW294522 Hs.149991 ESTs 3.1 
AI077540 Hs.134090 ESTs 3.9 
AW294013 Hs.200942 ESTs 3 
AI093930 Hs.163440 Homo sapiens cONA: FU210QG fis, clone C 4.4 
AF107493 Hs.118498 Homo sapiens LUCA-15 protein mRNA, spfic 5.4 
AW402677 Hs.146381 RNA binding motif protein, X dromosome 4.4 
AA526235 Hs.193162 Homo sapiens CONARJ1 1983 lis, done HE 5.9 
T49598 Hs.156832 ESTs 4 
NMJW2543 Hs.77729 oxidised tow density lipoprotein (lectin 7.3 
A1793124 Hs.144479 ESTs 17.8 
F11802 Hs.6818 ESTs 3 
NNL012391 Hs.79414 prostate epithelium-specific Ets transcr 3.6 
A1524124 Hs.270307 ESTs 4.6 
W88532 Hs.254562 ESTs 3.3 
AA761668 gb:nz24c08.s1 NCLCGAP.GCB1 Homo sapiens 3.2 

T79366 Hs.108258 acfln binding protein; macrophin (microf 3.3 
AA071267 gb'.zm61g01 .rl Stratagene fibroblast (937 6.2 

C19035 Hs.164259 ESTs 33 
AA534222 gb:nJ21d02.s1 NCLCGAPJW Homo sapiens 43 

AA321166 Hs.278233 ESTs ■ 3.4 
AA984373 Hs.90790 Homo sapiens cDNA: FU22930 fis, done K 4.1 
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320187 T99949 Hs.303428 Homo sapiens cDNA FU14832 fis, done OV 5.3 

320211 ALQ39402 Hs.125783 DEME-6 protein 9.2 

320416 AI026984 Hs.293662 ESTs 3.1 

320588 U78082 Hs.167738 RNA polymerase )l transcriptional regula 3.1 

5 320635 N50617 . Hs.80506 small nudear rioonucleoprotein polypept 6.1 

320654 AI160015 Hs.1 18112 ESTs 3.5 

320742 AI601188 Hs.120910 ESTs 3 

320832 AA214584 Hs.290167 ESTs 3.7 

320915 A1359144 Hs.143688 Homo sapiens cDNA: FU23031 fis, done L 3.1 

10 321016 BE144167 Hs.49994 hypothetical protein amOar to RNA-t>ind 3.3 

321107 AI732643 Hs.144151 ESTs 12.3 

321171 AI769410 Hs.221461 ESTs 3.3 

321253 M610649 Hs.333239 ESTs 3 

321318 AB033041 Hs.137507 vang (van gogh, DrosophilaHike 2 3.9 

15 321642 AI432199 Hs.247084 ESTs 3 

321644 AW975944 Hs.237396 ESTs 11.7 

321683 AI471598 Hs.197531 ESTs 3.8 

321753 U29112 Hs.196151 ESTs 4.4 

321811 D80630 gb:HUM091D02B Human fetal brain (TFujlwa 35 

20 321828 R59890 Hs.83623 nudear receptor subfamily 1, group I, m 3.1 

321910 HS7055 Hs.271530 ESTs, Weakly similar to ALU7_HUMAN ALU S 4.7 

321937 AL049351 Hs.302058 Hc^sapier«m^;cDNADKFZp566C093(fr 3.5 

321978 N77342 Hs.21851 Homo sapiens cDNA FU12900 fis, done NT 5 

322035 AL137517 Hs.334473 hypothetical protein DKFZp56401278 19 

25 322136 AF075083 gb:Homo sapiens full length insert cDNA 3.6 

322258 BE265745 Hs.194359 ESTs, Weakly similar to ALUCJtUMAN llil 3 

322296 W76326 gb:zd60d04.r1 Soares_fetaLhearLNbHH19W 4.4 

322303 A1357412 Hs.157601 ESTs 11.5 

322476 AW963372 Hs.46677 PRO2000 protein 3 

30 322520 T55958 gb:yb35t05.r1 Siratagene fetal spleen (9 3 

322521 AF147347 gb;Homo sapiens fall length insert cDNA 45 

322567 AF155108 Hs.256150 Homo sapiens, Similar to RIKEN cDNA 2810 4 

322595 W92147 Hs.1 18394 ESTs 5.4 
322675 AA017656 gb:ze39h01 /I Soares retina N2b4HR Homo 3.1 

35 322766 AW068805 Hs588467 Homo sapiens cDNA FU12280 fis, done MA 5.2 

322818 AW043782 Hs.293616 ESTs 7.6 

322882 AW248508 Hs.279727 Homo sapiens cDNA FU14035 fis, done HE 5.9 
322975 C16391 gb:C16391 Clontech human aorta polyA mRN 16.5 

323091 AI902456 Hs510761 ESTs, Weakly similar to I38022 hypotheti 4 

40 323131 AK002088 Hs570124 Homo sapiens cDNA FU1 1226 fis, done PL 3.3 

323168 AL120862 Hs.124165 programmed cell death 9 (PDC09) 6.3 

323244 AW675572 Hs.193620 ESTs 4.6 

323262 AL133990 Hs.190642 ESTs 10-5 
323332 AI829520 gb:wi19c06.x1 Nd.CGAP.Ut1 Homo sapiens 65 

45 323333 AV651680 Hs.208558 ESTs 4.3 

323335 AI655499 Hs.161712 ESTs 9-2 

323645 AW445014 Hs.197746 ESTs 3.1 

323663 BE081058 Hs.243023 ESTs 4 

323693 AA317962 Hs.249721 ESTs, Moderately similar to PC4259 fern 3 

50 323782 AW961560 Hs.97600 ESTs 35 
323817 AA410943 BMP-R1B 6.4 

323930 AL043683 Hs.8173 hypothetical protein FU10803 3.3 
323974 AI825204 Hs.211406 ESTs 4.5 
324001 AU044949 Hs.116298 ESTs 4,5 
55 324036 AW72078 Hs.303662 ESTs 8.4 
324261 BE069341 gb:QV3^T0381-2701(KW)73^08 BT0361 Homo 49.4 

324285 AA431159 Hs.122954 ESTs 3 
324296 AI524039 Hs.192524 ESTs 3 
324305 AA642007 Hs.1 16369 ESTs 3.3 
60 324432 AA464510 Hs.152812 ESTs 165 
324585 AI823969 Hs.132678 ESTs 3.3 
324598 AW972227 Hs.1 63986 Homo sapiens cONA: RJ22765 fis, done K 5 
324603 AW993522 Hs592934 ESTs 10.4 
324631 AA9371 16 Hs593683 ESTs, WeaWy similar to 154374 gene NF2 3.3 
65 324716 BE169746 Hs.12504 likely ortholog of mouse Arkadia 3.2 
324748 AW974941 Hs592385 ESTs, WeaWy similar to 178885 seiine/th 3 
324771 AA631739 Hs-335440 EST 3 
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324774 A1031771 Hs.132566 ESTs 4.2 

324823 AW516704 Hs.208726 ESTs 3.4 

324824 AI826999 Hs.224624 ESTs 3.1 
324826 AA704806 Hs.143842 ESTs, Weakly similar to 2004399A chromos 4.4 

5 324961 AA613792 gb:no97h03.s1 NCLCGAP_Pr2 Homo sapiens 3.9 

324987 AI375572 Hs.172634 ESTs 18.8 

324994 AI805416 Hs.213897 ESTs 3.3 

325146 AI064690 Hs.171176 ESTs 4.2 

nL__. nan r-.. A A 

JlOJ/l rfiaSe & a o cavm 

10 325544 Phase2&3Exons 5.7 

327075 Phase2&3Exons 3.8 

332798 C22000007:gi|12314195|ernb|CAB99338.1| (A 4.3 

334223 NM_005080*:Homo sapiens X-box binding pr 26.2 

334447 NM_012429 4 :Homo sapiens SEC14 (S. cerevl 3.9 

15 335809 NM 014509 4 :Homo sapiens kraken-iike (BK1 10.1 

335824 ENSP00000249072 , :OJ222E13.1 (N-TERMINAL 20 

338255 NM - 014323*:Homo sapiens zinc finger prot 9 

409430 R21945 Hs.166975 splicing factor, aiglnine/serine-rich 5 4 

428046 AW812795 Hs.155381 ESTs, Moderately similar to 138022 hypot 4.6 

20 432558 R97268 Hs.177269 ESTs 3.2 

436808 AA731602 Hs.120266 ESTs 3.9 

448569 BE382657 Hs.21486 signal transducer and activator of trans 4.1 

453542 AW836724 Hs.339660 Homo sapiens mRNA expressed only In plac 3.7 

M97935 AFFX control: STAT1 3.2 

25 M97935 AFFX control: STAT1 3 

M55150 fumarylacetoacetata 3 

M13755 Interferon stimulated protein; 15 kDa 4.5 

A1052047 ESTs 6.7 

AA252033 ESTs; Weakly stmQarto HI! ALU SUBFAMILY J 3.2 

30 AA401739 ESTs 3.3 

H18459 hepatocellular cardrKwna associated protein; 3 

R48744 ESTs 4.2 

M31682 inhibln; beta B(activinAB beta polypeptide) 3 

AA416873 ESTs 3 

3 5 080240 HUM5G1 1 A Human fetal brain (TFujtwara) Homo 4 

R49590 ESTs 3.2 

CH22_FGENES.678_5 16.8 

CH22 FGENES.619J 12.9 

CH22_FGENES.619_12 11.3 

40 . CH22_EM:ACOO5500.GENSCAN.127 9 9.2 

CH22.EM:AC005500.GENSCAN.304 2 8.5 

CH2^FGENES.271J 8.4 

CH22JGENES.619J3 8 

CH22_FGENES.271_7 7.3 

45 CH22»FGENES.617J r 7.2 

CH.07 hsgi|6004473 7.1 

CH22JFGENES.264J 6.8 
X03363 HER2 receptor tyrosine kinase (cerbB 2; ERBB2;neu) 6.6 

CH22JGENES.617_9 6.5 

50 CH.07 hsgi]5868264 5.8 

CH.19_hsgi|5867439 5.7 

CH22J : GEhES.63 5.3 

CH.17JisgP67230 5.1 

CH.20_hsgi|6552458 5.1 

55 CH22^EMAC005500.GENSCAN.14822 4.7 

CH22_FGENES.669_10 4.6 

AA034918 KIAA1028 protein 4.6 

CH22_FGENES.48_12 4.5 

CH22 W FGENES.118J2 4.5 

60 AF049569 ESTs 4.4 

M13955 multiple UniGene matches 4.3 

CH2a_FGENES.619_8 4.3 

CH2^_FGENES.137 4.3 

HG4126HT4396 Zinc Finger Protein Hzf4 4.3 

65 CH22_FGENES.360JJ 4.3 

CH22_FGENES.706_9 4.3 

CH.21Jisgij6531965 42 
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HG2614HT2710 
XS3535 

HG4716HT5158 
HG4677HT5102 



AA714311 



HG2465HT4871 



AA707750 



AA976074 



CR17_hsgl]5867215 
CH2ajGENES.669J 

CH22_FGENES.48J8 

matrix metaJloproteinase 14 (membrane inserted) 
CH22_FGENES.271_6 
CH22_FGENES.617_3 
CH2a.FGENES.290_8 

CH22.FGENES.135 
CH2a.FGENES.13 2 
CH.14_hsgi|6682474 
CH.02_hsgi|5867750 
CH22.FGENES.617J 

CH22_DJ32I1 0.GENSCAN.23 39 

CH22_FGENES.543J0 

CH22_EMAC005500.GENSCAN.96 1 

CH22_FGENES.204_2 

CH22_FGENES.619_4 

CH.16_hsgi|5867087 

EST cluster (not in UniGene) 

CH22_EMAC005500.GENSCAN.149 9 

CH22_EMAC005500.GENSCAN.421 5 

CH22_FGENES.134 

CH.07_hsgil6004478 

CH22LFGENES.360J 

CH22_FGENES.6_2 

CH2a.C20H12.GENSCAN.16 2 

CH2a.C65E1.GENSCAN.81 

ESTs; Weakly similar ta cis Golgl matrix 

CH22_FGENES.307J 

CH2a.EMAC005500.GENSCAN.248 14 

CH.06_hsgi|5902482 

CH22_FGENES.669_5 

CH22_DJ32I10.GENSCAN.19 8 

CH22_FGENES.527_6 

CH22_FGENES.330J0 

CH2a.FGENES.14 2 

ESTs 

CH2a.FGENES.226 7 
CH2a.FGENES.13 3 
CH22_EMAC005500.GENSCAN.209 12 
CH2a_FGENES.271_3 



4.1 
4.1 

Collagen, Type Viii, Alpha 1 4.1 

4.1 

4 

3.9 
3.9 
3.8 

Guanosine 5* Monophosphate Synthase 

3.8 

3.8 

3.8 

3.8 

3.7 

Oncogene Ret/Ptc2, Fusion Activated 

3.7 

3.7 

3.7 

3.5 

3.5 

3.5 

3.4 

3.4 

3.4 

3.3 

3.3 

3.3 

Dna Binding Protein Ap 2, Aft. Splice 3 

3.3 

3.2 

3.2 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3 

3 

3 

3 

3 



3.8 



3.7 



3.3 



275 



WO 02/059377 



PCT/US02/02242 



TABLE 17A 






Table 17A shows the accession numbers for those pkeys lacking UniGene IDs in Table 17.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.






Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers
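The three-column layout described by this legend can be read mechanically. The following is a minimal sketch, not part of the original disclosure, assuming each row is whitespace-separated with the Pkey first, the CAT number second, and one or more Genbank accessions after that. The names `ClusterRow` and `parse_cluster_row` are hypothetical, and the sample row is illustrative of the format only.

```python
from typing import NamedTuple, Tuple


class ClusterRow(NamedTuple):
    """One row of a Pkey / CAT number / Accession table (hypothetical type)."""
    pkey: int                    # Eos probeset identifier
    cat_number: str              # gene cluster number, kept as text (e.g. "393481_1")
    accessions: Tuple[str, ...]  # Genbank accessions making up the cluster


def parse_cluster_row(line: str) -> ClusterRow:
    """Split a whitespace-separated row into its three logical columns."""
    fields = line.split()
    if len(fields) < 3:
        raise ValueError(f"expected Pkey, CAT number and at least one accession: {line!r}")
    return ClusterRow(int(fields[0]), fields[1], tuple(fields[2:]))


row = parse_cluster_row("116845 393481_1 AA649530 AA659316 H64973")
print(row.pkey, row.cat_number, len(row.accessions))
```

Keeping the CAT number as a string avoids losing the suffix after the underscore, which an integer conversion would reject.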






Pkey CAT number Accession 

116845 393481_1 AA649530 AA659316 H64973
103207 30635_-4 X72790*
126257 182217_1 N99638 AW973750 AA328271 H90994 AA558020 AA234435 N59599 R94815
102791 37186_1 AF080229 AF080231 AF080230 AF080232 AF080233 AF080234 BE550633 AI636743 AW614951 BE467547 AI680833
AI633818 N29986 U87592 U87593 U87590 U87591 S46404 U87587 AA463992 AW206802 AI970376 AI583718 AI672574
N25695 AW665466 AI818326 AA126128 AW80345 AW013827 AA248638 AI214968 AA204735 AA207155 AA206262 AA204833
AW003247 AW496808 AI080480 AI631703 AI651023 AI867418 AW818140 AA502500 AI206199 AI671282 AI352545 BE501030
AI652535 BE465762 AA206331 AW451866 AA471088 AA206342 AA204834 AA206100 AW021661 AA332922 N66048
AA703396 H92278 AW139734 H92683 U87589 U87595 H69001 U87594 BE466420 AI624817 BE466611 AI206344 AA574397
AA348354 AI493192
126872 142696_1 AW450979 AA136653 AA136656 AW419381 AA984358 AA492073 BE168945 AA809054 AW238038 BE011212 BE011359
BE011367 BE011368 BE011362 BE011215 BE011365 BE011363
BE011367 BE011368 BE011362 BE011215 BE011365 BE011363 
112631 1746257_1 R82040 R70934
120742 176835_1 AA225084 AA302713
106864 324239_1 AI311928 AA936030 T51931 AA609816 AA487195 AA664207
109700 genbank_F09609 F09609
111532 genbank_R08440 R08440
113938 genbank_W81598 W81598
113947 genbank_W84768 W84768
124357 genbank_N22401 N22401
108733 504187_1 AA121022 AA126422
112303 genbank_R54797 R54797
322136 46802_1 AF075083 H52291 H52528
322296 47334_1 W76326 AF086341 W72300
321811 1527481_1 D80630 D80896 D80895
314648 293660_1 AW979268 AA878419 AA431342 AA431628
322520 38916_1 T55958 T57205 AF147346
322521 38917_1 AF147347 T55426 T55503
322675 86787_1 AA017656 AA017374 AA019761
323332 179142_1 AI829520 AI791832 AA228414 AI791823 AA229211 AA229315
316186 425440_1 AI433540 AA728984 AA804981
322975 1510563_1 C16391 C16413
324261 273265_1 BE069341 AW748403 AL044891 AI908240 AA393080
323817 233566_1 AA410943 AW948953 AA334202 AA332882
301976 128335_1 T97905 AA101672
324961 376239_1 AA613792 AW182329 T05304 AW858385
303642 284260_1 AW299459 AA417112
303797 386364_1 AW629759 AW749955 AA633408 AI651005
319551 357371_1 AA761668 AA573621 R92814 R09670
311935 174129_1 AA216387 T63548 AA228676
319834 112523_1 AA071267 T65940 T64515 AA071334
319977 345248_1 AA534222 AA632632 T81234
314138 179960_1 AA740616 AA654854 AA229923
313591 103087_1 AA046309 AI263500 AA046397
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308106 AI4768Q3 

338255 CH22 6856FG_JJNlCEMrtC0O 

335809 CH22_3181FGJ17_6_UNK_EM 

335824 CH2^3197FG_619J1JJNK.E 

307010 AI140014
307041 AI144243
305913 AA876109
305917 AA876469
309574 AW168083
325372 c12_hs
325544 c12_hs
332798 CH22J4FGA.5JJNKJMG1.G
334223 CH22J507FG_360JJJNK_EM
327075 c21_hs
334447 CH22J746FG_387_7JJNK_EM
304782 AA582081
313434 441798_1 W92070 AW019952 W92053
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TABLE 17B 

Table 17B shows the genomic positioning for those pkeys lacking UniGene IDs and
accession numbers in Table 17. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also
listed.

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.
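As an illustration of how the strand and nucleotide-position columns relate, the sketch below normalizes a position range so that the lower coordinate always comes first, whatever the strand. It is a hedged example rather than the method used to build the table: the function name `parse_exon` is hypothetical, and treating both endpoints as inclusive when computing the length is an assumption.

```python
def parse_exon(strand: str, nt_position: str):
    """Return (strand, start, end, length) for a range such as '232147-231974'.

    Ranges may be written high-to-low (as in several Minus-strand rows), so the
    endpoints are reordered. Length assumes inclusive coordinates.
    """
    if strand not in ("Plus", "Minus"):
        raise ValueError(f"unexpected strand label: {strand!r}")
    first, second = (int(part) for part in nt_position.split("-"))
    start, end = min(first, second), max(first, second)
    return strand, start, end, end - start + 1


print(parse_exon("Plus", "14308764-14308824"))
print(parse_exon("Minus", "232147-231974"))
```

Normalizing to ascending coordinates while keeping the strand label separate makes it easier to compare predicted exons against the same reference sequence.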



Pkey Ref Strand Nt_position

334447 Dunham, I. et al. Plus 14308764-14308824
335809 Dunham, I. et al. Plus 26310772-26310909
335824 Dunham, I. et al. Plus 26376860-26376942
332798 Dunham, I. et al. Minus 232147-231974
334223 Dunham, I. et al. Minus 12734365-12734269
338255 Dunham, I. et al. Minus 15242294-15242231
325372 5866920 Minus 1117061-1117304
325544 6682452 Plus 171228-171286
327075 6531965 Plus 4041318-4041431
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TABLE 18: Table 2 from BRCA 014 P 



Table 18 shows genes with at least five times the expression in breast tumor tissue as in
normal body tissues.

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of tumor to normal body tissue



Pkey ExAccn UnigeneID Unigene Title R1

101378 BE563085 Hs.833 interferon-stimulated protein, 15 kDa 5.3
101530 M29874 Hs.1360 cytochrome P450, subfamily IIB (phenobar 9
101767 M81057 Hs.180884 carboxypeptidase B1 (tissue) 12
101878 M97815 Hs.183650 cellular retinoic acid-binding protein 2 6.5
103010 X52509 Hs.161640 tyrosine aminotransferase 12.4
104115 AF183810 Hs.26102 opposite strand to trichorhinophalangeal 7.6
104825 AA035613 Hs.141883 ESTs 6.9
107105 AW963419 Hs.155223 stanniocalcin 2 5.3
108819 AA011449 Hs.271627 ESTs 6.1
112287 AB033064 Hs.334806 KIAA1238 protein 7.3
112561 AI791493 Hs.129873 ESTs, Weakly similar to A36036 cytochrom 8.2
112637 R82331 Hs.164599 ESTs 5.4
113206 BE262470 Hs.241471 RNB6 6.2
113970 W27249 Hs.8109 hypothetical protein FLJ21080 6.9
114965 AI733881 Hs.72472 BMP-R1B 10.1
118925 N92293 Hs.206832 ESTs, Moderately similar to ALU8_HUMAN A 19.7
119905 AW449064 Hs.119571 collagen, type III, alpha 1 (Ehlers-Danl 8.4
121611 M31669 Hs.1735 inhibin, beta B (activin AB beta polypep 5.6
129301 AF182277 Hs.330780 cytochrome P450, subfamily IIB (phenobar 6.2
133976 AI908165 Hs.169946 GATA-binding protein 3 (T-cell receptor 6.2
134731 D89377 Hs.89404 msh (Drosophila) homeo box homolog 2 5.8
300254 AW183618 Hs.55610 solute carrier family 30 (zinc transport 9.9
301884 AA312082 Hs.105445 GDNF family receptor alpha 1 5.7
302001 AB020711 Hs.278346 KIAA0904 protein 7.7
302067 BE542706 Hs.222399 CEGP1 protein 7.3
302276 AW057736 Hs.323910 HER2 receptor tyrosine kinase (c-erb-b2, 5.4
302290 AA179949 Hs.175563 Homo sapiens mRNA; cDNA DKFZp564N0763 (f 34.1
302372 AL117406 Hs.200102 ATP-binding cassette transporter MRP8 6.7
302385 AJ224172 Hs.204096 lipophilin B (uteroglobin family member) 13.8
309177 AI951118 Hs.326736 Homo sapiens breast cancer antigen NY-BR 17.3
309583 AW170035 Hs.326736 Homo sapiens breast cancer antigen NY-BR 57.6
310781 AI380797 Hs.158992 ESTs 10.2
311166 AI821005 Hs.118599 ESTs 10.8
311935 AA216387 gb:nc16b02.s1 NCI_CGAP_Pr1 Homo sapiens 5.2
312153 BE261944 Hs.118625 hexokinase 1 5.2
313328 AW449211 Hs.105445 GDNF family receptor alpha 1 12.4
313915 C18863 Hs.163443 Homo sapiens cDNA FLJ11576 fis, clone HE 26.3
314097 AA648744 Hs.269493 ESTs 6.6
314138 AA740616 gb:ny97f11.s1 NCI_CGAP_GCB1 Homo sapiens 5.9
314506 AA833655 Hs.206868 Homo sapiens cDNA FLJ14056 fis, clone HE 8.5
314558 AI873274 Hs.190721 ESTs 27.4
314691 AW207206 Hs.136319 ESTs 20.7
315006 AI538613 Hs.298241 Transmembrane protease, serine 3 10.9
315021 AA533447 Hs.312989 ESTs 5.3
315051 AW292425 Hs.163484 ESTs 12.9
315060 AA551104 Hs.189048 ESTs, Moderately similar to ALUC_HUMAN ! 5.8
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315196 AI367347 Hs.44898 Homo sapiens clone TCCCTA00151 mRNA sequ 8.2
315530 AW015415 Hs.127780 ESTs 8.9
315634 AA837085 Hs.220585 ESTs 6.3
316012 AA764950 Hs.119898 ESTs 7
316177 AI904982 Hs.293102 ESTs, Moderately similar to ALU1_HUMAN A 30.7
316580 AA938198 Hs.146123 poly(A) polymerase gamma 9.4
317803 AW664964 Hs.128899 ESTs 6.1
317881 AI827248 Hs.224398 Homo sapiens cDNA FLJ11469 fis, clone HE 9.6
318740 NM_002543 Hs.77729 oxidised low density lipoprotein (lectin 7.3
318744 AI793124 Hs.144479 ESTs 17.8
320211 AL039402 Hs.125783 DEME-6 protein 9.2
321107 AI732643 Hs.144151 ESTs 12.3
321644 AW975944 Hs.237396 ESTs 11.7
321978 N77342 Hs.21851 Homo sapiens cDNA FLJ12900 fis, clone NT 5
322035 AL137517 Hs.334473 hypothetical protein DKFZp564O1278 19
322766 AW068805 Hs.288467 Homo sapiens cDNA FLJ12280 fis, clone MA 5.2
322818 AW043782 Hs.293616 ESTs 7.6
322975 C16391 gb:C16391 Clontech human aorta polyA mRN 16.5
323262 AL133990 Hs.190642 ESTs 10.5
323332 AI829520 gb:wi19c06.x1 NCI_CGAP_Ut1 Homo sapiens 6.2
323817 AA410943 BMP-R1B 8.4
324261 BE069341 gb:QV3-BT0381-270100-073-c08 BT0381 Homo 49.4
324432 AA464510 Hs.152812 ESTs 16.5
324598 AW972227 Hs.163986 Homo sapiens cDNA: FLJ22765 fis, clone K 5
324603 AW993522 Hs.292934 ESTs 10.4
324987 AI375572 Hs.172634 ESTs 18.8
325544 Phase2&3Exons 5.7
330388 Hs.46 HER2 receptor tyrosine kinase (c-erb-b2, 6.6
334223 NM_005080*:Homo sapiens X-box binding pr 26.2
335809 NM_014509*:Homo sapiens kraken-like (BK1 10.1
335824 ENSP00000249072*:DJ222E13.1 (N-TERMINAL 20

AI052047 ESTs; Weakly similar to CYTOCHROME P450 6.7 

R72427 CH22_EMAC005500.GENSCAN.127 9 55 

CH22J : GENES.619_13 9.2 

CH22.FGENES.617 9 8 

CH22LFGENES.271 - 7 65 

CH22J=GENES.619_7 7.3 

CH22J : GENES.271_8 12.9 

CH22.FGENES.619J2 6.4 

CH22_EM:AC005500.GENSCAN.304 2 1 1 .3 

CH.07_hsglJ6004473 8.5 

CH22_FGENES.617_7 7.1 

CH22_FGEHE$m 5 72 

CH22_FGENES.678_5 16.8 
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TABLE 18A 

Table 18A shows the accession numbers for those pkeys lacking UniGene IDs in Table 18.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey CAT number Accession

323332 179142_1 AI829520 AI791832 AA228414 AI791823 AA229211 AA229315
322975 1510563_1 C16391 C16413
324261 273265_1 BE069341 AW748403 AL044891 AI908240 AA393080
323817 233566_1 AA410943 AW948953 AA334202 AA332882
311935 174129_1 AA216387 T63548 AA228676
314138 179960_1 AA740616 AA654854 AA229923
335809 CH22_3181FGJ17_6JJN1^_EM
335824 CH22_3197FG_619J1JJNK_E
325544 c12_hs
334223 CH22J507FG.360JJJNK-EM
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TABLE 18B 

Table 18B shows the genomic positioning for those pkeys lacking UniGene IDs and
accession numbers in Table 18. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also
listed.

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position

335809 Dunham, I. et al. Plus 26310772-26310909
335824 Dunham, I. et al. Plus 26376860-26376942
334223 Dunham, I. et al. Minus 12734365-12734269
325544 6682452 Plus 171228-171286
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TABLE 19: 1045 GENES UP-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT TISSUES 






Table 19 shows 1045 genes up-regulated in breast cancer compared to normal adult tissues.
These were selected from 59680 probesets on the Affymetrix/Eos-Hu03 GeneChip array such
that the ratio of "average" breast cancer to "average" normal adult tissues was greater than or
equal to 2.5. The "average" breast cancer level was set to the 90th percentile value. The
"average" normal adult tissue level was set to the 90th percentile value amongst 144
non-malignant tissues. In order to remove gene-specific background levels of non-specific
hybridization, the 15th percentile value amongst the 144 non-malignant tissues was subtracted
from both the numerator and the denominator before the ratio was evaluated.
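The selection rule just described can be sketched in a few lines. This is an illustrative reading of the text, not the code used to generate Table 19: the percentile convention (linear interpolation between order statistics), the handling of a non-positive denominator, and the names `percentile`, `tumor_to_normal_ratio` and `is_up_regulated` are all assumptions introduced here, and the sample values are made up for the arithmetic.

```python
def percentile(values, pct):
    """Percentile by linear interpolation between order statistics (assumed convention)."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("percentile of an empty sample is undefined")
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def tumor_to_normal_ratio(tumor_levels, normal_levels):
    """90th-percentile tumor over 90th-percentile normal, each minus the
    15th percentile of the normal tissues as a background estimate."""
    background = percentile(normal_levels, 15)
    numerator = percentile(tumor_levels, 90) - background
    denominator = percentile(normal_levels, 90) - background
    if denominator <= 0:
        return None  # assumption: leave the ratio undefined rather than divide
    return numerator / denominator


def is_up_regulated(tumor_levels, normal_levels, threshold=2.5):
    """Apply the stated cut-off of a ratio greater than or equal to 2.5."""
    ratio = tumor_to_normal_ratio(tumor_levels, normal_levels)
    return ratio is not None and ratio >= threshold


# Made-up expression levels for a single probeset.
normal = list(range(101))  # 15th percentile = 15, 90th percentile = 90
tumor = [315] * 5          # 90th percentile = 315
print(tumor_to_normal_ratio(tumor, normal))  # (315 - 15) / (90 - 15) = 4.0
print(is_up_regulated(tumor, normal))
```

Subtracting the same background from numerator and denominator means a probeset with high non-specific signal in every tissue is not rewarded with an inflated ratio.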






Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of tumor to normal body tissue

Pkey ExAccn UnigeneID Unigene Title



408591 
406964 
400291 
407277 
449746 
426878 
400292 
427585 
408045 
407178 
407377 
450705 
407212 
428848 
404561 
407980 
447350 
450375 
422109 
435496 
453160 
420813 
415989 
422505 
424399 
423575 
429441 
431474 
448595 
427217 
402578 
422805 
424634 
456207 
424086 
459587 



AF015224 

M21305 

AA401369 

AW170035 

AI668594 

BE069341 

AA250737 

D31152 

AW138959 

AA195651 

C16391 

U90304 

AA412108 

NMJJ00230 

AA046309 

AI375572 

AA009647 

S73265 

AW840171 

AI263307 

X51501 

AI267700 

AL120862 

AI905687 

C18863 

AJ224172 

AL133990 

AB014544 

AA399272 

AA436989 

NMJJ03613 

AA193450 

AI351010 

AA031956 



Hs.46452 

Hs.190721 
Hs.326736 
Hs.176588 

Hs.72472 
Hs.179729 
Hs.245123 
Hs.104106 

Hs.25351 

Hs.269350 

Hs.194236 



Hs.172634 

Hs.8850 

Hs.1473 

Hs.265398 

Hs.239884 

Hs.99949 

Hs.317584 

Hs.124165 

Hs.2533 

Hs.163443 

Hs.204096 

Hs.190642 

Hs.21572 

Hs.144341 

Hs.121017 
Hs.151407 



mammaglobin 1 

gbfluman alpha satellite and sateEte 3 
ESTs 

Homo sapiens breast cancer antigen NY-8R 
ESTs. Weakly sirnilarto CP4YJKJMAN CYTOC 
gb:QV3^T0381-270irj(H)73-c08 BT0381 Homo 
BMP-R1B 

collagen, type X, alpha 1 (Schmld metaph 
ESTs 
ESTs 

gb:C16391 Clontech human aorta pdyA mRN 
Iroquois homeobox protein 5 
ESTs 

teptin (murine obesity homolog) 
trichorhinophalangeal syndrome I (TRPS1) 
gb:zf12f01.s1 So^resJetal_hearLNbHH19W 
ESTs 

a disintegrin and metatloprotelnase doma 
gastrin-reteasing peptide 
ESTs. Weakly similar to transformations 
H2B hlstone famSy, member L 
proladin-induced protein 
ESTs 

programmed ceD death 9 (PDCD9) 
aldehyde dehydrogenase 9 family, member 
Homo sapiens cDNA FU1 1576 fis, done HE 
lipophilin B (uteroglobin family member) 
ESTs 

K1AA0644 gene product 
ESTs 

C1001134:gq2117372|ptr||i65981 fatty ac 
H2A hbtone family, member A 
cartilage Intermediate layer protein, nu 
gtozr40e07.fi Scares JIhHMPu_S1 Homosapi 



Hs. 102267 tysyl oxidase 



gtozk15e04.s1 StjaresjMegnanLuterusJJbH 



R1 

137.6 

71.0 

68.4 

54.2 

46.4 

44.8 

37.4 

32.9 

31.9 

30.4 

27.7 

24.8 

22.0 

21.9 

21.8 

19.8 

17.3 

16.6 

16.5 

16.0 

15.8 

15.8 

15.5 

14.8 

14.5 

13.7

13.6 

13.5 

13.0 

12.8 

12.6 

12.2

12.0 

11.9 

11.5

11.5
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15 






416203 
429170 
407276 
434377 
448390 
452401 
421037 
452461 
443348 
421155 
402606 
409269 
447268 
447033 
400295 
424905 
432441 
427365 
438950 
422835 
425692 
411869 
439820 
445730 
459583 
452744 
432596 
400297 
449448 
423945 
406348 
424735 
453392 
433365 
405654 
418601 
451110 
408771 
409041 
442353 
451561 
424001 



AW291168 

NM_001394 

AI951118 

AW137148 

AL035414 

NMJJQ7115 

AI684808 

N78223 

AW373598 

H87879 

AA576953 

AI370413 

AI357412 

W72838 

NNLQ02497 

AW292425 

AI873274 

H23789 

BE218705 

D90041 

W20027 

AL360204 

AI624342 

A1907673 

AI267652 

AJ224741 

AJ127076 

D60730 

AA410943 



423887 

405095 

419296 

447164 

413472 

416747 

415385 

434424 

420931 

406687 

400285 

437207 

427119 

429534 

433426 

411078 

426214 

447475 

415263 

439569 

414142 

426261 

444783 

445885 



U31875 
U23752 
AF026944 
NA 

AA279490 
AI955040 
AW732573 
AB033025 
BE379594 
N52812 
W67883 
NM.007050 
AL080207 
NA 

AA236115 

AF026941 

BE242870 

AW876523 

R17798 

AI811202

AF044197 

M31126 

NA 

T27503 

AW880562 

AW976987 

H69125 

AI222020 

H59846 

AI380797 

M948033 

AW602166 

AW368397 

AW242243 

AK001468 

AI734009



Hs.22972 

Hs.36563 

Hs.157601 

Hs.2533 

Hs.153704 

Hs.163484 

Hs.190721 

Hs.144530 

Hs.121378 

Hs.155956 

Hs.23439 

Hs.283853 

Hs.170042 

Hs.30504 
Hs.278461 
Hs.334473 
Hs.57471 



Hs.41295 ESTs, Weakly similar to MUC2_HUMAN MUCIN
Hs.2359 dual specificity phosphatase 4
Hs.326736 Homo sapiens breast cancer antigen NY-BR
Hs.306593 Homo sapiens cDNA FLJ11382 fis, clone HE
Hs.21068 hypothetical protein
Hs.29352 tumor necrosis factor, alpha-induced pro
Hs.197653 programmed cell death 9 (PDCD9)
Hs.108106 transcription factor
Hs.182278 calmodulin 2 (phosphorylase kinase, delt
Hs.102267 lysyl oxidase

NM_024626:Homo sapiens hypothetical prot
hypothetical protein FLJ13352
hypothetical protein FLJ22418
ESTs 

aldehyde dehydrogenase 9 family, member 
NIMA (never in mitosis gene a)-related k
ESTs
ESTs
EST

metallothionein-like 5, testis-specific
N-acetyltransferase 1 (arylamine N-acety
ESTs

Homo sapiens mRNA full length insert cDN
ESTs 

gb:IL-BT152-080399-004 BT152 Homo sapien 
Homo sapiens mRNA; cDNA DKFZp434E082 (fr 
matrilin 3

hypothetical protein DKFZp56401278 
ESTs 

gburt32h03/1 Soares ovary tumor NbHOT H 
Target Exon 

short-chain alcohol dehydrogenase family 
SRY (sex determining region Y)-box 11
ESTs

C12001521:gi|7513934|pir||T31081 cca3 pr
calmegin

ESTs, Weakly similar to transformation-r
potassium voltage-gated channel, delayed
KIAA1199 protein

ESTs, Moderately similar to ALU7_HUMAN A
ESTs

paternally expressed 10
protein tyrosine phosphatase, receptor t
DKFZP434G232 protein
Target Exon
ESTs 

Homo sapiens cig5 mRNA, partial sequence 
solute carrier family 1 (glial high affi 
hypothetical protein FLJ12910
COBW-like protein

Homo sapiens cDNA: FLJ23523 fis, clone L
small inducible cytokine B subfamily (Cy
matrix metalloproteinase 11 (MMP11; stro
Eos Control

hypothetical protein FLJ12910
ESTs 

ESTs, Weakly similar to 2109260A B cell
ESTs

CocoaCrisp

ESTs, Moderately similar to ALU7_HUMAN A
ESTs
ESTs

CEGP1 protein

Homo sapiens cDNA FLJ14438 fis, clone HE



Hs.272499 
Hs.32964 
Hs.293797 

Hs.66368 
Hs.265398 
Hs.47584 
Hs.50081 
Hs.49136 
Hs.177403 
Hs.137476 
Hs.225952 
Hs.134585 

Hs.120785 
Hs.17518 
Hs.75379 
Hs.15929 
Hs.7535 
Hs.325335 
Hs.100431 
Hs.272620 

Hs.15929 
Hs.114574
Hs.163327 
Hs.133525 
Hs.182364 
Hs.128355 
Hs.158992 
Hs.130853 
Hs.222399 
Hs.150042 
Hs.166670 
Hs.62180 
Hs.127699 



anillin (Drosophila Scraps homolog), act
KIAA1603 protein



11.5
11.5 
11.4 
11.3 
11.1 
11.0 
10.9 
10.7 
10.6 
10.5
10.4 
10.4 
10.3 
10.2
10.1 
10.1 
9.9 
9.9 
9.8 
9.7 
9.7 
9.6 
9.6 
9.5 
9.3 
9.2
9.1 
9.1 
9.1 
9.1 
9.0 
9.0 
9.0 
8.8 
8.8 
8.8 
8.7 
8.5 
8.4 
8.3 
8.2 
8.2
8.1 
8.1 
8.1 
8.0 
8.0 
8.0 
8.0 
7.9 
7.9 
7.9 
7.8 
7.7 
7.6 
7.5 
7.5
7.5 
7.4 
7.4 
7.3 
7.2
7.2
7.1 
7.0 
6.9
6.9 
429432 AI578059 Hs.202676 synaptonemal complex protein 2 6.9
410781 AI375672 Hs.165028 ESTs 6.9
443788 AI732643 Hs.144151 ESTs 6.9
421373 AA808229 Hs.167771 ESTs 6.8
451398 AJ793124 Hs.144479 ESTs 6.8
404253 NM_021058*:Homo sapiens H2B histone fami 6.8
441098 AI015591 Hs.131004 ESTs, Weakly similar to T17227 hypotheti 6.7
426215 AW963419 Hs.155223 stanniocalcin 2 6.6
428227 AA321649 Hs.2248 small inducible cytokine subfamily B (CX 6.6
422956 BE545072 Hs.122579 hypothetical protein FLJ10461 6.6
411111 AW818127 gb:CM1-ST0277-061299-059-b07 ST0277 Homo 6.6
434988 AI418055 Hs.161160 ESTs 6.6
442580 AI733682 Hs.130239 ESTs 6.6
449611 AI970394 Hs.197075 ESTs 6.6
408000 L11690 Hs.620 bullous pemphigoid antigen 1 (230/240kD) 6.5
420757 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 6.5
431089 BE041395 Hs.283676 ESTs, Weakly similar to unknown protein 6.5
400301 X03635 Hs.1657 estrogen receptor 1 6.5
427356 AW023482 Hs.97849 ESTs 6.5
425704 U79293 Hs.159264 Human clone 23948 mRNA sequence 6.4
441134 W29092 Hs.7678 cellular retinoic acid-binding protein 1 6.4
424902 NM_003866 Hs.153687 inositol polyphosphate-4-phosphatase, ty 6.4
448693 AW004854 Hs.228320 hypothetical protein FLJ23537 6.4
431448 AL137517 Hs.334473 hypothetical protein DKFZp564O1278 6.2

444342 NM_014398 Hs.10887 similar to lysosome-associated membrane 6.1
422168 AA586894 Hs.112408 S100 calcium-binding protein A7 (psorias 6.1
453331 AI240665 Hs.8895 ESTs 6.1
418007 M13509 Hs.83169 matrix metalloproteinase 1 (MMP1; inters 6.0
441233 AA972965 Hs.135568 ESTs 6.0
418092 R45154 Hs.106604 ESTs 6.0
430044 AA464510 Hs.152812 ESTs 5.9
432837 AA310693 Hs.87329 HSPC072 protein 5.9
433285 AW975944 Hs^37396 ESTs 5.9
450701 H39960 Hs.288467 Homo sapiens cDNA FLJ12280 fis, clone MA 5.9
425707 AF115402 Hs.11713 E74-like factor 5 (ets domain transcript 5.9
410785 AW803341 gb:IL2AJMQ079490m050-D03 UM0079 Homo 5.9
425398 AL049589 Hs.156369 hypothetical protein similar to tenascin 5.9
414812 X72755 Hs.77367 monokine induced by gamma interferon 5.8
459371 R20991 gb:yg06h01j1 Soares infant brain 1N1BH 5.8
411284 N28519 Hs.135191 ESTs, Weakly similar to unnamed protein 5.8
453511 AL031224 Hs.33102 transcription factor AP-2 beta (activati 5.8
451807 W52854 Hs.27099 hypothetical protein FLJ23293 similar to 5.7
430510 AW162916 Hs.241576 hypothetical protein PRO2577 5.7
415539 AI733881 HsJ2472 BMP-R1B 5.6
438199 AW016531 Hs.122147 ESTs 5.6
417866 AW067903 Hs.82772 collagen, type XI, alpha 1 5.5
430019 AA463893 Hs.220933 ESTs 5.5
439809 R41396 Hs.101774 hypothetical protein FLJ23045 5.5
423811 AW299598 Hs.50895 homeo box C4 5.4

434539 AW748078 Hs.214410 ESTs, Weakly similar to MUC2_HUMAN MUCIN 5.4
439138 AI742605 Hs.193696 ESTs 5.4
453931 AL121278 Hs.25144 ESTs 5.4
444078 BE246919 Hs.10290 U5 snRNP-specific 40 kDa protein (hPrp8- 5.4
447102 BE167434 Hs.98471 ESTs, Weakly similar to T18712 hypotheti 5.4
451621 AI879148 Hs.26770 fatty acid binding protein 7, brain 5.4
425236 AW067800 Hs.155223 stanniocalcin 2 5.3
421464 AA291553 Hs.190086 ESTs 5.3
450736 AW970060 gb:EST382140 MAGE resequences, MAGK Homo 5.3
428085 AA421081 Hs.12388 ESTs 5.3
452838 U65011 Hs.30743 preferentially expressed antigen in mela 5.3
445424 AB028945 Hs.12696 cortactin SH3 domain-binding protein 5.3
456938 X52509 Hs.161640 tyrosine aminotransferase 5.3
422867 L32137 Hs.1584 cartilage oligomeric matrix protein (COM 5.2
438167 R28363 Hs.24286 ESTs 5.2
433330 AW207084 Hs.132816 hypothetical protein MGC14801 5.2
449765 N92293 Hs.206832 ESTs, Moderately similar to ALU8_HUMAN A 5.2
416276 U41060 Hs.79136 LIV-1 protein, estrogen regulated 5.2
400300 X03363 HER2 receptor tyrosine kinase (c-erb-b2, 5.2
418004 U37519 Hs.87539 aldehyde dehydrogenase 3 family, member 5.2
439840 AW449211 Hs.105445 GDNF family receptor alpha 1 5.2
428771 AB028992 Hs.193143 KIAA1069 protein 5.2
455047 AW852530 gb:PM1£T0243471099401^06CT0243 Homo 5.2
419169 AW851980 Hs.262346 ESTs, Weakly similar to S72482 hypotheti 5.2
453197 AI916269 Hs.109057 ESTs, Weakly similar to ALU5_HUMAN ALU S 5.1
400298 AA032279 Hs.61635 six transmembrane epithelial antigen of 5.1
431023 AI283133 Hs.297420 ESTs 5.1
427666 AI791495 Hs.180142 calmodulin-like skin protein 5.1
427718 AI798680 Hs.25933 ESTs 5.1
434531 AA642007 Hs.116369 ESTs 5.1
429220 AW207206 Hs.136319 ESTs 5.1
405494 NA C2001837*:gi|12697903|dbj|BAB21770.1| (A 5.1
452930 AW195285 Hs.194097 ESTs, Weakly similar to I38022 hypotheti 5.1
444910 AI201849 gb:qs76g04.x1 NCI_CGAP_Pr28 Homo sapiens 5.1
453310 X70697 Hs.553 solute carrier family 6 (neurotransmitte 5.0
444381 BE387335 Hs.283713 ESTs, Weakly similar to S64054 hypotheti 5.0
450603 R43646 Hs.12422 ESTs 5.0
416575 W02414 Hs.38383 ESTs 5.0
438504 AW665281 Hs.224625 ESTs 5.0
416209 AA236776 Hs.79078 MAD2 (mitotic arrest deficient, yeast, h 5.0

428804 AK000713 Hs.193736 hypothetical protein FLJ20706 5.0
420077 AW512260 Hs.87767 ESTs 4.9
450480 X82125 Hs.25040 zinc finger protein 239 4.9
437637 AJ003029 Hs.65792 syntrophin, gamma 2 4.9
431808 M30703 Hs.270833 amphiregulin (schwannoma-derived growth 4.9
418836 AI655499 Hs.161712 ESTs 4.8
442441 AI820662 Hs.129598 ESTs 4.8
435635 AF220050 Hs.181385 uncharacterized hematopoietic stem/proge 4.8
400286 NA C16000922:gi|7499103|pir||T20903 hypothe 4.8
407506 U71600 gb:Human zinc finger protein zfp31 (zf31 4.8
420026 AI831190 Hs.166676 ESTs 4.8
441377 BE218239 Hs.202656 ESTs 4.8
457726 AI217477 Hs.194591 ESTs 4.8
412785 AW997556 Hs.78521 KIAA1717 protein 4.8
428358 BE440042 Hs.83326 matrix metalloproteinase 3 (stromelysin 4.7
436026 AI349764 Hs.217081 ESTs 4.7
409110 AA191493 Hs.48778 niban protein 4.7
400284 NA estrogen receptor 1 4.7
410102 AW248508 Hs.279727 Homo sapiens cDNA FLJ14035 fis, clone HE 4.7
407819 R42185 Hs.274503 ESTs 4.7
430486 BE062109 Hs.241551 chloride channel, calcium activated, fam 4.7
422896 AW961459 Hs.154116 ESTs 4.7
453616 NM_003462 Hs.33846 dynein, axonemal, light intermediate pol 4.7

427427 AF077345 Hs.177936 ESTs 4.6
421751 AW813731 Hs.159153 ESTs, Moderately similar to S65657 alpha 4.6
454074 R63503 Hs.28419 ESTs 4.6
405718 C4000799*:gi|6330365|dbj|BAA86508.1| (AB 4.6
444649 AW207523 Hs.197628 ESTs 4.6
429431 Z40313 Hs.106330 Homo sapiens clone IMAGE:23371, mRNA seq 4.6
427811 M81057 Hs.180884 carboxypeptidase B1 (tissue) 4.6
447342 AI199268 Hs.19322 Homo sapiens, Similar to RIKEN cDNA 2010 4.6
430345 AK000282 Hs.239681 hypothetical protein FLJ20275 4.6
454307 AW855717 gb:RC1-CT0279481299413^1 CT0279 Homo 4.6
400303 AA242758 Hs.79136 LIV-1 protein, estrogen regulated 4.6
438180 AA808189 Hs.272151 ESTs 4.6
451340 AW936273 gb:QV0-DT0020-090200-107-g07 DT0020 Homo 4.6
458711 AL036877 Hs.282878 ESTs 4.6
457430 AA514660 Hs.128443 ESTs 4.6
416030 H15261 Hs.21948 ESTs 4.6
447233 AW246333 Hs.17901 Homo sapiens, clone IMAGE:3937015, mRNA, 4.6
445537 AJ245671 Hs.12844 EGF-like-domain, multiple 6 (EGFL6) 4.5
424590 AW966399 Hs.46821 hypothetical protein FLJ20086 4.5
432374 W68815 Hs.301885 Homo sapiens cDNA FLJ11346 fis, clone PL 4.5
423833 AW503329 gb:UWF-BNO^oc-e^2-(WJlJl NIH.MGCJ0 4.5
406747 AI925153 Hs.217493 annexin A2 4.5
412102 H56435
431716 D89053
411050 AW814902
401418 NA
436194 AK001074
436211 AK001581
414080 AA135257 
424115 AA335497 
415786 AW419196 
442117 AW664964 
452784 BE463857 
432731 R31178 
410534 AW905138 
405196 NA 
430217 N47863 
401793 

415747 AA381209 
423679 AB007975 
400238 NA 
425627 AF019612 
400608 

458634 AV657310 
407771 AL138272 
405906 NA 
405925 NA 
439382 BE247684 
445263 H57646 
407162 N63855 
419536 AA603305 
454359 N71277 
411558 AA102670 
450715 AI266484 
421451 AA291377 
452864 AA033714 
409757 NM_001898
413043 BE158766 
413499 BE144884 
444619 BE538082 
408380 AF123050 
406992 S82472 
404255 NA
425247 NM_005940
428046 AW812795 
446163 AA026880 
421147 AW592167 
426451 AI908165 
415227 AW821113 
452176 AA024538 
452862 AW378065 
443646 AW85198 
425523 AB007948 
424687 J05070 
430009 AA894564 
434469 AA534806 
451381 BE241831 
450229 R18717 
455700 BE068115 
431924 AK000850 
438885 AI886558 
401451 

431676 AJ685464 
409092 AI735283 
429270 W60379 
443903 AI220547 
427122 AW057736 
410275 U85658 
432912 BE007371 



Hs.70725 
Hs.31570 
Hs.50831 



gb:yq98e09.r1 Soares fetal liver spleen 4.5

Hs.268012 fatty-acid-Coenzyme A ligase, long-chain 4.5
gb:MR1-ST0206-120400-022-f08 ST0206 Homo 4.5

C14000338*:gi|7459502|pir||S74665 outer 4.5

Hs.333435 Homo sapiens cDNA FLJ10212 fis, clone HE 4.5

Hs.334828 hypothetical protein FLJ10719; KIAA1794 4.4

Hs.47783 B aggressive lymphoma gene 4.4

Hs.293965 ESTs, Weakly similar to I38022 hypotheti 4.4

Hs.257924 hypothetical protein FLJ13782 4.4

Hs.128899 ESTs 4.4

Hs.151258 hypothetical protein FLJ21062 4.4

Hs.287820 fibronectin 1 4.4
gb:QV0-NN1071-280400-207-g07 NN1071 Homo 4.4

C200066^gl|7512792|pir||T12482 hypothe 4.4

Hs.336901 ribosomal protein S24 4.4

C17001545:gi|5360127|gb|AAD42882.1|AF155 4.4

gb:EST94257 Activated T-cells I Homo sap 4.4

Hs.131454 KIAA0506 protein 4.4

C19000274*:gi|12741327|ref|XP_008833.2| 4.4

Hs.297007 membrane-bound transcription factor prot 4.4

C10001899:gi|7508633|pir||T25392 hypothe 4.4

Hs.282898 ESTs 4.3

Hs.62713 ESTs 4.3

Target Exon 4.3

Target Exon 4.3

Hs.103070 ESTs 4.3

Hs.42586 KIAA1560 protein 4.3

Hs.142634 zinc finger protein 4.3

gbuip12d1U1NCI_CGAP_Pr3 Homo sapiens 4.3

gb:za36e03.s1 Soares fetal liver spleen 4.3

gamma-aminobutyric acid (GABA) A recepto 4.2

ESTs, Weakly similar to KIAA1324 protein 4.2

ESTs 4.2

Hs.287629 hypothetical protein FLJ14260 4.2

Hs.123114 cystatin SN 4.2

gb:IL2-HT0397-071299-024-F02 HT0397 Homo 4.2
gb:CM0-HT0182-041099-065^11 HT0182 Homo 4.2

Hs.8172 ESTs, Moderately similar to A46010 X-lin 4.2

Hs.44532 diubiquitin 4.2

gb:beta -pol=DNA polymerase beta {exon a 4.2

C6001909:gi|704441|dbj|BAA18909.1| (D298 4.2

Hs.155324 matrix metalloproteinase 11 (MMP11; stro 4.2

Hs.155381 ESTs, Moderately similar to I38022 hypot 4.2

Hs.25252 prolactin receptor 4.2

Hs.293299 ESTs 4.2

Hs.169946 GATA-binding protein 3 (T-cell receptor 4.2

Hs.72402 ESTs 4.2

Hs.282990 Human DNA sequence from clone RP1-28H20 4.2

Hs.8687 ESTs 4.2

Hs.164226 ESTs 4.2

Hs.158244 KIAA0479 protein 4.1

Hs.151738 matrix metalloproteinase 9 (gelatinase B 4.1

Hs.22242 ESTs 4.1

gb:ab28c02.r1 Stratagene lung (937210) H 4.1

Hs.172330 hypothetical protein MGC2705 4.1

Hs.8929 hypothetical protein FLJ11362 4.1
gb:CM1-BT0368-061299^g07 BT0368 Homo 4.1

Hs.272203 Homo sapiens cDNA FLJ20843 fis, clone AD 4.1

Hs.184987 ESTs 4.1

NM_004496*:Homo sapiens hepatocyte nucle 4.1

gb^88RJ4jc1 NCLCGAPJT28 Homo sapiens 4.1

Hs.172608 ESTs 4.1

Hs.57773 ESTs 4.1

Hs.135223 ESTs 4.1

Hs.323910 HER2 receptor tyrosine kinase (c-erb-b2, 4.1

Hs.61796 transcription factor AP-2 gamma (activat 4.1

Hs.200313 ESTs 4.1







WO 02/059377 



PCT/US02/02242 




403585 
438295 
420380 
431118 
416182 
418594
400555 
410079 
427674 
427131 
439759 
429353 
421296 
418819 
424188 
455431 
404142 
441143 
444540 
415579 
452891 
414605 
452281 
417801 
446232 
447377 
437854 
446140 
452240 
459574 
458673 
444858 
452166 
452681 
450192 
406554 
416259 
445813 
451024 
413930 
401781 
415296 
452564 
442500 
419759 
424633 
439699 
428042 
452501 
453049 
443213 
443489 
455092 
401785 
426427 
446009 
436033 
451067 
419348
444635 
412140 
403593 
442323 
419854 
433871 
445253 
409542 



AI394151 

AA640891 

BE264901 

NM_004354

AA296520 

U94362 

NWL0G3523 

AA448460 

AL359055 

AL117406 

NM_002666

AA228776

AW954552 

AW938484 

NA 

AI027604 

AI693927 

AA165232 

N75582 

BE390440

T93500 

AA417383 

AI281848

X77343 

AL119723 

AA356170 

AI591147 

AI741122 

N99626 

AI199738 

AI948607

AF153330 

AA263143 

NA 

AA573006 
Z42023 
AA442176 
M86153 

F05086 

AA026777 

AI819068 

221336 

AI472106

AF086534 

AA419529 

AB037791 

BE537217 

BE568414 

AI073512 

BE152428 



AJ989885 

H75391 

BE172186 

AA236645 

AI184268

AA219691 

NA 

AW016669 

AW664873 

W02410 

AI217928 

AA503020 



Target Exon 4.1

Hs.37932 ESTs 4.1

Hs.102406 ESTs 4.1

Hs.250502 carbonic anhydrase VIII 4.1

Hs.79069 cyclin G2 4.1

Hs.89548 selectin E (endothelial adhesion molecul 4.1

Target Exon 4.1

Hs.58589 glycogenin 2 4.0

Hs.2178 H2B histone family, member Q 4.0

Hs.112017 GE36 gene 4.0

Hs.67709 Homo sapiens mRNA full length insert cDN 4.0

Hs.200102 ATP-binding cassette transporter MRP8 4.0

Hs.103253 perilipin 4.0

Hs.191721 ESTs 4.0

Hs.142634 zinc finger protein 4.0
gb:CM0-DT0057-290200-253-d06 DT0057 Homo 4.0

Target Exon 4.0

ESTs 4.0

ESTs 4.0

ESTs 4.0

ESTs, Weakly similar to DYH9_HUMAN CILIA 4.0

gb:601283601F1 NIH_MGC_44 Homo sapiens c 4.0

Homo sapiens cDNA FLJ11041 fis, clone PL 4.0

integrin, beta-like 1 (with EGF-like rep 4.0

retinoic acid induced 3 4.0

transcription factor AP-2 alpha (activat 4.0

gb:DKFZp761A2124_r1 761 (synonym: hamy2) 4.0

hypothetical protein FLJ21908 4.0

ESTs 4.0

Homo sapiens cDNA FLJ14232 fis, clone NT 4.0

gb:za39d11.r1 Soares fetal liver spleen 4.0

ESTs, Weakly similar to ALUA_HUMAN !!!! 4.0

ESTs 4.0 

solute carrier family 19 (thiamine trans 3.9

RAD51-interacting protein 3.9

Target Exon 3.9

ESTs 3.9

alanine-glyoxylate aminotransferase 2-li 3.9

gb:zw63b08.r1 Soares_total_fetus_Nb2HF8_ 3.9

RAB11A, member RAS oncogene family 3.9

Target Exon 3.9

ESTs 3.9

gb:ze93d 1 .rl SoaresJetoLheartJJbHH19W 3.9

ESTs 3.9

actin related protein 3.9

Homo sapiens cDNA FLJ11663 fis, clone HE 3.9

ESTs, Moderately similar to ALU1_HUMAN A 3.9

myxovirus (influenza) resistance 1, homo 3.9

hypothetical protein FLJ10980 3.9

ESTs 3.9

Homo sapiens cDNA: FLJ22097 fis, clone H 3.9

ESTs 3.9
gb:CM0-HT0323-151299-126-b04 HT0323 Homo 3.9

NM_002275*:Homo sapiens keratin 15 (KRT1 3.9

TTK protein kinase 3.9

ESTs 3.9

ESTs 3.9
gb:MR0-HT0559-110300-005-h11 HT0559 Homo 3.8



Hs.159650 
Hs.265165 
Hs.222069 
Hs.212875 

Hs.28792 
Hs.82582 
Hs.194691 
Hs.334334 

Hs.26750 
Hs.61232 
Hs.101810 

Hs.208275 
Hs.264680 
Hs.30246 
Hs.24596 

Hs.19173 
Hs.106576 

Hs.75618 

Hs^28142 

Hs.209122 

Hs.135411 

Hs.49303 

Hs.187561 

Hs.76391 

Hs.29716 

Hs.30343 

Hs.145497 

Hs.133916 



Hs.169840 
Hs.231926 
Hs.255748 

Hs.98274 

Hs.339665 

Hs.73625 

Hs.29190 

Hs.87836 

Hs.205555 

Hs.144762 

Hs.36563 



ESTs 3.8 

ESTs 3.8 

RAB6 interacting, kinesin-like (rabkines 3.8

Target Exon 3.8

ESTs 3.8

Homo sapiens PAC clone RP5-1087M19 from 3.8

ESTs 3.8

ESTs 3.8

hypothetical protein FLJ22418 3.8
443162
458194 
422475 
440705 
447290 
403426 
427821 
454288 
443801 
410658 
410672 
428579 
445495 
447995 
401747 
420633 
423545 
433138
434715 
428664 
450951 
402696 
446868 
458154 
422026 
419440 
421524 
417283 
401508 
410303 
420362 
433384 
434302 
443938 
448420 
458712 
433404 
405232 
430491 
455609 
450164 
453948 
436061 
401049 
418867 
420179 
458663 
437259 
428309 
450522 
451952 
412209 
425201 
443830 
439255 
414669 
409064 
407721 
445135 
404091 
409731 
405153 
423248 
403639 
404360 
422352 
423338 



T49951 
AW383618 



AA904244 
AI476732 

M470158 

BE222648 

AW206942 

AW105231 

AW794600 

NM_005756

BE622641 

AI742618

NM_014581 

AP000692 

AB029496 

BE005346 

AK001666

AA018534 

NA 

AV660737 

AW816379 

U80736 

AB020689 

AA312082 

N62840 

NA 

AA324597 

U79734 

AI021992 

AA629065 

R55373 

BE623004 

AI347502 

T32982 

AL109791 
BE011668 
AI239923
AI970797 
AI248584 
NA 

D31771 

N74530 

AV658444 

AI377755

M97815 

AI698839 

AL120173 

AW901456 

AA352111 

AI142095 

BE164500 

AA157291 

AA062954 

Y12735 

AK000054

NA 

AA125985 

AA380177 
NA 

AA766296 
AB007961 



Hs.9029 
Hs.265459 
Hs.117313 
Hs.153205 
Hs.263912 

Hs.98202 
Hs.279458 
Hs.253594 
Hs.192035 

Hs.184942 
Hs.38489 
Hs.181733 

Hs.274480 
Hs.129781 
Hs^9729 
Hs.116410 
Hs.189095 
Hs.103334 

Hs.135100
Hs.335018
Hs.110826
Hs.90419 
Hs.105445 
Hs.48648 

Hs.21851 
Hs.97206 
Hs.124244 
Hs.116301 
Hs.20864 

Hs.107872 
Hs.102720 

Hs.241559 

Hs.30098 
Hs.64859 
Hs.190745 

Hs.89404 

Hs.21168 

Hs.280776 

Hs.120695 

Hs.183650 

Hs.301663 



DKFZP434G032 protein 3.8

ESTs, Moderately similar to ALU2_HUMAN A 3.8

Meis (mouse) homolog 3 3.8

ESTs 3.8

ESTs 3.8

Target Exon 3.8

ESTs 3.8

ESTs, Highly similar to c380A1.1b [H.sap 3.8

ESTs 3.8

ESTs 3.8
gb:RC6-UM0014-170300-022-C05 UM0014 Homo 3.8

G protein-coupled receptor 64 3.8

ESTs, Weakly similar to I38022 hypotheti 3.8

ESTs, Weakly similar to nitrilase homolo 3.7

Homo sapiens keratin 17 (KRT17) 3.7

odorant-binding protein 2A 3.7

chromosome 21 open reading frame 5 3.7

semaphorin sem2 3.7

ESTs 3.7 

similar to SALL1 (sal (Drosophila)-like 3.7

ESTs 3.7
C3002523:gi|6686211|sp|Q27533|YH2N_CAEEL 3.7

ESTs 3.7

ESTs 3.7

trinucleotide repeat containing 9 3.7

KIAA0882 protein 3.7

GDNF family receptor alpha 1 3.7

ESTs 3.7

NM_024817:Homo sapiens hypothetical prot 3.7

Homo sapiens cDNA FLJ12900 fis, clone NT 3.7

huntingtin interacting protein 1 3.7

ESTs 3.7

ESTs 3.7

ESTs 3.7
gb:601441282F1 NIH_MGC_72 Homo sapiens c 3.7

hypothetical protein FLJ20761 3.7

ESTs 3.7

NM_015832:Homo sapiens methyl-CpG bindin 3.7

Homo sapiens mRNA full length insert cDN 3.7
gb:CM3-BN0223-100500-177-a04 BN0223 Homo 3.7



Hs.143273 

Hs.21479 
Hs.141883 
Hs.38018 
Hs.12347

Hs.56145 

Hs.125845 



ESTs 3.7 
ESTs 3.7 
Homo sapiens cDNA: FLJ21326 fis, clone C 3.7
Target Exon 3.6
msh (Drosophila) homeo box homolog 2 3.6
ESTs 3.6
tankyrase, TRF1-interacting ankyrin-rela 3.6
ESTs 3.6
cellular retinoic acid-binding protein 2 3.6
gbwd31»2jc1 Sceres_NFLT._GBC_S1 Homos3.6
ESTs 3.6
gb:RC0-NN1012-270300-031-c07 NN1012 Homo 3.6
gb:EST60061 Activated T-cells XX Homo sa 3.6
ESTs 3.6
gb:RC4-HT0469-230300-014-e10 HT0469 Homo 3.6



ubinuclein 1
ESTs

dual-specificity tyrosine-(Y)-phosphoryl
hypothetical protein FLJ20047
Target Exon

thymosin, beta, identified in neuroblast
Target Exon 



Hs.99200 
Hs.127338



ENSP00000233023*:CDNA FLJ12662 fis, clon
C7001385:gi|12082809|gb|AAG48618.1|AF315
ESTs 

KIAA0492 protein 
3.6
3.6 
3.6 
3.6 
3.6 
3.6 
3.6 
3.6 
3.6 
3.6 
3.6 
424202
431750 
439907 
453596 
406446 
418454 
434360 
409079 
440132 
446706 
440671 
407647 
459023 
402820 
417009 
447048 
449978 
428062 
452909 
400610 
417843 
419335 
451592 
443270 
423948 
449424 
423841 
416808 
422060 
433104 
415778 
413054 
416636 
424639 
424827 
437782 
411514 
413783 
421106 
431291 
440623 
455838 
458771 
442942 
436550 
418849 
424420 
430916 
432030 
439405 
405917 
452727 
426320 
421070 
424625 
428508 
455651 
410555 
447754 
418636 
404097 
434205 
421072 
402421 
405248 
407638 
403000 



BE350295 
AA514986 
AA853978 
AA441838 
NA 

AA315308 

AW015415 

W87707 

AI697121 

AW291095 

AW297920 

AW860158 

AW968226 

AA191719 

AW393080 

AI806335 

AA420683 

NM_015368

NA 

W07361 

AW960146 

AI805416 

NM_004272

AW392342 

AW448937 

AW753967 

NM_000288

R20893 

AU043002 

H84847 

AW316843 

N32536 

AI917494 

AI057094 

AI370876 

AW850176 

AA314337 

AA877124 

N25521 

AI935016

BE145808 

AW295151 

AW167087 

Z50158 

AW474547 

BE614743 

AW505021 

AI908400

AF086224 

NA 

AW993582 

W47595 

AA283185

AW904466 

BE252383 

BE064962 

U92649 

AW073310 

AW749855

NA 

AF119861
AI215069 
NA 
NA 

AJ404672 
BE247275 



Hs.15032 
Hs.283705 
Hs.124577 
Hs.62905 

Hs.195870 
Hs.127780 
Hs.82065 
Hs.202466 
Hs.21814 
Hs.130054 

Hs.60798 

Hs.314714 
Hs.228320 
Hs.200829 
Hs.98321 
Hs.30985 

Hs.22545 
Hs.284137 
Hs.213897 
Hs.337737 
Hs.283077 
Hs.197030 

Hs.79993 
Hs.325823 
Hs.128246 
Hs.49391 
Hs.66309 
Hs.42645 
Hs.9812 
Hs.96867 
Hs.79090 

Hs.301547 
Hs.172844 
Hs.25275 
Hs.216639 

Hs.163612 
Hs.131562 
Hs.270235 
Hs.53565 
Hs.146688 
Hs.88414 
Hs.143789 
Hs.55238 

Hs.176220 
Hs.169300 
Hs.19327 
Hs.321197 
Hs.184668

Hs.64311 
Hs.163533 



Hs.283032 
Hs.89113 



Hs.334483 
Hs.151787 



RAN binding protein 17 3.6

ESTs 3.6

ESTs 3.6

hypothetical protein FLJ14834 3.6

Target Exon 3.6

hypothetical protein FLJ14991 3.6

ESTs 3.6

interleukin 6 signal transducer (gp130, 3.6

ESTs, Weakly similar to S65824 reverse t 3.6

interleukin 20 receptor, alpha 3.6

ESTs 3.5
gb:RCm0379-290100432-bQ4 CT0379 Homo3.5

ESTs 3.5

NM_017646*:Homo sapiens tRNA isopentenyl 3.5

ESTs 3.5

hypothetical protein FLJ23537 3.5

ESTs, Weakly similar to T30171 ninein - 3.5

hypothetical protein FLJ14103 3.5

pannexin 1 3.5

Target Exon 3.5

Homo sapiens cDNA FLJ12935 fis, clone NT 3.5

hypothetical protein FLJ12888 3.5

ESTs 3.5

Homer, neuronal immediate early gene, 1B 3.5

centrosomal P4.1-associated protein; unc 3.5

ESTs 3.5
gb:RC2-CT03044J801(MW)11-h12 CT0304 Homo 3.5

peroxisomal biogenesis factor 7 3.5

ESTs, Moderately similar to ALU5_HUMAN A 3.5

ESTs, Moderately similar to unnamed prot 3.5

hypothetical protein LOC54149 3.5

hypothetical protein MGC11061 3.5

solute carrier family 16 (monocarboxylic 3.5

Homo sapiens cDNA FLJ14388 fis, clone HE 3.5

Homo sapiens cDNA: FLJ23155 fis, clone L 3.5

exportin 1 (CRM1, yeast, homolog) 3.5

gb:IL3-CT0219-271099-022-H12 CT0219 Homo 3.5

ribosomal protein S7 3.5

ESTs 3.5

Kruppel-type zinc finger protein 3.5

ESTs 3.5
gb:MR0-HT0208-101299-103-f11 HT0208 Homo 3.5

ESTs 3.5

ESTs 3.5

ESTs, Weakly similar to MMHUB1 laminin b 3.5

Homo sapiens PIG-M mRNA for mannosyltran 3.5

prostaglandin E synthase 3.5

BTB and CNC homology 1, basic leucine zi 3.5

ESTs 3.5

ESTs 3.5

C17000675:gi|7290703|gb|AAF46150.1| (AE0 3.5

ESTs 3.5

transforming growth factor, beta 2 3.4

ESTs 3.4

PDZ domain protein (Drosophila inaD-like 3.4

SBBI31 protein 3.4
gb:RC1-BT0313-130400-016-c02 BT0313 Homo 3.4

a disintegrin and metalloproteinase doma 3.4

Homo sapiens cDNA FLJ14142 fis, clone MA 3.4
gb:QV4-BT0534-281299-053-c05 BT0534 Homo 3.4

C5000242*:gi|9369379|gb|AAF87128.1|AC006 3.4

hypothetical protein PRO2015 3.4

ESTs 3.4

C1001578*:gi|6759903|gb|AAF28099.1| (AF1 3.4

Target Exon 3.4

hypothetical protein FLJ23571 3.4

U5 snRNP-specific protein, 116 kD 3.4
433393 AF038564 Hs.98074 itchy (mouse homolog) E3 ubiquitin prote 3.4
432239 X81334 Hs.2936 matrix metalloproteinase 13 (collagenase 3.4
458747 BE618395 Hs.257391 hypothetical protein DKFZp76U1523 3.4
442082 R41823 Hs.7413 ESTs; calsyntenin-2 3.4
417974 AA210765 gb:zf90c05.r1 NCI_CGAP_GCB1 Homo sapiens 3.4
446002 AI346468 Hs.145789 ESTs 3.4
448995 AI613276 Hs.5662 guanine nucleotide binding protein (G pr 3.4
435007 AI247716 Hs^32168 ESTs 3.4
424698 AA164366 Hs.151973 hypothetical protein FLJ23511 3.4
435202 AI971313 Hs.170204 KIAA0551 protein 3.4
410467 AF102546 Hs.63931 dachshund (Drosophila) homolog 34
405460 NA Target Exon 34
441826 AW503603 Hs.129915 phosphotriesterase related 34
453472 AL037925 gb:DKFZp564M037_r1 564 (synonym: hfbf2) 34
447078 AW885727 Hs.301570 ESTs 34
441690 R81733 Hs.33106 ESTs 34
420092 AA814043 Hs.88045 ESTs 3.3
418478 U38945 Hs.1174 cyclin-dependent kinase inhibitor 2A (me 3.3
408903 BE296227 Hs350822 serine/threonine kinase 15 3.3
414737 AI160386 Hs.125087 ESTs 34
449650 AF055575 Hs^3838 calcium channel, voltage-dependent, L ty 34
418912 NM_000685 Hs.89472 angiotensin receptor 1 34
436405 AA160079 Hs.172932 Homo sapiens mRNA for partial 3'UTR, seq 34
453911 AW503857 Hs.4007 sarcolemmal-associated protein 34
409361 NM_005982 Hs.54416 sine oculis homeobox (Drosophila) homolo 34

429548 AW138372 Hs.135288 ESTs 34
420807 AA280627 Hs.57846 ESTs 34
409695 AA296961 gb:EST112514 Adrenal gland tumor Homo sa 34
445189 AI936450 Hs.147482 ESTs 34
402892 NA Target Exon 34
426681 AA994896 Hs.22514 ESTs 34
458722 AA741545 Hs.282832 ESTs, Weakly similar to T24961 hypotheti 3.3
409430 R21945 Hs.166975 splicing factor, arginine/serine-rich 5 34
443194 AI954968 Hs.279009 matrix Gla protein 3.3
445432 AV653771 gb:AV653771 GLC Homo sapiens cDNA clone 3.3
410908 AA121686 Hs.10592 ESTs 34
406151 NA Target Exon 34
436461 AW511956 Hs.293261 ESTs 34
411171 AW820260 gb:QV2-ST0296-150200-040x:10 ST0296 Homo 34
432415 T16971 Hs^89014 ESTs, Weakly similar to A43932 mucin 2 p 3.3
439310 AF086120 Hs.102793 ESTs 34
401575 NA Target Exon 34
420900 AL045633 Hs.44269 ESTs 34
445628 AI344166 Hs.155743 ESTs 34
448243 AW369771 Hs.52620 integrin, beta 8 3.3
445102 AW204610 Hs.22270 ESTs 3.3
442118 AA976718 Hs.202242 ESTs 34
416783 AA206186 Hs.79889 monocyte to macrophage differentiation-a 34
435039 AW043921 Hs.130526 ESTs 34

451474 T70874 Hs.207636 ESTs 3.2
442559 T10213 Hs.159993 glycosyltransferase 3.2
453921 AI824009 Hs.44577 ESTs 3.2
420036 R60336 Hs.52792 Homo sapiens mRNA; cDNA DKFZp586I1823 (f 3.2
435627 W88774 Hs.118370 ESTs 3.2
411598 BE336654 Hs.70937 H3 histone family, member A 3.2
446733 AA863360 Hs.26040 ESTs, Weakly similar to fatty acid omega 3.2
410153 BE311926 Hs.15830 hypothetical protein FLJ12691 3.2
403637 NA C3001106*:gi|10047201|dbj|BAB13394.1| (A 3.2
405547 NM_018833*:Homo sapiens transporter 2, A 3.2
427878 C05766 Hs.181022 CGI-07 protein 3.2
451871 AJ821005 Hs.118599 ESTs 3.2
410313 R10305 Hs.185683 ESTs 3.2
416856 N27833 Hs^69028 ESTs, Weakly similar to I38022 hypotheti 3.2
449490 AI652777 Hs.197069 ESTs 3.2
450506 NM_004460 Hs.418 fibroblast activation protein, alpha 3.2
440684 AI253123 Hs.127356 ESTs, Highly similar to S21424 nestin (H 3.2
459055 N23235 Hs.40567 ESTs, Weakly similar to B34087 hypotheti 3.2
452190 H26735
430965 AA469732 
405394 

424693 BE169810 
454265 H03556 
437687 AA765917 
428372 AK000684 
414083 AL121282 
411670 AW856552 
416283 NM_005429
437488 AA758239 
428398 AI249368 
452042 H38857 
421477 AI904743 
438078 AI016377 
448816 AB033052 
419519 AI198719 
404580 

447046 AA326187 
457473 AW974903 
429838 AW904907 
459702 AI204995 
400195 NA 
417860 AW408557 
417995 AW974175 
422589 AA312735 
435870 AA701327 
440801 AA906366 
426274 D38122 
423728 AW891294 
439677 R82331 
452834 AI638627 
431349 AA503653 
417576 AA339449 
430264 AA470519 
418827 BE327311 
410835 AW806906 
426269 H15302 
405336 NA 
437783 AI683150 
440931 AI583052 
455945 BE160636 
430437 AI768801 
405848 NA 
455685 BE066976
406970 M29994 
409602 W26713 
423518 D45027 
425653 AI065104 
426326 BE165753 
433805 AA706910 
437152 AU050027 
448602 AI541305 
452844 AW407181 
407366 AF026942 
408254 AW807227 
424085 NM_002914
416790 R83066 
420020 BE295866 
426119 W94997 
426968 U07616 
457421 AL117431
453403 BE466639 
454141 AW138413 
426650 AA382814 
450865 AI246013 
407993 AW135274 



Hs.91668 Homo sapiens clone PP1498 unknown mRNA 3.2

Hs.154918 ESTs 3.2

Target Exon 3.2

Hs.47557 ESTs 3.2

Hs.300949 ESTs, Weakly similar to thyroid hormone 3.2

Hs.122840 ESTs 3.2

Hs.183887 hypothetical protein FLJ22104 3.2

Hs.257786 ESTs 3.2

gb:F.C1-CT0294-0801QQ412-a04 CT0294 Homo 3.2

Hs.79141 vascular endothelial growth factor C 3.2

Hs.180330 ESTs 3.2

Hs.98558 ESTs 3.2

Hs.243901 Homo sapiens cDNA FLJ20738 fis, clone HE 3.2

Hs.104650 hypothetical protein FLJ10292 3.2

Hs.131693 ESTs 3.2

Hs.22151 KIAA1226 protein 3.2

Hs.176376 ESTs 3.2

NM_014112*:Homo sapiens trichorhinophala 3.2

Hs.17170 G protein-coupled receptor 4 3.2

Hs.291231 ESTs 3.1

Hs.30732 hypothetical protein FLJ13409; KIAA1711 3.1

gb:an03c03.x1 Stratagene schizo brain S1 3.1

NM_007057*:Homo sapiens ZW10 interactor 3.1

Hs.235498 hypothetical protein FLJ14075 3.1
Hs.188751 ESTs, Weakly similar to MAPB_HUMAN MICRO 3.1

Hs.30512 Homo sapiens mRNA for KIAA0556 protein, 3.1

Hs.17949 ESTs 3.1

Hs.190535 ESTs 3.1

Hs.2007 tumor necrosis factor (ligand) superfami 3.1

Hs.132136 solute carrier family 4, sodium bicarbon 3.1

Hs.164599 ESTs 3.1

Hs.105685 KIAA1688 protein 3.1

Hs.156942 ESTs, Moderately similar to ALU2_HUMAN A 3.1

Hs.82285 phosphoribosylglycinamide formyltransfer 3.1

gb:nc71f10.s1 NC!_CGAP_Pr1 Homo sapiens 3.1 

Hs.47166 HT021 3.1 

gb:QV4-ST0023-160400-172-d12 ST0023 Homo 3.1 
Hs.168950 Homo sapiens mRNA; cDNA DKFZp566A1046 (f 3.1 

Target Exon 3.1 

Hs.201550 ESTs, Weakly similar to ALU1_HUMAN ALUS 3.1

Hs.270058 ESTs 3.1

gb:PM1-HT0422-291299-002-C08 HT0422 Homo 3.1

Hs.169943 Homo sapiens cDNA FLJ13569 fis, clone PL 3.1

Target Exon 3.1 

gb:PM0-BTO340-211299403-c12 BT0340 Homo 3.1 

gb:Human alpha-l spectrin gene, exon 12. 3.1 

Hs.256972 ESTs 3.1 

Hs.129732 R3H domain (binds single-stranded nuclei 3.1

Hs.249718 ESTs, Weakly similar to A46010 X-linked 3.1

Hs.250528 Homo sapiens, clone IMAGE:4098694, mRNA, 3.1

Hs.112742 ESTs 3.1

gb:Homo sapiens mRNA; cDNA DKFZp566C0324 3.1

Hs.48778 niban protein 3.1

Hs.218377 Homo sapiens cDNA FLJ11927 fis, clone HE 3.1

gb:Homo sapiens clg33 mRNA, partial sequ 3.1 

gb:MR4-ST0062-1802Qr>001«10 ST0062 Homo 3.1 

Hs.139226 replication factor C (activator 1) 2 (40 3.1

Hs.7043 succinate-CoA ligase, GDP-forming, alpha 3.1

Hs.94382 adenosine kinase 3.1

Hs.189917 ESTs 3.1

Hs.173034 amphiphysin (Stiff-Mann syndrome with br 3.1

Hs.112165 Homo sapiens cDNA FLJ12198 fis, clone MA 3.1

Hs.61779 Homo sapiens cDNA FLJ13591 fis, clone PL 3.1

Hs.139336 ATP-binding cassette, sub-family C (CFTR 3.1

gbf ST96097 Testis I Homo sapiens cDNA 5 3.1 

Hs.106532 ESTs, Weakly similar to 138588 reverse t 3.1 

Hs.12433 ESTs 3.1 
446466 H38026 Hs.308 arrestin 3, retinal (X-arrestin) 3.1

457888 BE219794 Hs.293471 ESTs 3.1 

420058 AK001423 Hs.94694 Homo sapiens cDNA FLJ10561 fis, clone NT 3.0

409248 AB033035 Hs.51965 KIAA1209 protein 3.0

452747 BE153855 Hs.61460 Ig superfamily receptor LNIR 3.0

418926 AA232658 Hs.105794 UDP-glucose:glycoprotein glucosyltransfe 3.0

419346 AI830417 Hs.44143 polybromo 1 3.0

429826 N93266 Hs.40747 ESTs 3.0

435147 AL133731 Hs.4774 Homo sapiens mRNA; cDNA DKFZp761C1712 (f 3.0

420139 NM_005357 Hs.95351 lipase, hormone-sensitive 3.0

405609 NA ENSP0000024106V£DNA 3.0 

404274 NAfl_002944*:Homo sapiens v-ros avian UR2 3.0 

449777 AI971362 Hs.231945 ESTs 3.0 

415459 H07118 Hs.6099 ESTs 3.0 

415245 N59650 Hs.27252 ESTs 3.0

406291 NA Target Exon 3.0 
414210 BE383592 gb:601297871F1 NIH_MGCJ9 Homo sapiens c 3.0 

432055 AW972359 Hs.293334 ESTs 3.0 

442246 AI791988 Hs.129115 ESTs 3.0 

451353 N21043 Hs.42932 ESTs 3.0

451177 AI969716 Hs.13034 ESTs 3.0

418026 BE379727 Hs.83213 fatty acid binding protein 4, adipocyte 3.0

401326 NA C10000447*:gi|1168375(sp|P43467|AGA1J J ED 3.0 

409920 BE169746 Hs.12504 likely ortholog of mouse Arkadia 3.0 

432887 AI926047 Hs.162859 ESTs 3.0

411789 AF245505 Hs.72157 DKFZP564I1922 protein 3.0

401045 C11001883*:gi|6753278|ref|NP_033938.1| c 3.0

432162 AA584062 Hs.272798 hypothetical protein FLJ20413 3.0

434627 AI221894 Hs.39311 ESTs 3.0

442611 BE077155 Hs.177537 hypothetical protein DKFZp761B1514 3.0

425477 AW958879 Hs.270535 ESTs 3.0 

457183 H91882 Hs.118569 Dvl-binding protein IDAX (inhibition of 3.0

433014 NM_014711 Hs.279912 KIAA0419 gene product 3.0

415542 R13474 Hs.290263 ESTs, Weakly similar to I38022 hypotheti 3.0

416173 R52782 gb:yg99d09.r1 Soares infant brain 1NIB H 3.0

408155 AB014528 Hs.43133 KIAA0628 gene product 3.0

453913 AW004683 Hs.78934 mutS (E. coli) homolog 2 (colon cancer, 3.0

435495 AI754212 Hs.21951 Homo sapiens Xq pseudoautosomal region; 3.0

423629 AW021173 Hs.18612 Homo sapiens cDNA: FLJ21909 fis, clone H 3.0

411836 AW901879 Hs.314453 ESTs 3.0

415030 D31118 Hs.191735 hypothetical protein MGC10520 3.0

419606 AW294795 Hs.198529 ESTs 3.0 

440310 AA878939 Hs.125406 ESTs 3.0 

443608 AI375957 Hs.289074 F-box only protein 22 3.0 

414140 AA281279 Hs.23317 hypothetical protein FLJ14681 3.0

444781 NM_014400 Hs.11950 GPI-anchored metastasis-associated prote 3.0

435393 AA701259 Hs.189299 ESTs 3.0

454071 AI041793 Hs.42502 ESTs 3.0
446922 BE175605 gb^C5W0580-1005(XW22^07HT0580 Homo 3.0

448062 AW295923 Hs.255472 KIAA1843 protein 3.0

427711 M31659 Hs.180403 solute carrier family 25 (mitochondrial 3.0

450382 AA397658 Hs.60257 Homo sapiens cDNA FLJ13598 fis, clone PL 3.0

424866 W01938 Hs.337243 ESTs, Weakly similar to ALU7_HUMAN ALUS 2.9 

433043 W57554 Hs.125019 lymphoid nuclear protein (LAF-4) mRNA 2.9

420802 U22376 Hs.1334 v-myb avian myeloblastosis viral oncogen 2.9

445625 BE246743 Hs.288529 hypothetical protein FLJ22635 2.9

403677 NA C4001462:gP87715|gb|AAA79329^|P88 2.9
411093 BE067650 gb:MR4-BT0358^9030{M)03-e01 BT0358 Homo 2.9

435255 W87434 Hs.106015 ESTs, Moderately similar to ALU1_HUMAN A 2.9

443127 BE568102 Hs.180312 mitochondrial ribosomal protein S16 2.9

448104 AI674818 Hs.316433 Homo sapiens cDNA FLJ11375 fis, clone HE 2.9

427315 AA179949 Hs.175563 Homo sapiens mRNA; cDNA DKFZp564N0763 (f 2.9

430414 AW365665 Hs.120388 ESTs 2.9

423600 AI633559 Hs.310359 ESTs 2.9

458562 N34128 Hs.145268 ESTs 2.9

402109 NA Target Exon 2.9

429629 BE501732 Hs.30622 Homo sapiens cDNA FLJ13010 fis, clone NT 2.9
442295 AI827248
419752 AA249573 
404721 

445107 AJ208121 
401987 

43G5GG AA431282 
444517 AI939339
445553 AW873606
427631 AW194425
456561 AI868634
401458 

421039 NM_003478
459504 BE514127
424962 NM_012288
409617 BE003760
416931 D45371
413221 BE161151
409732 NM_016122
433687 AA743991 
434340 AI193043 
454529 Z45439 
421379 Y15221 
452291 AF015592 
457402 AW452648 
449051 AW961400 
408761 AA057264 
401093 

435061 AI651474
447985 AI681475
449340 AW235786 
426384 AI472078 
411905 BE265067 
405953 NA 
420854 AW295927 
434265 AA846811 
428365 AA295331 
442861 AA243837 
448337 AW206453 
452554 AW452434 
412248 BE176480 
449450 AL039852 
434757 AI038997
409038 T97490 
454545 AW806899 
439842 AI910896 
428479 Y00272 
424800 AL035588 
411086 BE070800 
400250 NA 
449168 NM_016206
456482 AA485224 
426044 AA502490 
431854 AA383550 
405873 NA 
440400 AA994364 
458265 AI075375
413708 BE156791 
423739 AA398155 
424408 AI754813 
453096 AW294631 
421825 AA298758 
417742 R64719 
402765 

444378 R41339 
419172 AW338625 
401497 
402376 



Hs.224398 Homo sapiens cDNA FLJ11469 fis, clone HE 2.9
Hs.152618 ESTs, Moderately similar to ZN91_HUMAN Z 2.9
NM_005596*:Homo sapiens nuclear factor I 2.9
Hs.147313 ESTs, Weakly similar to I38022 hypotheti 2.9
NM_002737*:Homo sapiens protein kinase C 2.9
Hs.190143 ESTs 2.9 
Hs.146883 ESTs 2.9 
Hs.149006 ESTs 2.9 
Hs.20726 ESTs 2.9 
Hs.246358 ESTs, Weakly similar to T32250 hypotheti 2.9
Target Exon 2.9
Hs.101299 cullin 5 2.9
gb:601315974F1 NIH_MGC_8 Homo sapiens cD 2.9
Hs.153954 TRAM-like protein 2.9
Hs.55209 Homo sapiens mRNA; cDNA DKFZp434K0514 (f 2.9 
Hs.80485 adipose most abundant gene transcript 1 2.9 
gb:PM&HT0425-141299401-F08 HT0425 Homo 2.9
Hs.56148 NY-REN-58 antigen 2.9
gb:ny57g01.s1 NCI_CGAP_Pr18 Homo sapiens 2.9
Hs.128685 ESTs, Weakly similar to T17226 hypotheti 2.9
Hs.270425 ESTs 2.9
Hs.103982 small inducible cytokine subfamily B (Cy 2.9
Hs.28853 CDC7 (cell division cycle 7, S. cerevisi 2.9
Hs.149342 activation-induced cytidine deaminase 2.9
Hs.333526 HER2 receptor tyrosine kinase (c-erb-b2, 2.9
Hs.238936 ESTs, Weakly similar to (defline not ava 2.9
C12000586*:gi|6330167|dbj|BAA86477.1| (A 2.9
Hs.163944 ESTs 2.9 
Hs.200949 ESTs 2.9 
Hs.195359 hypothetical protein MGC10954 2.9 
Hs.303662 ESTs 2.9 
gb:601 193893F1 NIH J/IGC J Homo sapiens cO 2.9 
Target Exon 2.8 
gb:UI-H-8W0-ajw>07-(MJI.s1 NCLCGAP.Su 2.8 
Hs.130554 Homo sapiens cDNA: FLJ23089 fis, clone L 2.8
Hs.183861 Homo sapiens cDNA FLJ20042 fis, clone CO 2.8
Hs.57787 ESTs 2.8 
Hs.3782 ESTs 2.8 
Hs.58006 ESTs, Weakly similar to ALU5_HUMAN ALUS 2.8
gb:RC3-HT0585-1603CKH)22-c02 HT0585 Homo 2.8
Hs.49136 ESTs, Moderately similar to ALU7_HUMAN A 2.8
Hs.132921 ESTs 2.8
Hs.50002 small inducible cytokine subfamily A (Cy 2.8
gb:QV4-ST0023-160400-172-c12 ST0023 Homo 2.8
Hs.132413 ESTs 2.8
Hs.184572 cell division cycle 2, G1 to S and G2 to 2.8
Hs.153203 MyoD family inhibitor 2.8
gb:RC3-BT0502-251199-011-c07 BT0502 Homo 2.8
Eos Control 2.8
colon carcinoma related protein 2.8
gb:aa41b12.s1 NCI_CGAP_GCB1 Homo sapiens 2.8



Hs.23142 

Hs.336695 
Hs.271699 

Hs.125594 
Hs.128193 

Hs.97600 
Hs.146428 
Hs.11325 
Hs.183747 



Hs.12569 
Hs.22120 



ESTs 2.8 

polymerase (DNA directed) iota 2.8 

Target Exon 2.8 

ESTs, Weakly similar to T25472 hypotheti 2.8

ESTs, Weakly similar to IRX2_HUMAN IROQU 2.8

gb:IL2-HT0397-091299«O254502HT0397 Homo 2.8

ESTs 2.8

collagen, type V, alpha 1 2.8

ESTs 2.8

ESTs, Moderately similar to CALB_HUMAN C 2.8
gb:EST22d11 WATM1 Homo sapiens cDNA clon 2.8

C1003621*:gi|12407405|gb|AAG53491.1|AF22 2.8

ESTs 2.8

ESTs 2.8

Target Exon 2.8

C19000763*:gi|1363912|pir||JC4296 ring f 2.8
405041 NA C3001706*:gi|1345652|sp|P15989|CA36_CHIC 2.8

408758 NM_003686 Hs.47504 exonuclease 1 2.8

431917 D16181 Hs.2868 peripheral myelin protein 2 2.8

437583 AA761190 Hs.244627 ESTs 2.8

453737 AA744862 Hs.194293 ESTs, Weakly similar to I54374 gene NF2 2.8

458094 AF086325 gb:Homo sapiens full length insert cDNA 2.8

401283 NA Target Exon 2.8 

410784 AW803201 gb:IL2-UM0077-070500-086-E06 UM0077 Homo 2.8

417601 NM_014735 Hs.82292 KIAA0215 gene product 2.8

418236 AW994005 Hs.337534 ESTs 2.8

435532 AW291488 Hs.117305 Homo sapiens, clone IMAGE:3682908, mRNA 2.8
454714 AW815098 gb:QV4-ST0212-091199-023^10 ST0212 Homo 2.8

418629 BE247550 Hs.86859 growth factor receptor-bound protein 7 2.8 

442101 AI651930 Hs.135684 ESTs 2.8 

15 405080 AK000375 HS.QBZ2Q HOCMC28P protein 2.8 

414661 T97401 Hs.21929 ESTs 2.8 

425589 AI850633 Hs.143688 Homo sapiens cDNA: FLJ23031 fis, clone L 2.8

429638 AI916662 Hs.211577 kinectin 1 (kinesin receptor) 2.7

428824 W23624 Hs.173059 ESTs 2.7

421566 NM_000399 Hs.1395 early growth response 2 (Krox-20 (Drosop 2.7
414596 BE386870 gb:601275271F1 NIH_MGC_20 Homo sapiens c 2.7

440868 R79707 Hs.263339 ESTs, Moderately similar to I38022 hypot 2.7

452943 BE247449 Hs.31082 hypothetical protein FLJ10525 2.7

443772 AV646449 Hs.282872 ESTs 2.7 

432361 AI378562 Hs.159585 ESTs 2.7

430375 AW371048 Hs.93758 H4 histone family, member H 2.7

406504 NA C5000558:gi|4504675|ref|NP_002175.1| Int 2.7

423279 AW959861 Hs.290943 ESTs 2.7

424871 NM_004525 Hs.153595 low density lipoprotein-related protein 2.7

453619 H87648 Hs.33922 Homo sapiens, clone MGC:9084, mRNA, comp 2.7

423961 D13666 Hs.136348 osteoblast specific factor 2 (fasciclin 2.7

422156 N34524 gb:yy56d10.s1 Soares_multiple_sclerosis_ 2.7

415752 BE314524 Hs.78776 putative transmembrane protein 2.7

419987 NM_005014 Hs.94070 osteomodulin 2.7

406182 NA Target Exon 2.7

416495 X69970 Hs.79350 RYK receptor-like tyrosine kinase 2.7

444701 AI916512 Hs.198394 ESTs 2.7

408171 AA301228 Hs.43299 hypothetical protein FLJ12890 2.7

430153 AW968128 Hs.336679 ESTs 2.7

413383 AA128978 Hs.154706 hypothetical protein FLJ14917 2.7

414831 M31158 Hs.77439 protein kinase, cAMP-dependent, regulato 2.7
413278 BE563085 Hs.833 interferon-stimulated protein, 15 kDa 2.7
433132 AB026264 Hs.284245 hypothetical protein IMPACT 2.7

437030 AA742577 Hs.303781 EST 2.7

439031 AF075079 gb:Homo sapiens full length insert cDNA 2.7

449532 W74653 Hs.271593 ESTs, Moderately similar to A47582 B-cel 2.7 
406153 Target Exon 2.7 

406625 Y13647 Hs.119597 stearoyl-CoA desaturase (delta-9-desatur 2.7 
444698 AI188139 Hs.147050 ESTs 2.7

432328 AI572739 Hs.195471 6-phosphofructo-2-kinase/fructose-2,6-bi 2.7
429628 H09604 Hs.13268 ESTs 2.7
420149 AA255920 Hs.88095 ESTs 2.7
431207 AA495925 Hs.9394 ESTs 2.7
438394 BE379623 Hs.27693 peptidylprolyl isomerase (cyclophilin)-l 2.7

443304 AI050073 Hs.135338 ESTs 2.7
427660 AI741320 Hs.114121 Homo sapiens cDNA: FLJ23228 fis, clone C 2.7
408460 AA054726 Hs.285574 ESTs 2.7
416515 N91716 Hs.194140 ESTs, Weakly similar to I38022 hypotheti 2.7
429922 Z97630 Hs.226117 H1 histone family, member 0 2.7

418203 X54942 Hs.83758 CDC28 protein kinase 2 2.7
439509 AF086332 Hs^8314 ESTs 2.7
402184 NA ENSP00000245238*:cDNA FLJ10922 fis, clon 2.7

450496 AW449251 Hs.257131 ESTs 2.7 
451963 AI825440 Hs.224952 ESTs 2.7 

457938 AI373638 Hs.133900 ESTs 2.7
441541 AA938663 Hs.199828 ESTs 2.7 
441111 AI806867 Hs.126594 ESTs 2.7 
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423020 
445354 
427981 
410889 
445234 
413903 
406069 
447410 
401256 
415139 
420218 
455511 
438825 
409564 
452837 
434876 
421565 
453279 
430785 
456986 
433068 
421952 
429208 
430733 
441720 
418986 
432481 
434338 
417061 
410530 
456672 
425071 



451531 
405822 
418301 
417315 
434699 
443204 
405636 
452542 
403943 
404535 
402800 
449144 
454934 
424717 
428303 
427970 
450638 
453034 
455097 
427317 
408875 
427510 
423201 
406271 



454018 
435420 
434398 
455708 
439347 
407523 
425101 
435153 
409139 



AA383092 Hs.1608 replication protein A3 (14kD) 2.7 

AV653485 Hs.6390 Homo sapiens clone FLB3344 PRO0845 mRNA, 2.7 

AW293165 Hs.143134 ESTs 2.7

X91662 Hs.66744 twist (Drosophila) homolog (acrocephalos 2.7

AW137636 Hs.146059 ESTs 2.7 

AA496493 Hs.23136 ESTs 2.7 

NA Target Exon 2.7 

AI470235 Hs.172698 EST 2.7 

NA NM_024G89*:Homo sapiens hypothetical pro 2.7

AW975942 Hs.48524 ESTs 2.7

AW958037 Hs.286 ribosomal protein L4 2.7
BE144762 gb:CM0-HT018(W4109^065-o04 HT0180 Homo 2.7

BE327427 Hs.79953 ESTs 2.6

AA045857 Hs.54943 fracture callus 1 (rat) homolog 2.6

AL121053 Hs.5534 Homo sapiens cDNA FLJ12961 fis, clone NT 2.6

AF160477 Hs.61460 Ig superfamily receptor LNIR 2.6 

AK001122 Hs.105859 hypothetical protein FLJ10260 2.6

AW893940 Hs.59698 ESTs 2.6

Z30201 gb:HHEA22G Atrium cDNA library Human hea 2.6

D38299 Hs.170917 prostaglandin E receptor 3 (subtype EP3) 2.6

NM_006456 Hs.288215 sialyltransferase 2.6

AA300900 Hs.98849 ESTs, Moderately similar to AF161511 1 H 2.6 

AA447990 Hs.190478 ESTs 2.6 

AW975920 Hs.283361 ESTs 2.6 

AI346487 Hs.28739 ESTs 2.6 

AI123555 Hs.81796 ESTs 2.6

AW451645 Hs.151504 Homo sapiens cDNA FLJ11973 fis, clone HE 2.6
AW754311 gb^M1-CT0337.1412994)68-f07CT0337 Homo 2.6

AI675944 Hs.188691 Homo sapiens cDNA FLJ12033 fis, clone HE 2.6

M25809 Hs.64173 ATPase, H transporting, lysosomal (vacuo 2.6

AK002016 Hs.114727 Homo sapiens, clone MGC:16327, mRNA, com 2.6

NM_013989 Hs.154424 deiodinase, iodothyronine, type II 2.6

AW292286 Hs.255058 ESTs 2.6 

AA018311 Hs.114762 ESTs 2.6 

Target Exon 2.6 

AW976201 Hs.53913 hypothetical protein FLJ10252 2.6

AI080042 Hs.336901 ribosomal protein S24 2.6

AA643687 Hs.149425 Homo sapiens cDNA FLJ11980 fis, clone HE 2.6

AW205878 Hs.29643 Homo sapiens cDNA FLJ13103 fis, clone NT 2.6

Target Exon 2.6
AW812256 gb:RC0-ST0174-191099-031-a07 ST0174 Homo 2.6

C5000355:gi|4503225|ref|NP_000765.1| cyt 2.6

Z25884 Hs.121483 chloride channel 1, skeletal muscle (Th 2.6

NA Target Exon 2.6 

AI989503 Hs.233405 ESTs 2.6

AW846080 Hs.314324 ESTs 2.6
H03754 Hs.152213 wingless-type MMTV integration site fami 2.6

AW974476 Hs.183601 regulator of G-protein signalling 16 2.6
AA418187 Hs.330515 ESTs 2.6
AK001826 Hs.25245 hypothetical protein FLJ11269 2.6
BE246010 Hs.271468 Homo sapiens mRNA for FLJ00038 protein, 2.6
AW655802 gb:RCW0279-17020(W23-d08 CT0279 Homo 2.6
AB028955 Hs.175780 KIAA1032 protein 2.6
NM_015434 Hs.48604 DKFZP434B168 protein 2.6
Z47542 Hs.179312 small nuclear RNA activating complex, po 2.6
NM_000163 Hs.125180 growth hormone receptor 2.6

Target Exon 2.6 
BE566962 Hs.7063 Homo sapiens cDNA: FLJ20913 fis, clone A 2.6
AW016892 Hs.100855 ESTs 2.6
AI928513 Hs.59203 ESTs 2.6
AA121096 Hs.3838 serum-inducible kinase 2.6
BE069326 gb^V3-BT0381-170100-060^03 BT0381 Homo 2.6

W24320 Hs.102941 Homo sapiens cDNA: FLJ21531 fis, clone C 2.6
X64984 gb:H.sapiens mRNA HTPCRX10 for olfactory 2.6

AA830431 Hs.180811 ESTs 2.6
AA668763 Hs.291939 ESTs 2.6
AI681917 Hs.3321 ESTs, Highly similar to IRX1_HUMAN IROQU 2.6
455100 BE160198 gb:QV1-HT04134)1020(W59^3HT0413 Homo 2.6

414612 BE274552 Hs.76578 protein inhibitor of activated STAT3 2.6 

440283 AI732892 Hs.190489 ESTs 2.6

423025 AA831267 Hs.12244 hypothetical protein FLJ20097 2.6

431473 AA825686 Hs.321176 ESTs, Weakly similar to S65824 reverse t 2.6

404440 NM_021048:Homo sapiens melanoma antigen, 2.6

403388 NA C3001398*:gi|12248917|dbj|BAB20375.1| (A 2.6

403775 NA Target Exon 2.6

405037 NA NM_021628*:Homo sapiens arachidonate lip 2.6

407447 AF290544 gb:Homo sapiens aminopeptidase mRNA, par 2.6 

420952 AA282067 Hs.88972 ESTs, Moderately similar to A46010 X-lin 2.6

435447 AI872932 gb:wm72e03.x1 NCI_CGAP_Ut2 Homo sapiens 2.6

440202 AW516211 Hs.125300 ring finger protein 21, interferon-respo 2.6

445854 AI702885 Hs.145568 ESTs 2.6

421247 BE391727 Hs.102910 general transcription factor IIH, polype 2.6

414870 N72264 Hs.300670 KIAA1204 protein 2.6

457411 AW085961 Hs.130093 ESTs 2.6

424676 Y08565 Hs.151678 UDP-N-acetyl-alpha-D-galactosamine:polyp 2.6

404443 C8001428*:gi|6572242|emb|CAB62951.1| (Z9 2.6

452268 NM_003512 Hs.28777 H2A histone family, member L 2.6

430832 AI073913 Hs.100686 ESTs, Weakly similar to JEQ350 Anterior 2.6

444779 AI192105 Hs.147170 ESTs 2.6

408633 AW963372 Hs.46677 PRO2000 protein 2.6

459089 F13036 Hs.27373 Homo sapiens mRNA; cDNA DKFZp564O1763 (f 2.6

447197 R36075 gb:yh88b01.s1 Soares placenta Nb2HP Homo 2.6

454111 AW081681 Hs.269064 ESTs, Weakly similar to T42689 hypotheti 2.6

411165 NM_000169 Hs.69089 galactosidase, alpha 2.6

406922 S70284 gb:stearoyl-CoA desaturase {human, adipo 2.6

456045 H62943 Hs.154188 ESTs 2.6

413111 BE065837 gb:R(2-BTQ318-110100412-g12BTu318 Homo 2.6

423123 NM_012247 Hs.124027 SELENOPHOSPHATE SYNTHETASE ; Human selen 2.6

432201 AI538613 Hs.298241 Transmembrane protease, serine 3 2.5

427032 AF012023 Hs.173274 integrin cytoplasmic domain-associated p 2.5

445417 AK001058 Hs.12680 Homo sapiens cDNA FLJ10196 fis, clone HE 2.5

A2222S BE245652 Hs.118281 zinc finger protein 266 2.5

428330 L22524 Hs.2256 matrix metalloproteinase 7 (MMP7; uterin 2.5

410011 AB020641 Hs.57856 PFTAIRE protein kinase 1 2.5

426310 NM_000909 Hs.169266 neuropeptide Y receptor Y1 2.5

419589 AW973708 Hs.201925 Homo sapiens cDNA FLJ13446 fis, clone PL 2.5

437770 AA767881 Hs.122897 ESTs 2.5

442818 AK001741 Hs.8739 hypothetical protein FLJ10879 2.5

414251 AL042306 Hs.97689 VASA protein 2.5 

428301 AW628666 Hs.98440 ESTs, Weakly similar to I38022 hypotheti 2.5 

431956 AK002032 Hs.272245 Homo sapiens cDNA FLJ11170 fis, clone PL 2.5

455732 BE080908 gb:QV1-BTQ631-28020(W84-h07 BT0631 Homo 2.5

458624 AI362790 Hs.278639 KIAA1684 protein; likely homolog of mous 2.5

428257 BE394723 Hs.275243 S100 calcium-binding protein A6 (calcycl 2.5

418054 NM_002316 Hs.83354 lysyl oxidase-like 2 2.5

458652 AW375610 Hs.117102 hypothetical protein FLJ13046 similar to 2.5

458012 AI424899 Hs.188211 ESTs 2.5

422996 BE091089 gb:PM4-BT0724-13040(W)0^c07 BT0724 Homo 2.5

410804 U64820 Hs.66521 Machado-Joseph disease (spinocerebellar 2.5

457211 AW972565 Hs.32399 ESTs, Weakly similar to S51797 vasodilat 2.5

440029 AW089705 Hs.293711 ESTs, Weakly similar to S64329 probable 2.5 

448141 AI471598 Hs.197531 ESTs 2.5 

409163 AA065081 gb:zm13a03.s1 Stratagene pancreas (93720 2.5 

431385 BE178536 Hs.11090 membrane-spanning 4-domains, subfamily A 2.5

414569 AF109298 Hs.118258 prostate cancer associated protein 1 2.5

439963 AW247529 Hs.6793 platelet-activating factor acetylhydrola 2.5

455935 BE158687 gb:CM0-HT0395-280100-169-b09 HT0395 Homo 2.5

425025 AW953168 Hs.12407 ESTs 2.5

416589 AA652687 Hs.96151 Human DNA sequence from clone RP5-1103G7 2.5

404826 Target Exon 2.5

422938 NM_001809 Hs.1594 centromere protein A (17kD) 2.5

421991 NM_014918 Hs.110488 KIAA0990 protein 2.5

417404 NM_007350 Hs.82101 pleckstrin homology-like domain, family 2.5

448516 AW898595 gb:RC1-NN0073-260400-011-g09 NN0073 Homo 2.5
403356 NA ENSP00000251525*:Hypothetical protein KI 2.5

404983 ENSP00000252242*:Keratin, type II cytosk 2.5

418282 AA215535 Hs.98133 ESTs 2.5

427409 AW467143 Hs.135411 actin related protein 2.5

431806 AF186114 Hs.270737 tumor necrosis factor (ligand) superfami 2.5

443367 AW071349 Hs.215937 ESTs 2.5

421246 AW582962 Hs.102897 CGI-47 protein 2.5

439217 AF086041 Hs.42975 ESTs 2.5

400925 Target Exon 2.5
404552 NA ENSP00000220888*:ZINC FINGER TRANSCRIPTI 2.5

417168 AL133117 Hs.81376 Homo sapiens mRNA; cDNA DKFZp586L1121 {f 2.5 

418841 NM_002332 Hs.89137 low density lipoprotein-related protein 2.5

426853 U32974 Hs.172777 baculoviral IAP repeat-containing 4 2.5

427738 NM_000318 Hs.180612 peroxisomal membrane protein 3 (35kD, Ze 2.5

457384 AA501760 Hs.15806 Homo sapiens mRNA; cDNA DKFZp434H2019 (f 2.5

447128 AI271898 Hs.164866 cyclin K 2.5
454693 AW813428 gb:MR3-ST0192-010200-210-cO5ST0192 Homo 2.5 

434657 AA641876 Hs.191840 ESTs 2.5 

402077 NA Target Exon 2.5 

400289 X07820 Hs.2258 matrix metalloproteinase 10 (MMP10; str 2.5

409723 AW885757 Hs.257862 ESTs 2.5

447020 T27308 Hs.16986 hypothetical protein FLJ11046 2.5

455068 AI807894 Hs.47274 Homo sapiens mRNA; cDNA DKFZp564B176 (fr 2.5

431232 AI024353 Hs.131755 hypothetical protein FLJ14298 2.5

408938 AA059013 H^22607 ESTs 2.5

411571 AA122393 Hs.70811 hypothetical protein FLJ20516 2.5

426504 AW162919 Hs.170160 RAB2, member RAS oncogene family-like 2.5

428248 AI126772 Hs.40479 ESTs 2.5 

408813 AI580090 Hs.48295 RNA helicase family 2.5 

423504 N80077 Hs.24792 chromosome 12 open reading frame 5 2.5 

425441 AA449644 Hs.193063 Homo sapiens cDNA FLJ14201 fis, clone NT 2.5

443066 AW297921 Hs.255703 ESTs 2.5

443556 AA256769 Hs.94949 methylmalonyl-CoA epimerase 2.5

428943 AW086180 Hs.37636 ESTs, Weakly similar to KIAA1392 protein 2.5

425320 U29344 Hs.83190 fatty acid synthase 2.5

430388 AA356923 Hs.240770 nuclear cap binding protein subunit 2, 2 2.5

423242 AL039402 Hs.125783 DEME-6 protein 2.5

416241 N52639 Hs.32683 ESTs 2.5 

440244 AI743977 Hs.205144 ESTs 2.5 

409239 AA740875 Hs.44307 ESTs, Moderately similar to I38022 hypot 2.5

452464 AW500507 Hs.192619 KIAA1600 protein 2.5

410718 AI920783 Hs.191435 ESTs 2.5

408877 AA479033 Hs.130315 ESTs, Weakly similar to A47582 B-cell gr 2.5

445150 AI446747 Hs.338704 olfactory receptor, family 7, subfamily 2.5

407756 AA116021 Hs.38260 ubiquitin specific protease 18 2.5

407633 NM_007069 Hs.37189 similar to rat HREV107 2.5

449754 H00820 Hs.30977 ESTs, Weakly similar to B34087 hypotheti 2.5

419316 AA236255 Hs.298419 ESTs 2.5

429118 H20669 Hs.35406 ESTs, Highly similar to unnamed protein 2.5

440331 AL046412 Hs.202151 ESTs 2.5

449344 AI640355 Hs.312691 ESTs 2.5

459006 AW298631 Hs.27721 Wolf-Hirschhorn syndrome candidate 1-lik 2.5

423165 AI937547 Hs.124915 hypothetical protein MGC2601 2.5
411337 AW837349 gb:QV2-LT0038-27030O.108^12LT0038 Homo 2.5

438290 AA843719 Hs.122341 ESTs 2.5

406414 C5000506*:gi|124941|sp|P18614|ITA1_RAT I 2.5

424498 AB033043 Hs.149377 hypothetical protein DKFZp761L0424 2.5

443464 BE548446 Hs.3167 Homo sapiens mRNA; cDNA DKFZp434F152 (fr 2.5

424856 AA347746 Hs.9521 ESTs, Weakly similar to ZN43_HUMAN ZINC 2.5

440304 BE159984 Hs.125395 ESTs 2.5
409045 AA635062 Hs.50094 Homo sapiens mRNA; cDNA DKFZp434O0515 (f 2.5

422648 D86983 Hs.118893 Melanoma associated gene 2.5

428819 AL135623 Hs.193914 KIAA0575 gene product 2.5

412520 AA442324 Hs.795 H2A histone family, member O 2.5

430602 D13752 Hs.184927 cytochrome P450, subfamily XIB (steroid 2.5

408031 AA081395 Hs.42173 Homo sapiens cDNA FLJ10366 fis, clone NT 2.5

403133 Target Exon 2.5




WO 02/059377 



PCT/US02/02242 



413189 BE070231 gb:QV4-BT0407-2601(XH)B7-f12 BT0407 Homo 2.5 

400346 AB041269 Hs.272263 Homo sapiens mRNA for keratin 19, partia 2.5

435509 AM58679 Hs.181915 ESTs 2.5 

458145 AI239457 Hs.130794 ESTs 2.5 
TABLE 19A



Table 19A shows the accession numbers for those pkeys lacking unigene ID's for Table 19.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers
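
The three columns defined above map onto a simple record. The following is a minimal sketch only, not part of the original disclosure: it assumes each row is whitespace-delimited, with the Pkey first, the CAT number second, and one or more Genbank accessions after it. The names `ClusterRow` and `parse_row` are hypothetical, and rows whose accession lists wrap onto continuation lines would need to be joined before parsing.

```python
from typing import List, NamedTuple


class ClusterRow(NamedTuple):
    """One Table 19A row (hypothetical record type for illustration)."""
    pkey: int              # unique Eos probeset identifier
    cat_number: str        # gene cluster number, kept as an opaque string
    accessions: List[str]  # Genbank accessions comprising the cluster


def parse_row(line: str) -> ClusterRow:
    """Split one whitespace-delimited row into its three columns."""
    fields = line.split()
    if len(fields) < 3:
        raise ValueError(
            f"expected Pkey, CAT number and at least one accession: {line!r}"
        )
    return ClusterRow(int(fields[0]), fields[1], fields[2:])


# Example row taken from the table as extracted.
row = parse_row("411905 1265181.1 BE265067 BE264978 AW875420")
print(row.pkey, row.cat_number, len(row.accessions))
```

The CAT number is deliberately left as a string, since the extracted text does not make its exact punctuation certain.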


Pkey CAT number Accessions 

407647 1007366.1 AW860158 AW862385 AW860159 AW862386 AW862341 AW821869 AW821893 AW062660 AW062656
407980 103087.1 AA046309 AI263500 AA046397
408254 1049346.1 AW807227 AW807576 AW807137 AW807157 AW807495 AW807494 AW807417 AW807083 AW845786 AW845801
    AW807130 AW807335 AW807081 AW807349 AW807339 AW807164 AW807341 AW807224 AW845903 AW177424
    AW807159 AW807123 BE141576 AW807340 AW807334 AW807520 AW807205 AW807505 BE141574 AW807390
    AW807395 AW845789 AW807101 AW807089 AW807519 AW807239 AW807509 AW807356 AW807526 AW807098
    AW807307 AW807153 AW807295 AW807313 AW807322 AW807265 AW807513 AW807516 AW807166 AW807501
    AW807120 AW807168 AW807121 AW807527 AW807151 AW807203 AW807489 AW807511 AW807158 AW845800
    AW807507 AW845793 AW845796 AW807296 AW845781 AW807444 AW845871 AW807512 AW807242 AW807141
    AW807522 AW807487 AW807514 AW807142 AW807232 AW807379 AW807114 AW807518 AW807199 AW807211
    AW807498 AW807086 AW807492 AW807218 AW807082 AW807525 AW807493 AW807523 AW807087 AW845784
    AW807037 AW807128 AW807080 AW807118 AW845807 AW807524 AW845803 AW807249 AW845795 AW807160
    AW807343 AW807515 AW807233 AW807289 AW177102 AW807352 AW807394 AW177105 AW807176 AW177103
    AW845870 AW177099 AW177101 AW807528 AW807336 AW807038 AW177100 AW807411 AW807088 AW845865
    AW807226 AW807517 AW807397 AW807303 AW807177 AW807154 AW807136 AW807146 AW807085 AW807521
    AW807488 AW807385 AW807355 AW807223 AW807155
409163 110418.1 AA065081 AA075017 AA084791 AA071015 AA081560 AA071459 AA545727 AA083100 AA085366 AA115845 AA075457
    AA064704 AA082878 AA075742 AA069162
409695 114876.1 AA296961 AA296889 AA076945 AA077528 AA077497
410534 1207247.1 AW905138 AW753008 R13818 Z43519
410672 1214882.1 AW794600 AW794730
410784 1221005.1 AW803201 BE079700 BE062940
410785 1221055.1 AW803341 AW803265 AW803403 AW803466 AW803402 AW803413 AW803268 AW803396 AW803334 AW803355
410835 1223785.1 AW806906 AW808915 AW866460 AW866475 AW866462 AW866448 AW866372 AW866604
411050 1230330.1 AW814902 BE156656 BE156667 BE156590 BE156441 BE156447
411086 1231500.1 BE070800 AW875226 BE149115
411093 1231970.1 BE067650 AW817053
411111 1232669.1 AW818127 AW818161 R09719
411171 1234393.1 AW820260 AW820332 R94406
411337 1239217.1 AW837349 AW837355 AW882717
411514 1248638.1 AW850178 AW850233 AW850445 AW850446
411670 1253680.1 AW856552 AW861101 AW856574 AW861099 AW861100 AW856573 AW856576 AW856562
411905 1265181.1 BE265067 BE264978 AW875420
412102 1277395.1 H56435 H56572 AW892929
412209 1283610.1 AW901456 AW901450 AW901441
412248 1285000.1 BE176480 AW903298 AW903313
413043 1346556.1 BE158766 BE061699 BE147360 BE147362 BE061666 BE061697 BE061647 BE061678
413111 1349546.1 BE065837 BE065805 BE065799 BE065818 BE065839 BE085831 BE055894 BE065789 BE065792
413189 1352723.1 BE070231 BE070229 BE070255
413221 1353887.1 BE161151 BE162495 BE161002 BE072205 BE160989 BE162482
413499 1373910.1 BE144884 H97942
413708 1384140.1 BE158791 BE158806 BE158748 BE158744 BE158740 BE158739 BE158811 BE158700 BE158741 BE158683 BE158685
414210 1426051.1 BE383592 BE261671
414596 1465004.1 BE386870 Z41986 H08501
414605
415747 
416173 
417742 
417974 
418636 
419536 
420854 
422156 

422996 
423833 
423841 
423945 
425201 
426650 
426878 
430264 
430785 
431676 
433687 
434338 
434469 
435447 
437152 
437854 
439031 
439255 
444910 
445432 
446922 
447197 
448420 
448516 
450522 
450736 
451024 
451067 
451340 
452542 
452564 
453472 
454307 
454359 
454545 
454693 
454714 
455047 
455092 
455097 
455100 
455431 
455511 
455609 
455651 

455685 
455700 
455708 
455732 
455838 
455935 
455945 
456207 
456482 
458094
458673 



1465790_-1 
155189.1 
1574973.1 
1696282J 
171237J 
177402J 
185688.1 
197072.1 
212379J 

223666.1 
232451J 
232507J 
233566 1 
247933.1 
270283J 
273265.1 
315008.1 
323486.1 
336411.1 
373061J 
383982.1 
387447.1 
406400J 
43386.1 
44418.1 
46798.1 
470321J 
624951J 
63943.1 
69865.1 
711623.1 
76273.1 
766241J ■ 
837264.1 
844652J 
85565J 
85759J 
86640.1 
921410J 
92227.1 
968371.1 
1106070.1 
1130674.1 
1223779.1 
1229132.1 
1230493J 
1250536.1 
1252971J 
1253130.1 
1253334.1 
1289854.1 
1321229J 
1337548.1 
1348732.1 

1350393J 
1351264J 
1352232J 
1353874.1 
1374605.1 
1384144.1 
1385588.1 
165078.-1 
192289.1 
47311J 
679507.1 



BE390440 

AA381209 AA381245 AA167683 
R52782R17313H24192R19876 
R64719Z44680R12451 
AA210765T95700 H94407 
AW749855 AA225995 AW750208 AW750206 
AA603305 AA244095 AA244183 

m^Z^n A\^^^^BA1433430 AI203597 AW026670 AW265323 AW850787 AA317554 AW993643 
AW835572 AW385512 A1334966 W32951 H62656 H53902 R88904 AWB35732 
BE091089 BE091123 AA319959 
AW503329 N46610 AA331571 
AW753967 AA370795 AA331630 AW962550 
AA4 10943 AW948953 AA334202 AA332882 
AA3521 1 1 AW962247 AA429695 
AA382814 AA40241 1 AA412355 
BE069341 AW748403 AL044891 AI908240 AA393080 
AA470519 BE303010 BE302954 BE384120 
Z30201M486132T72025 
A1685464 AW971336 AA513587 AA525142 
AA743991 AA604852 AW272737 
AW754311 AA630185 AW803285 
AA634608 C18732 AA729161 AA729860 
AI872932 AA682306 BE220163 W88695 T81307 H91447 

AL050027BE089051 
AL119723 AL119874 A1909018 U50537 
AF075079 H48601 H48795 
BE164500AA832198BE164502 
AI201849 BE069007 AW946544 
AV653771 BE089370 

BE1756Q5 Z43529 F06610 BE175602 AV661027 
. R36075AI366546 R36167 
BE623004 M380669 BE263627 QE246433 
AW898595 AW89858B AW898590 AW898663 AW898592 AI525093 



AW970060 AI732366 A1792313 AW839644 
AA442176AA259181 

BE172186 AA059279 AA020815 AA013437 
AW936273 AW340350 AA017208 
AW812256 AW812257 AI906423 AI906422 
AA026777 N5QQ55 R09961 N54721 
AL037925 AL037931 AL037957 
AW855717 AW362452 AW362443 
N71277AW390764 

AW806899 AW866451 AW866393 AW866297 AW817869 
AW813428 AW813444 AW813367 AW813368 AW813429 AW813424 
AW815098BE154843BE154831 
AW852530 AW852527 AW852526 

AW85 2 ^?2 J^msm AWJ^97 AW855806 AW855796 AW855808 AW855793 AW855807 
BE160198 AW935898 T1 1520 AW935930 AW856073 AW861034 
AW938484 BE001245 BE001190 

1eSb^SbE011627 BE011679 BE011699 BE011678 BE011696 BE011675 BE011622 BE011635 
BEDM8G2 BE064979 BE064853 BE064857 BE064856 BE064977 BE064960 BE064860 BE064815 BE064957 BE064804 
BE064816 BE064850 BE064806 BE064796 BE064818 BE064975 BE064819 BE064810 BE064668 BE085059 
BE066976 BE066928 BE066927 

BE068115 BE068104 BE068102 BE068096 BE068103 BE068154 BE068198 

BE069326 BE069290 BE069352 

BE080908 BE072258 BE072190 BE072236 

BE145808 BE145807 BE181883 

BE158687BE158688 

BE160636BE160606BE160703 . 

AA193450 

AA485224 AA287308 AA258121 
AF086325 W72956 W73221 AA219112 
N99626A1302701 



TABLE 19B 



Table 19B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 19. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 



Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22": Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.
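
Each Nt_position entry in the table is a comma-separated list of start-end pairs, one pair per predicted exon. As a minimal sketch of reading such an entry (not part of the original disclosure), the hypothetical helper `parse_nt_position` below turns one entry into a list of intervals. It assumes the coordinates are inclusive, which the table itself does not state, and it rejects entries that are not clean start-end pairs rather than guessing at them.

```python
from typing import List, Tuple


def parse_nt_position(field: str) -> List[Tuple[int, int]]:
    """Parse 'start-end,start-end,...' into a list of (start, end) tuples."""
    exons = []
    for part in field.split(","):
        start_text, sep, end_text = part.strip().partition("-")
        # Refuse anything that is not exactly two digit runs joined by a hyphen.
        if not sep or not start_text.isdigit() or not end_text.isdigit():
            raise ValueError(f"not a start-end pair: {part!r}")
        exons.append((int(start_text), int(end_text)))
    return exons


# Example entry taken from the Nt_position column.
exons = parse_nt_position("117606-117928,124040-124147")
# Total exon length, assuming inclusive coordinates (an assumption).
total_length = sum(end - start + 1 for start, end in exons)
print(exons)         # [(117606, 117928), (124040, 124147)]
print(total_length)  # 431
```

Entries damaged in extraction (for example a pair with a missing hyphen) raise `ValueError`, so they can be flagged for manual review instead of being silently misread.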



Pkey Ref Strand Ntjjosition 



400555 
400608 
400610 
400925 
401045 
401049 
401093 
401256 
401283 
401326 
401418 
401451 
401458 
401497 
401508 
401575 
401747 

401781 
401785 
401793 
401987 
402077 
402109 
402184 
402376 
402421 
402578 
402606 
402696 
402765 
402800 
402820 
402892 
403133 
403356 
403388 
403426 
403585 
403593 
403637 
403639 
403677 
403775 
403943 



9801191 
9887666 
9887671 
7651921 
8117619 
7232177 
8516137 
9796573 
9800093 
9212516 
7452889 
6634068 
9187886 
7381770 
7534110 
7229804 
9789672 

7249190 
7249190 
7263888 
4406829 
8117414 
8131678 
8576001 
9625329 
9796341 



9909429 
7328818 
9367757 
6010175 
6456853 
8086844 
7331427 
8569930 
9438331 
9719529 
8101208 
6862650 
8671936 
8671948 
7331517 
7770580 
7711864 



Minus 

Minus 

Minus 

Plus 

Plus 

Plus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Plus 

Plus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Plus 

Minus 

Minus 

Minus 

Minus 

Plus 

Minus 

Minus 

Plus 

Plus 

Minus 

Minus 

Plus 

Plus 

Plus 

Minus 

Minus 

Minus 

Minus 

Plus 

Minus 

Minus 

Plus 



134694-134817 
96756-97558 

117606-117928,124040-124147 

38183-38391,43900-44086 

90044-90184,91111-91345 

149157-150692 

22335-23166 

4548245620 

47256-47456 

226246-227505 

124865-125075 

119926-121272 

76485-77597 

92607-92813 

110779-110983 

76253-76364 

131258,131866-131932,132451-132575,133580-134011 

83215^M35.83531-«3656.8374O^3901 l 84237-e4393 r 84955^3736290-86814 

165776-165996,166189-166314,166408-166569,167112-167268,167387-167469,168634-168942
102945-103083 

72893-73021,76938-77049

65014-65195 

171722-171859,173197-173303 
112844-112986,113505-113636 
21753-22385 

4660946662.46758-46811,86293^6346,89776^9829.90048-90101,102817-102924 
66350-66496 

81747-62094 

23600-23731 

109588-109726 

43921-44049,46181-46273 

8227442443 

194384-194645 

38314-38634 

92839-93036 

112733-113001,114599-114735 

157156-158183 

131266-131769 

62554-62712,6944949602 

142647-142771,145531-145762 

113234-113326,115186-115287,119649-119766

55008-55083,62860-63051 

102247-102326,103095-103148 

100742-100904,101322-101503 
404091
404097 
404142 
404253 
404274 
404285 
404360 
404440 
404443 
404552 
404561 
404580 
404721 
404826 
404983 
405037 
405041 
405095 
405153 
405196 
405232 
405246 
405336 
405394 
405460 
405494 
405547 
405609 



7684554 
7770701 
9856692 
9367202 
9885189
2326514 
9858450 
7528051 
7579073 
7243881 
9795980 
6539738 
9856648 
6572184 
4432779 
7543748 
7547195 
8072599 
9965565 
7230083 
7249042 
7259728 
6094635 
6624123 
7684569 
8050952 
1054740 
5757553 



405638 
405654 
405718 
405822 
405848 
405873 
405906 
405917 
405925 
405953 



406151 
406153 
406182 
406271 
406291 
406348 
406414 
406446 
406504 
406554 



4895155 
9795467 
6273498 
7651809 
6758747 
7705124 
7712162 
6758795 
7960374 
9117732 
7144806 
9929734 
5923650 
7534217 
5686274 
9255985 
9256407 
9454509 
7711360 
7711566 



Minus 

Plus 

Minus 

Minus 

Plus 

Plus 

Minus 

Plus 

Minus 

Plus 

Minus 

Minus 

Minus 

Plus 

Minus 

Minus 

Plus 

Plus 

Minus 

Minus 

Plus 

Plus 

Plus 

Minus 

Minus 

Minus 

Plus 

Minus 

Plus 

Minus 

Plus 

Minus 

Minus 

Minus 

Minus 

Minus 

Plus 

Minus 

Plus 

Minus 

Minus 

Minus 

Plus 

Plus 

Minus 

Plus 

Minus 

Minus 

Plus 



82121-83229 

55512-55781 

80316-60459 

55675-66055 

104127-104318 

32282-32416 

122873-122966,151324-151469,153093-153253 

80430-81581 

87198-87441 

19854-20010 

69039-70100 

240588-241589 

173763-174294 

4772648046 

5117^51374,5200(W2173 
127374-127578 
121230-121714 
138877-139066 
175317-175500 
135716-135851 
125904-126063 
637-777 
33267-33563 
31900-32373 
52223-52389 
70284-70518 



52918,55469155601,571 1 1^7307.58169-58296,60215-60332.61482-61727 

4nnnM 4f>fVJ"70 4 GO QIC .100000 



199260-199372,199826-199929
53624-53759 
113080-113266 

154660-154974,155203-155379 

28135-28244 

32129-32764 

10835-11059 

106829-107213 

129935-130282 

65101-65574 

68880-69374 

94087-94285 

12902-13069 

28256-28935 

36179-36692 

9562-9867 

71754-71944 

49593-49850 

116424-116527,118721-118859,121187-121364 

107068-107277 

106956-107121 
TABLE 20: 544 GENES UP-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT TISSUES THAT ARE 
LIKELY TO ENCODE EXTRACELLULAR OR CELL- 
SURFACE PROTEINS 






Table 20 shows 544 genes up-regulated in breast cancer compared to normal adult tissues
that are likely to encode extracellular or cell-surface proteins. These were selected as for
Table 19, except that the ratio was greater than or equal to 3.0, the "average" normal adult
tissue level was set to the 85th percentile value amongst 144 non-malignant tissues, the
96th percentile value amongst the 73 breast cancer specimens was greater than or equal to 100
units, and the predicted protein contained a structural domain that is indicative of
extracellular localization (e.g. ig, fn3, egf, 7tm domains, signal sequences, transmembrane
domains). The predicted protein domains are noted.
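The selection criteria above can be sketched as a simple filter. This is an illustration under stated assumptions, not the procedure actually used to build the table: the function names are hypothetical, the nearest-rank percentile method is an assumption (the text does not say how percentiles were computed), and the tumor percentile is a required argument because this copy prints it differently in the description and in the R1 column legend.

```python
import math
from typing import Sequence


def percentile(values: Sequence[float], pct: float) -> float:
    """Nearest-rank percentile (an assumed method; the source does not specify one)."""
    if not values:
        raise ValueError("no expression values supplied")
    ordered = sorted(values)
    rank = max(1, math.ceil(pct / 100 * len(ordered)))
    return ordered[rank - 1]


def passes_selection(
    tumor: Sequence[float],
    normal: Sequence[float],
    has_extracellular_domain: bool,
    *,
    tumor_pct: float,
    normal_pct: float = 85.0,
    min_ratio: float = 3.0,
    min_tumor_units: float = 100.0,
) -> bool:
    """Apply the three stated criteria to one probeset."""
    tumor_level = percentile(tumor, tumor_pct)
    normal_level = percentile(normal, normal_pct)
    if normal_level <= 0:
        # Assumption: a non-positive baseline gives no usable ratio, so reject.
        return False
    ratio = tumor_level / normal_level
    return (
        ratio >= min_ratio
        and tumor_level >= min_tumor_units
        and has_extracellular_domain
    )
```

For example, `passes_selection(tumor_values, normal_values, True, tumor_pct=96.0)` would test one probeset against the thresholds given in the description.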



Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
PredProtDomains: Predicted Protein Domains
Unigene Title: Unigene gene title
R1: Ratio of 93rd percentile of tumor to 85th percentile of normal body tissue
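The PredProtDomains column holds comma-separated domain tokens, often with repeats and stray leading or trailing commas. A minimal sketch of how such an entry might be tokenized follows. The function names are hypothetical, and reading "SS" as signal sequence and "TM" as transmembrane domain is an assumption based on the domain types named in the table description.

```python
from typing import Dict, List


def parse_pred_prot_domains(field: str) -> List[str]:
    """Split a PredProtDomains entry into unique tokens, keeping first-seen order."""
    seen: List[str] = []
    for token in field.split(","):
        token = token.strip()
        if token and token not in seen:
            seen.append(token)
    return seen


def localization_flags(domains: List[str]) -> Dict[str, bool]:
    """Flag the two localization hints (assumed meanings of 'SS' and 'TM')."""
    return {
        "signal_sequence": "SS" in domains,
        "transmembrane": "TM" in domains,
    }
```

Applied to an entry such as `,SS,TM,p450,p450`, this yields the tokens `SS`, `TM` and `p450`, with both flags set.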
408591
400291 
449746 
407277 
400292 
424735 
426878 
428848
407178 
408000 
427585 
429441 
450375 
420931 
422109 
445730 
451110 
400297 
420813 
452744 
424634 
420757 
424399 
447350 
456207 
431448 
427217 
456938 
435496 
402578 
453160 
422505 
444342 



AF015224 Hs.46452
AA401369 Hs.190721
AI668594 Hs.176588
AW170035 Hs.326736
AA250737 Hs.72472
U31875 Hs.272499
BE069341

NM_000230 Hs.194236
AA195651 Hs.104106 
L11690 Hs.620 
031152 Hs.179729 
AJ224172 Hs.204096 
AA009647 Hs.8850 
AF044197 Hs.100431 
S73265 Hs.1473 



SS,Uteroglobin,SS,Uteroglobin
TM

,SS,p450
TM

death,ZU5,TM,Activin_recp,pkinase,

,SS,TM

TM

SS,Leptin,SS,Leptin,
,SS,Dihydroorotase,

Plectin_repeat,SH3,spectrin,SS,Plectin_r
SS,C1q,Collagen,SS,C1q,Collagen,
,SS,Uteroglobin,



AI624342 
AI955040 
AI127076
X51501
AI267652



Hs.170042 
Hs.265398 
Hs.334473 
Hs.99949 
Hs.30504 



NM_003613 Hs.151407
X78592 Hs.99915
AI905687 Hs.2533
AI375572 Hs.172634
AA193450

AL137517 Hs.334473
AA399272 Hs.144341
X52509 Hs.161640
AW840171 Hs.265398

AI263307 Hs.239884
AL120862 Hs.124165
NM_014398 Hs.10887



Unigene Title

mammaglobin 1 
ESTs 

ESTs, Weakly similar to CP4Y_HUMAN CYTOC
Homo sapiens breast cancer antigen NY-BR
BMP-R1B

short-chain alcohol dehydrogenase family
gb:QV3^T0381-270100473-c08BT0381 Homo
leptin (murine obesity homolog)
ESTs

bullous pemphigoid antigen 1 (230/240kD)
collagen, type X, alpha 1 (Schmid metaph
lipophilin B (uteroglobin family member)



.ss.uierogiomn, w*" w v««^««-; — / 

SS TM dislntegriaPep_M12B _propep,Repro a disintegrin and metalloproteinase doma 

' .. » emit I Inrli „<h\a rvtnVina R enhfamth/ (Cm 



SS.1L8.SS 
SS.Bombesin.SS 
,SS,TM,Catfon_efflux 
SS 
TM 
SS SS 

,SS.TM l 6NS1^SUR4 l cNMP_blnding l Rlla 
lg,tspJ,SS,AAA 

horrnone_recAndn>gen_recep,2f-C4. 
SS 

.pklnase, 

,SS,TM,p450,p450 
TM 
SS 



small inducible cytokine B subfamily (Cy 
gastrin-releasing peptide 
ESTs 

ESTs, Weakly similar to transformation-r
hypothetical protein DKFZp564O1278
prolactin-induced protein
Homo sapiens mRNA; cDNA DKFZp434E082 (fr
cartilage intermediate layer protein, nu
androgen receptor (dihydrotestosterone r
aldehyde dehydrogenase 9 family, member
ESTs

gb:zr40e07.r1 Soares_NhHMPu_S1 Homo sapi
hypothetical protein DKFZp564O1278
ESTs 



.SSTM.arninotranJ 2 l CadhertruC_term,cad tyrosine aminotransferase 

o S ESTs, Weakly similar to transformatiorK 

SS,p450.SS.TM,p450 C1001134:gi|2117372lpir||l65981 fatty ac 

SS H2B histone family, member L 

SS ' programmed cell death 9 (PDCD9) 

Lamp,SS,TM.Lamp. similar to lysosome-assodated membrane 
449765
428227 



424001 
448595 
449448 
418007 
418994 
453595 
452401 
446591 
419296 
452838 
422805 
448390 
447342 
411869 
443709 
459587 
442580 
400289 
411598 
415263 
433805 
407276 
443348 
421037 
424086 
400295 
452461 
427365 
433365 



N92293 Hs.206832 
AA321649 Hs.2248 
D90041 Hs.155956 
W67883 Hs.137476 
AB014544 Hs.21572 
D60730 Hs.57471 
M13509 Hs.83169 
AA296520 Hs.89546 
AA441838 Hs.62905 
NM_007115 Hs.29352
H44186 Hs.15456
AA236115 Hs.120785
U65011 Hs.30743
AA436989 Hs.121017
AL035414 Hs.21068
AI199268 Hs.19322
W20027 Hs.23439
AI082692 Hs.134662
AA031956

AI733682 Hs.130239
X07820 Hs.2258
BE336654 Hs.70937
AA948033 Hs.130853
AA706910 Hs.112742
AI951118 Hs.326736 
AW873596 Hs.182278 
AI684808 Hs.197653 
AI351010 Hs.102267 
W72838 Hs.2533 



N78223 
AI873274 



Hs.108106 
Hs.190721 



432596 
408771 
432912 
447033 
421155 
424905 
425398 
438167 
459583 
423945 



402606 
445263 
430217 
447164 
431385 
423887 
415385 
425704 



SS,IL8. 

,SSAcetyltransf2. 
.pWnase, 

LRRCT,LRR,SS,LRRCT 1 senre_carbpept 
SS 



425523 



418912 
422026 
451952 
438199 
400608 
413472 
432374 
402408 
445537 
451621 
405654 



ESTs, Moderately similar to ALUB_HUMAN A
small inducible cytokine subfamily B (CX
N-acetyltransferase 1 (arylamine N-acety
paternally expressed 10
KIAA0644 gene product
ESTs 



SS,hemopexin,Peptidase_M10,SS,Peptidase_ matrix metalloproteinase 1 (MMP1; inters
SS,lectin_c,sushi,EGF,SS,EGF,lectin_c,su selectin E (endothelial adhesion molecul



SS 

.SS.CUB.XBnk. 

PDZ.SS 

SS 

SS.SS 

rfetone.SS.histone.histone 
SS 

.SS.Opocalin 
.SS.PeptidaseJ/11, 
.SS.TM.SNF 
,SS,UM, 
SS 



AF026944 Hs.293797 
AA576953 Hs.22972 
AJ224741 Hs.278461 
AW732573 Hs.47584 
BE007371 Hs.200313 
AI357412 Hs.157601 
H87879 Hs.102267 
NM_002497 Hs.153704
AL049689 Hs.156369
R28363 Hs.24286
AI907673
AA410943

AL360204 Hs.283853

H57646 Hs.42586
N47863 Hs.336901
AF026941 Hs.17518
BE178536 Hs.11090
AL080207 Hs.134585
R17798 Hs.7535
U79293 Hs.159264
NM_007050 Hs.225952
AB007948 Hs.158244
BE440042 Hs.83326
NM_000685 Hs.89472
U80736 Hs.110826
AL120173 Hs.301663 
AW016531 Hs.122147 

BE242870 Hs.75379 

W68815 Hs.301885 
NA 

AJ245671 Hs.12844 

AI879148 Hs.26770 
NA 

AJ418055 Hs.161160 



hypothetical protein FLJ14834
tumor necrosis factor, alpha-induced pro
PDZ domain containing 1
ESTs

preferentially expressed antigen in mela
H2A histone family, member A
hypothetical protein

Homo sapiens, Similar to RIKEN cDNA 2010

ESTs

ESTs

gb:zk15e04.s1 Soares_pregnant_uterus_NbH
ESTs 



hemopexin,Peptidase_M10,SS,Peptidase_M10 matrix metalloproteinase 10 (MMP10; str



hjstone t SS,histone,histone 
,SS,ttetone,histone 1 linkerJiistone 
,SS.RibosomaLL7Ae l 
TM 

.SS.DENN 
SS 

,SS.Lysy1_oxWase 
SS 

.SS.G9a.PHD, 
TM 

.SSJPR 

SSJM.UPF0016,SS,TM,UPF0016 
SS.EGF.vwa.SS.TM.vwa, 
TM,K^tBtraJorutraris.SS.TM l K L .tetra.ionJ 
,SS,TM,Fotete_carrier 
SS 



H3 histone family, member A
ESTs
ESTs

Homo sapiens breast cancer antigen NY-BR
calmodulin 2 (phosphorylase kinase, delt
programmed cell death 9 (PDCD9)
lysyl oxidase

aldehyde dehydrogenase 9 family, member

transcription factor

ESTs

ESTs

hypothetical protein FLJ13352
matrilin 3

potassium voltage-gated channel, delayed 
ESTs 
ESTs 



SS,Lysyl_oxidase,Aldose_epim,Epimerase,S lysyl oxidase
pkinase,SS,TM,pkinase,polyprenyl_synt NIMA (never in mitosis gene a)-related k



SS 

t SS,WtmJ,p450,rrm 



a^m,ZU5,TM>\cfivin,recp,pkinase l 

SS 

SS 

.SSAcyttransferase, 



hypothetical protein similar to tenascin 
ESTs 

gb:l^T152-08039^004 BT152 Homo sapien 
gb:zt32h03.r1 Soares ovary tumor NbHOTH
Homo sapiens mRNA full length insert cDN
NM_024626:Homo sapiens hypothetical prot
KIAA1560 protein 



IsS.RN/CjwlAR^oU^Rto^ Protein S24 



JM.IBR 
.SS.TM 

,SS,TM J BRCT,ank^BCJran,ABCJran 
.SS^orKJiead. 
SS 

l SS,TM.Yj)hosphatase,MAM 1 fn3 l 
.SS.lanijnin^B.laminin^EGF.Iaminin.Ntemi 



Homo sapiens cig5 mRNA, partial sequence 
membrane-spanning 4-domains, subfamily A 
DKFZP434G232 protein 
COBW-like protein

Human clone 23S46 mRNA sequence 
protein tyrosine phosphatase, receptor t 
KIAA0479 protein 



SS,Peptidase_M10,hemopexin,SS,Peptidase_ matrix metalloproteinase 3 (stromelysin

+*r\ -n i 1. a rsn Tinim A onftrnfanctn ronontnr 1 



SSJM,7tmJ t SS,TM,7tmJ 
SS 

.SS.pkinase, 
.SSArfGap, 
SS.TKSSJM 
SS 
SS 

.SS.carb.anhydrase 
,SS,TM/as 
SS.ripocaMpocaQn, 
BTB.SS 



angiotensin receptor 1 
trinucleotide repeat containing 9 
ESTs 
ESTs 

C10001899:gi|7508633|pir||T25392 hypothe
solute carrier family 1 (glial high affi
Homo sapiens cDNA FLJ11346 fis, clone PL
NM_030920*:Homo sapiens hypothetical pro
EGF-like-domain, multiple 6 (EGFU5)
fatty acid binding protein 7, brain
C12001521:gi|7513934|pir||T31081 cca3 pr
ESTs
Hs.202676
Hs.129598 
Hs.169849 
Hs.272620 



Hs.25933 
Hs.77367 

Hs.115263 



416220 N49776 Hs.170994
431808 M30703 Hs.270833
414142 AW368397 Hs.150042
400298 AA032279 Hs.61635
418601 AA279490 Hs.86368
415539 AI733881 Hs.72472
421451 AA291377 Hs«5083i
429432 AI678059
442441 AI820662
426429 X73114
406687 M31126
448693 AW004854 Hs.228320
419948 AB041035 Hs.93847
426214 H59846 Hs.128355
427718 AI798680
414812 X72755
400285 NA
422330 D30783
416182 NM_004354 Hs.79069
420077 AW512260 Hs.87767
452281 T93500 Hs.28792
434531 AA642007 Hs.116369
408380 AF123050 Hs.44532
443788 AI732643 Hs.144151
411078 AI222020 Hs.182364
445495 BE622641 Hs.38489
433426 H69125 Hs.133525
424871 NM_004525 Hs.153595
426215 AW963419 Hs.155223
409045 AA635062 Hs.50094
435525 AI831297 Hs.123310
409203 AA780473 Hs.687
424902 NM_003866 Hs.153687
431725 X65724 Hs.2839
418092 R45154 Hs.106604
439840 AW449211 Hs.105445
427811 M81057 Hs.180884
420807 AA280627 Hs.57846
426320 W47595 Hs.169300
447078 AW885727 Hs.301570
415786 AW419196 Hs.257924
410102 AW248508 Hs.279727
404347
433687 AA743991
421373 AA808229 Hs.167771
422634 NM_016010 Hs.118821
453310 X70697 Hs.553
435957 N39015 Hs.190368
407771 AL138272 Hs.62713
443646 AI085198 Hs.164226
446142 AI754693 Hs.145968
444649 AW207523 Hs.197628
435147 AL133731 Hs.<!774
439138 AI742605 Hs.193696
429220 AW207206 Hs.136319
428804 AK000713 Hs.193736
453511 AL031224
439809 R41396
414869 AA157291 Hs.21479
416276 U41060 Hs.79136
452862 AW378065 Hs.8687
452926 AI742170 Hs.31297
453331 AJ240665
420802 U22376
450603 R43646
422867 132137
418004 U37519



Hs.33102 
Hs.101774 



Hs.8895 

Hs.1334 

Hs.12422 

Hs.1584 

Hs.87539 



,SS,TM 

SS.TM,EGF,SS 

.SS.UDPGT 

TM 

SSJM,calreticuHn,SSJM,calreticutin, 

death,ZU5,TM f Advin_recp,pkinase, 

TM 



,SSJM,m3,lg, 



hypothetical protein MGC10946 

amphiregulin (schwannoma-derived growth

Homo sapiens cDNA FLJ14438 fis, clone HE

six transmembrane epithelial antigen of

calmegin

BMP-R1B

ESTs

synaptonemal complex protein 2
ESTs

myosin-binding protein C, slow-type



SS,Peptidase_M10,hemopexin,SS,Peptidase_ matrix metalloproteinase 11 (MMP11; stro



SS 

Ferric_reductTM,Ferric_reduct, 
SS 

,SS ( TM,hisione,Sec1 .histone.sugarjr 
SS ( IL8,SS,IL8 

,TM,ABCJran r ABCjTiembrane. 

SS,TM,EGF,SS,TM 

cydin,SS 

SS 

.SS.TGF-betaJGFbjjropeptide, 
SS 

,SS,TM,ublquitin,7tmJ,ANF_receptor l sush 
TM 
SS.SS 



hypothetical protein FLJ23537
NM_016931:Homo sapiens NADPH oxidase 4 (
ESTs, Moderately similar to ALU7_HUMAN A
ESTs

monokine induced by gamma interferon

Eos Control

epiregulin

cyclin G2

ESTs

Homo sapiens cDNA FLJ11041 fis, clone PL

ESTs 

diubiquitin 

ESTs 

CocoaCrisp 



SS I SS,ENTHJ_LWEQ,ENTH J.LWEQ.DNAjriisjeESTs, Weakly similar to I38022 hypothetl 



,SS,TM 

SS.EGF.WLrBcepLa.ULrecepLb.SS.TM.E 

SS 

TM 

TM 

SS,p450 ( SS.p450 
SS.SS 

SS,CysJnot,SS 
l death7U5 f pkinase^ctivin_recp, 
SS 



ESTs 

low density lipoprotein-related protein
stanniocalcin 2

Homo sapiens mRNA; cDNA DKFZp434O0515 (f
ESTs

cytochrome P450, subfamily IVB, polypept
inositol polyphosphate-4-phosphatase, ty
Norrie disease (pseudoglioma)
ESTs

GDNF family receptor alpha 1



SS,ZrLcaroOpeptPropepJ^14,SSPropep_M14carboxypepfidase B1 (tissue) 



SS.cpnIO 
SS.TGF-beta i TGFb4)ropeptide ( SS 
.SS.kazal, 
SS 
SS 
SS 
TM 

.SSJMPDa.CJMPDH^N.CBS 
SS 

TM,SNF,SS,TM,SNF, 
,SS,TM 

.TM.cpneOJTCPl.Sema, 
,TSPNwtspJ,EGF,thiored, 
,TM,cadherin,Cadherin^C_term l 
.SS.rrm, 

,TM ( SOF,UPAR_LY6, 

TM 

SS 

.SS.UDPGT 
SS.SS 
SS 
SS 

SS.TW.TM 

,SS,Pep_M12Bj)ropep,Reproiys{n,tspJ t 
.SS.TM 



ESTs 

transforming growth factor, beta 2 
ESTs 

hypothetical protein FU13782 

Homo sapiens cDNA FU 14035 fis, clone HE 

Target Exon 

gb:ny57g01.s1 NCLCGAPJW Homo sapiens 
ESTs 

CGl-62 protein 

solute carrier family 6 (neurotransmitte 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA OKFZp761C1712 (f 

ESTs 

ESTs 

hypofoetlcal protein FU20706 
transcription factor AP-2 beta (activati 
hypothetical protein FU23045 
ublnuctein 1 

UV-1 protein, estrogen regulated 
ESTs 

duodenal cytochrome b 



,SS,TM,disIntegrin,PepJ412Bj)ropep^epro ESTs 
SS«NA,mybJ)NA-bIndlng v-myb avian myeloblastosis viral oncogen 

SS ESTs 

SS,EGF,tsp J,SS,E2FJTDP, cartilage oligomeric matrix protein (COM 

SS.'atdedh.SS.aldedh, ~ aldehyde dehydrogenase 3 family, member 
426451
450701 
419519 
410555 
433138 
411558 
409079 
417275 
432731 
442818 
407366 
427427 
410785 
401045 

418986 
4420B2 
442861 
418836 
422060 
444381 
404091 
417866 
428819 
410275 
425236 
415669 
416319 
412140 
442942 
446163 
443162 
409602 
428479 
400300 
433404 
410079 
401781 
447359 
402230 
427674 



A1908165 

H39960 

AI198719 

U92649 

AB029496 

AA102670 

W87707 

X63578 

R31178 

AK001741 

AF026942 

AF077345 

AW603341 



Hs.169946 

Hs.288467 

Hs.176376 

Hs.64311 

Hs.59729 

Hs.70725 

Hs.82065 

Hs.295449 

Hs.287820 

Hs.8739 

Hs.177936 



C11001883*:gil6753278|n 
AI123555 Hs.81796 
R41823 Hs.7413 
AA243837 Hs.57787 
AI655499 Hs.161712 
R20893 Hs.325823 
BE387335 Hs.283713 
NA

AW067903 Hs.82772
AL135623 Hs.193914
U85658 Hs.61796
AW067800 Hs.155223
NM_005025 Hs.78589
AI815601 Hs.79197 
AA219691 Hs.73625 
AW167087 Hs.131562 
AA026880 Hs.25252 
T49951 Hs.9029 
Hs.256972 
Hs.184572 



SS.GATA, 
,SS,IRR 
SS 

.TMdishlegrin.Reprofysin, 
SS,lg,Sema,SS,Sema,efhand 
SSJM.SSJM 
,TM,m3, 

SS.eQiand.SS.efhand.ras 
.SS.frdMfrtf.&^fnl 
WO40.SS 
JM.IBR 
SS.Iedin_c,SS 
SS 

ICE_p20.SS.lCE_p10.ICE _j)20.lCE_p10,ICE_p 



GATA-blnding protein 3 (T-cell receptor 
Homo sapiens cDNA RJ12280 fis, done MA 
ESTs 

a dlsintegrin and metalloproteinase doma 
semaphorin sem2 

gamma-aminobutyric acid (GABA) A recepto 



paralbumin 
fibronectin 1 

hypothetical protein FU10879 
gb:Homo sapiens cig33 mRNA, partial sequ 
ESTs 

gb:IL2-UM0079-090300-05(W303 UM0079 Homo 



W26713 
Y00272 
X03363 
T32982 
U94362 



Hs.102720 
Hs.58589 



458098 
419968 
416636 
419703 
424687 
449679 
421296 
442117 
400303 
419440 
444858 
432239 
440705 
400286 
446466 
423201 
433043 
439509 
425247 
409757 
425292 
448045 
452681 
452243 
439310 



NNL012093HS.18268 
NA 

NM_003528Hs.2178 
AI249368 Hs.98558 
BE550224 Hs.74170 
X04430 Hs.93913 



N32536 
AI793257 
J05070 
AI823951 



Hs.42645 
Hs.128151 
Hs.151738 
Hs.129700 



NM_002666 Hs.103253
AW664964 Hs.128899
AA242758 Hs.79136
AB020689 Hs.90419
AI199738 Hs.208275
X81334 Hs.2936
AA904244 Hs.153205
NA

H38026 Hs.308
NM_000163 Hs.125180
W57554 Hs.125019
AF086332 Hs.58314
NM_005940 Hs.155324
NM_001898 Hs.123114
NM_005824 Hs.155545
AJ297436 Hs^0166
AF153330 Hs.30246
AL355715 Hs.28555
AF086120 Hs.102793



!f|NPJ33938.1|c 
.SS.Reprolysin.tspJ, 
,mEPHJbd,pkinase.SAM ( fn3. 
SS 

,TM,ActMnjecp ( pldnase,death,ZU5, 

,SS,TM,CD36 

,SS,rruto_carr 

,TM,7trnJ,ANFjBceptor t 

SS.ColIagen.COLFl.TSPN.SS.TSPN 

ss.ss 

l SS,RibosomaLS4e 
SS 

.SS.serpii, 

SS,TM,ig.SS,TM 

.SSJdnesin, 

,SS,)g,Sema,pkinase, 

,SSJM,fn3 t 

filament.SS.fflamentfilament 
.SS/TM.DAGKa.DAGKc, 
,SS,pkinase t pkinase 



6.7 
ESTs 

ESTs; calsyntenin-2 

ESTs 

ESTs 

ESTs, Moderately similar to ALU5_HUMAN A 
ESTs, Weakly similar to S64054 hypotheti 
Target Exon 

colagen, type XI, alpha 1 
KIAA0575 gene product 
transcription factor AP-2 gamma (acfivat 
stannfocatcin 2 

serine (or cysteine) proteinase inhibito 
CD83 antigen (activated B lymphocytes, i 
RAB6 interacting, kinesln-Qke (rabkines 
ESTs 

prolactin receptor 
DKFZP434G032 protein 
ESTs 

ceil division cycle 2, G1toSandG2to 



,SS 1 TMpkhase,Recep_L.domain,SH2,PH l FurHER2 receptor tyrosine kinase (c-erb-b2, 



SS 

GlycoJransf_8,SS 

.SS.filamentPribosyllran.filamentArmad 

SS,adenylatekinase, 

.SS.TM^SQ, 

Wstone,SS,histone, 

.SS.TM 

SS 

SS.1L6JL6, 
,SS,TM 
,SS,zf-C2H2, 



ESTs 
glycogenln2 
Target Exon 
adenylate kinase 5 
Target Exon 

H2S histone family, member Q 
ESTs 

metallothianein 1E (functional) 
Interieukln 6 (interferon, beta 2) 
solute carrier family 16 (rnonocarboxytic 
ESTs 



SS l PeptkJase_M10,m2,hemopexin ) SS,TM r Pepmatrix metalloproteinase 9 (gelatinase B 
SS tofloid-tike 1 

SS perfitpin 
,SS,TM ESTs 
,SS,TM UV-1 protein, estrogen regulated 

SS WAA0882 protein 

SS ESTs, Weakly similar to ALUAJ1UMAN 1111 

SSPeptidase_M 10 l ternopexin,SSpeptidasejnatnx metalloproteinase 13 (collagenase 



TM 

SS,TM,ABCJran.ABCjTiembrane,SS 

arrestiaSS 

SS,TM,fo3,SS 

SS 

,SS,TM.Syntaxtn 

SS.PeptidaseJ/MO.hemopexin.SS 

,SS,cystatin, 

SS 

,SSJM 
,SS,TM 
SS 

^S.TM.UDPGT.casejn.kappa 



ESTs 

C16000922^iI7499103|pir||T20903 hypothe 

arrestln 3, retinal (X^rrestin) 

growth hormone receptor 

lymphoid nuclear protein (LAF-4) mRNA 

ESTs 

matrix metalloproteinase 1 1 (MMP1 1; stro 
cystafinSN 

37 kDa leudne-rich repeat (LRR) protein 

prostate stem ceB anBgen 

solute earner family 19 (thiamine trans 

programmed ceH death 9 

ESTs 



7.1 
7.1 
7.1 
7.1 
7.0 
7.0 
7.0 
7.0 
6.9 
6.9 
6.8 
6.8 
6.7 



6.7 
6.7 
6.6 
6.6 
6.6 
6.6 
6.6 
6.6 
6.5 
6.4 
6.2 
6.2 
6.2 
6.2 
6.2 
6.1 
6.1 
6.1 
6.1 
6.1 
6.1 
6.1 
6.1 
6.1 
6.1 
6.1 
6.0 
6.0 
6.0 
6.0 
5.8 
5.8 
5.8 
5.8 
5.7 
5.7 
5.7 
5.7 
5.6 
5.6 
5.6 
5.5
5.5 
5.5 
5.4 
5.4 
5.4 
5.4 
5.4 
5.3 
5.3 
5.2 
441111 AI806867

452355 N54926 

427711 M31659 

418636 AW749855 

429353 AL1 17406 

441690 R81733 

430447 W17064 

429698 A1685086 

425325 X52730 



Hs.200102 
Hs.33106 
Hs.332648 
Hs.26339 
Hs.1692 

phenylethanolamine N-methyltransferase
423600 AI633559 Hs.310359 SS
414737 AI160386 Hs.125087 SS
403593 NA

407758 D50915 Hs.38365
445234 AW137636 Hs.146059
411165 NM_000169 Hs.69089
420633 NM_014581 Hs.274480
414117 W88559 Hs.1787
416783 AA206186 Hs.79889
401093

411096 U80034 Hs.68583
457411 AW085961 Hs.130093
436007 AI247716 Hs.232168
450506 NM_004460 Hs.418
417975 AA641836 Hs.30085
421072 AI215069 Hs.89113
427032 AF012023 Hs.173274
447752 M73700 Hs.105938
403199 NA

427122 AW057736 Hs.323910
445900 AF070526 Hs.13429
413048 M93221 Hs.75182
419563 AA526235 Hs.193162
442432 BE093589 Hs.38178
452093 AA447453 Hs.27860
442323 AW016669 Hs.29190
450606 AI668605 Hs.60380
435542 AA687376 Hs.269533
417576 AA339449 Hs.82285
446089 AI860021 Hs.270651
445413 AA151342 Hs.12677
424420 BE614743 Hs.146688
432378 AI493046 Hs.146133
Hs.91668
Hs.136985
Hs.76915
Hs.1406



ESTs 

G protein-coupled receptor 34 
solute carrier family 25 (mitochondrial 
gb:QV4-BT0534-281299453-c05 BT0534 Homo 
ATP-blndlng cassette transporter MRP8 
ESTs 

SWI/SNF related, matrix associated, acti 
ESTs, WeaWy slmflar to S21348 probable 



Hs.126594 ,SS,TM,Phosphodiest 
Hs.29202 TM.7trru1.TM 
Hs.180408 SS 

,SS,TM,HECT 

.SSJM.ABCJrarvABCjnembrane, 
,SS,HECT,zf-UBR1,PABP.144-3, 
SS 

,SS,ras, 

SS.NNMT _PNMTJ'EMT I SS,NNMT_PNMT - .TEMT,STAR 

5.1 
ESTs 

SS ESTs 
,CIDE-N,pkinase Target Exon 

SS,SS KIAA0125 gene product 

,SS,TM ESTs 
SS,MeIiblase l BTK,PH ( pkinase l SH2 ( SH3,Ribo gatactosidase, alpha 
SS,Gpc^lin,SS,Gpocalin odorant-binding protein 2A 

.T^iorutrans.K^tetra, proteoHpid protein 1 

SSJM.TM monocyte to macrophage difTerentiation-a 

TM.LRRCT.TM.LRRCT, C12000586*:gi]6330167|db]jBAA86477.1| (A 

Peptidase j/13, mitochondrial intermediate peptidase 

SS ESTs 
.SS.adruzinc, ESTs 
SS t OPPlV_N - term,Peptidase_S9,SS,DPPlV - N - fibroblast activation protein, alpha 



.SS.trypsin 
SS 

.SS.14-3-3 

SS,transferrin.7tm_1 .transferrin, 
SS.TM.Folatejarrier.SS.TM/oiatejarrie 



hypothetical protein FU23186 
ESTs 

Integrin cytoplasmic domain-associated p 
bctotransferrin 

NMJ)25243*:Homo sapiens solute carrier f 



,SS,THpWrase l Reap_Ldomain,SH2,PH,FurHER2 receptor tyrosine kinase (c-erb-b2, 



,SS,Ca_channet.B, 

SS,TM.tectiruc.fn2 l Ricin.B_!ectin ( SS,TM 
SS 
SS 

.SS.TNtftmJ, 
.SS.TM.CBS.voltageJXC 
,TM ( GlycoJ)ydroJ 
1 SS l pWnase.RhoGEF,& 1 PH,SH3 1 



Homo sapiens clone 24787 mRNA sequence 
mannose receptor, C type 1 
Homo sapiens cONA FU1 1983 fis. clone HE 
hypothetical protein FU23468 
Homo sapiens mRNA; cDNA DKFZp586M0723 (f 
ESTs 

ESTs, Moderately similar to ALU6_HUMAN A 
ESTs 



452190 H26735 

434674 AA831879 

419986 AI345455 

421582 AI910275 

410361 BE391804 Hs.62661 

426327 W03242 Hs.44898 

406639 M97711 

452834 AI638627 Hs.105685 

427315 AA179949 Hs.175563 

446733 AA863360 Hs.26040 

442118 AA976718 Hs.202242 

421524 AA312082 Hs.105445 

453060 AW294092 Hs.21594 

453403 BE466639 Hs.61779 

444301 AK000136 Hs.10760 

453619 H87648 Hs.33922 

432656 NM_000246 Hs.3076

426384 AI472078 Hs.303662

431701 AW935490 Hs.14658

416931 D45371 Hs.80485
420854 AW296927
418867 D31771 Hs.89404

443514 BE464286 Hs.141937 
447499 AW262580 Hs.147674 



AIRS,formyLtransf > GARS,SS,GARSAlRS,for phosphoribosylglycinamlde formyltransfer 
,pkhase ESTs, Moderately similar to A47582 B-cel 

SS.UPF0099.SS.UPF0099, 
,SS,TM f MAPEG, 
.SS.TM.UDPGT 

,SS,TM,PH,SH2/urin-like l pklnase,RecepJ. 
.SS^isLdeacetyl. 



pkinase,OPR, 

SS,trefoa i SSJM,ldLrecepLa,SRCR,tryps 
SSJM.GBP.TM.GBP 
SS 

SS.SSto, 

,S$,DEAD,FofK_head 
SS 

,SS,TM.p450, 

.tg.Sema, 

SS 

.SS.ras.Yjjhosphatase.ras 

,SS.HMGJxw,filament, 

SS.LRR.SS 

SS 

SS^RR, 

.SS^fGap. 

.SS.BIR 

$S,C1q,Collagen,SS,C1q, 
t SS ( TM,Peptidase.M1, 
SS.homeobox.homeobox, 
.SS.TM.MIP, 

,SSJM,caa7ienn f cadherin 



CGM47 protein 
prostaglandin E synthase 
ESTs 

Homo sapiens clone PP1498 unknown mRNA 
ESTs 

GA-oinding protein transcription factor, 
trefoil factor 1 (pS2) 
guanylate binding protein 1, interferon- 
Homo sapiens clone TCCCTA00151 mRNA sequ 
gkHuman T-cell receptor (V beta 18.1, J 
WAA1688 protein 

Homo sapiens mRNA; cONA DKFZp564N0763 (f 
ESTs, Weakly similar to fatty acid omega 
ESTs 

GDNF family receptor alpha 1 
hypothefical protein MGC15754 
Homo sapiens cONA FU13591 fis, clone PL 
asportn (LRR dass 1) 

Homo sapiens, clone MGCS084, mRNA, comp 

MHC class II transacthrator 

ESTs 

Human chromosome 5q13.1 done 5G8 mRNA 
adipose most abundant gene transcript 1 
gb^^0-a|(>o<)7-(MJKs1 NCL_CGAP_Su 
msh {Orosophaa) homeo box homolog 2 
ESTs 

protocadherfnbeta 16 



5.2 
5.2 
5.2 
5.2 
5.1 
5.1 
5.1 
5.1 



5.1 

5.1 

5.1 

5.0 

5.0 

4.9 

4.9 

4.9 

4.9 

4.9 

4.9 

4.9 

4.9 

4.9 

4.9 

4.8 

4.8 

4.8 

4.8 

4.8 

4.7 

4.7 

4.7 

4.6 

4.6 

4.6 

4.6 

4.6 

4.6 

4.6 

4.6 

4.5 

4.5 

4.5 

4.5 

4.5 

4.5 

4.5 

4.5

4.5 

4.5 

4.4 

4.4 

4.4 

4.4

4.4 

4.4 

4.4 

4.3 

4.3 

4.3 

4.3 

4.3 

4.3 

4.3 

4.3 

4.3 
Pkey | ExAccn | UnigeneID | PredProtDomains | Unigene Title | R1
441560 | F13386 | Hs.7888 | ,pkinase, | Homo sapiens clone 23736 mRNA sequence | 4.3
409064 | AA062954 | Hs.141883 | ,SS,CUB, | ESTs | 4.3
422667 | H25642 | Hs.133471 | ,SS,TM,FMO-like | ESTs | 4.3
454032 | W31790 | Hs.194293 | ,SS,TM | ESTs, Weakly similar to I54374 gene NF2 | 4.3
432663 | AI984317 | Hs.122589 | TM | ESTs | 4.3
401747 |  |  | ,SS,filament,filament | Homo sapiens keratin 17 (KRT17) | 4.3
432882 | NM_013257 | Hs.279696 | pkinase,pkinase_C, | serum/glucocorticoid regulated kinase-li | 4.2
437036 | AI571514 | Hs.133022 | ,SS,TM | ESTs | 4.2
447754 | AW073310 | Hs.163533 | ,pkinase, | Homo sapiens cDNA FLJ14142 fis, clone MA | 4.2
443194 | AI954968 | Hs.279009 | ,SS,TM | matrix Gla protein | 4.2
451871 | AJ821005 | Hs.118599 | ,SS,GDNF, | ESTs | 4.2
457211 | AW972565 | Hs.32399 | WH1,WH1 | ESTs, Weakly similar to S51797 vasodilat | 4.2
421566 | NM_000399 | Hs.1395 | zf-C2H2,SS | early growth response 2 (Krox-20 (Drosop | 4.2
431657 | AI345227 | Hs.105448 | ,SS,TM,pkinase | ESTs, Weakly similar to B34087 hypotheti | 4.1
427899 | AA829286 | Hs.332053 | ,SS,SAA_proteins,ABC_membrane,ABC_tran, | serum amyloid A1 | 4.1
444779 | AI192105 | Hs.147170 | SS | ESTs | 4.1
442295 | AI827248 | Hs.224398 | ,COLFI,vwc,Collagen, | Homo sapiens cDNA FLJ11469 fis, clone HE | 4.1
436396 | AJ683487 | Hs.152213 | ,wnt, | wingless-type MMTV integration site fami | 4.1
446039 | AI150491 | Hs.90756 | ,TM,Glyco_hydro_1 | ESTs | 4.1
422938 | NM_001809 | Hs.1594 | ,SS,TM,thiolase, | centromere protein A (17kD) | 4.1
406922 | S70284 |  | SS,TM,Desaturase,SS | gb:stearoyl-CoA desaturase [human, adipo | 4.1
439285 | AL133916 | Hs.172572 | ,SS,ig,pkinase,LRRNT,LRRCT, | hypothetical protein FLJ20093 | 4.1
424800 | AL035588 | Hs.153203 | HLH,SS | MyoD family inhibitor | 4.1
429922 | Z97630 | Hs.226117 | ,SS,TM,linker_histone,7tm_1 | H1 histone family, member 0 | 4.1
447178 | AW594641 | Hs.192417 | ,SS,TM | ESTs | 4.0
409038 | T97490 | Hs^0002 | SS,IL8,SS,IL8 | small inducible cytokine subfamily A (Cy | 4.0
452747 | BE153855 | Hs.61460 | ,SS,HLH | Ig superfamily receptor LNIR | 4.0
420139 | NM_005357 | Hs.95351 | .SSJM^SO, | lipase, hormone-sensitive | 4.0
408877 | AA479033 | Hs.130315 | ,SS,TM | ESTs, Weakly similar to A47582 B-cell gr | 4.0
403329 | NA |  | SS,SS | Target Exon | 4.0
439926 | AW014875 | Hs.137007 | SS | ESTs | 4.0
430832 | AI073913 | Hs.100686 | SS | ESTs, Weakly similar to JE0350 Anterior | 4.0
432481 | AW451645 | Hs.151504 | ,SS,Collagen,COLFI,TSPN, | Homo sapiens cDNA FLJ11973 fis, clone HE | 4.0
452410 | AL133619 | Hs.29383 | ,SS,TM,ras | Homo sapiens mRNA; cDNA DKFZp434E2321 (f | 4.0
418661 | NM_001949 | Hs.1189 | SS | E2F transcription factor 3 | 4.0
431958 | X63629 | Hs.2877 | SS,TM,Cadherin_C_term,cadherin,SS,TM,cad | cadherin 3, type 1, P-cadherin (placenta | 4.0
425071 | NM_013989 | Hs.154424 | SS,T4_deiodinase,T4_deiodinase, | deiodinase, iodothyronine, type II | 4.0
447197 | R36075 |  | ,TM,SDF, | gb:yh88b01.s1 Soares placenta Nb2HP Homo | 4.0
428722 | U76456 | Hs.190787 | ,SS,TIMP, | tissue inhibitor of metalloproteinase 4 | 3.9
428330 | L22524 | Hs.2256 | SS,Peptidase_M10,SS,Peptidase_M10,hemope | matrix metalloproteinase 7 (MMP7; uterin | 3.9
423242 | AL039402 | Hs.125783 | SS | DEME-6 protein | 3.9
449048 | Z45051 | Hs.22920 | SS,SS,TM | similar to S68401 (cattle) glucose induc | 3.9
414831 | M31158 | Hs.77439 | ,SS,cNMP_binding,RIIa,HMG_box | protein kinase, cAMP-dependent, regulato | 3.9
413589 | AW452631 | Hs.313803 | ,SS,abhydrolase | ESTs, Highly similar to AF157833 1 none! | 3.8
408875 | NM_015434 | Hs.48604 | SS | DKFZP434B168 protein | 3.8
418629 | BE247550 | Hs.86859 | SS,SH2,PH,SS,TM,PH,SH2,furin-like,pkinas | growth factor receptor-bound protein 7 | 3.8
450787 | AB006190 | Hs.25475 | SS,TM,MIP,SS,TM,MIP, | aquaporin 7 | 3.8
414870 | N72264 | Hs.300670 | SS | KIAA1204 protein | 3.8
450325 | AI935962 | Hs.26289 | SS | ESTs | 3.8
407633 | NM_007069 | Hs.37189 | TM,TM | similar to rat HREV107 | 3.8
426172 | AA371307 | Hs.125056 | ,SS,DENN | ESTs | 3.8
442262 | BE170651 | Hs.8700 | ,SS,START, | deleted in liver cancer 1 | 3.8
427961 | AW293165 | Hs.143134 | SS | ESTs | 3.8
445563 | AW873606 | Hs.149006 | ,SS,WH1,WH1 | ESTs | 3.8
403943 |  |  | p450,SS,p450 | C5000355:gi|4503225|ref|NP_000765.1| cyt | 3.8
408761 | AA057264 | Hs.238936 | ,SS,TM,7tm_1, | ESTs, Weakly similar to (defline not ava | 3.8
423279 | AW959861 | Hs^90943 | SS | ESTs | 3.8
420440 | NM_002407 | Hs.97644 | ,SS,SRCR,Uteroglobin | mammaglobin 2 (MGB2; mammaglobin B; Op | 3.8
445107 | AI208121 | Hs.147313 | ,SS,TM | ESTs, Weakly similar to I38022 hypotheti | 3.7
428303 | AW974476 | Hs.183601 | SS,RGS,RGS,RGS | regulator of G-protein signalling 16 | 3.7
411667 | BE160193 |  | TM | gb:QV1 W0413-O10^&0594i03 HT0413 Homo | 3.7
427809 | M26380 | Hs.180878 | ,SS,lipase,PLAT,Sec7,PH, | lipoprotein lipase | 3.7
418203 | X54942 | Hs.83758 | CKS,SS,CKS, | CDC28 protein kinase 2 | 3.7
430376 | AW292053 | Hs.12532 | SS | chromosome 1 open reading frame 21 | 3.7
444190 | AI878918 | Hs.10526 | SS | cysteine and glycine-rich protein 2 | 3.7
433495 | AW373784 | Hs.71 | SS,ig,MHC_I,connexin,SCAN,SS,TM | alpha-2-glycoprotein 1, zinc | 3.7
429638 | AI916662 | Hs.211577 | SS,TM,SS | kinectin 1 (kinesin receptor) | 3.7
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454071 
451859 
420281 
427691 
428824 
424676 
418026 
457465 
417601 
407S9S 
425548 
446619 
411213 
406625 
417511 
428769 
407137 
401866 
451195 
426044 
426310 
440029 
408573 
431830 
444781 
431493 
414175 
411789 
418851 
453968 
407104 
449051 
434398 
454042 
459496 
414998 
407756 
442101 
449722 
452554 
421991 
420058 
425776 
407846 
406925 
445873 
418054 
414921 
452268 
428862 
412520 
410530 
401780 
447131 
418334 
415138 
421168 
431473 
421379 
411984 
408101 
405366 
414612 
411393 
435767 
416406 
433068 



A1041793 Hs.42502 
H44491 Hs.252938 
AI623693 Hs.191533 
AW194426 Hs.20726 
W23624 Hs.173059 
Y08565 Hs.151678 
BE379727 Hs.33213 
AW301344 Hs.122908 
NMJ>14735Hs.82292 
AJ126271 Hs.49433 
AA890023 Hs.1906 
AU076643 Hs.313 
AA676939 Hs.69285 
Y13647 Hs.1 19597 
AL049176 Hs.82223 
AW207175 Hs.106771 
T97307 

U10492 Hs.438 
AA5O2490 Hs.336695 
NM^000909Hs.169266 
AW089705 Hs.293711 
AA284775 Hs.43148 
Y16645 Hs.271387 
NM_014400Hs.11950 
AI791493 Hs.129873 
AI308876 Hs.103849 
AF245505 Hs.72157 
AI417828 Hs.192435 
AA847843 Hs.62711 
S57296 Hs.323910 
AW961400 Hs.333526 
AA121098 Hs.3838 
H22570 Hs.172572 
AA808940 Hs.274450 
NMJ002543HS.77729 
AA116021 Hs.38260 
A1651930 Hs.135684 
BE280074 Hs.23960 
AW452434 Hs.58006 
NM.014918HS.110488 
AK001423 Hs.94694 
U25128 Hs.159499 
AA426202 Hs.40403 
L34041 Hs.9739 
AA250970 Hs.251946 
NM.002316HS.83354 
BE390551 Hs.77628 
NH.003512HS.28777 
NM.000346Hs.2316 
AA442324 Hs.795 
M25809 Hs.64173 

NMJ)04585Hs.17466 
AA319233 Hs.5521 
C18356 Hs.295944 
AH82277 Hs.330780 
AA825686 Hs.321176 
Y15221 Hs.103982 
NM.005419HS.72988 
AW968504 Hs.123073 

BE274552 HsJ6578 
AW797437 Hs.69771 
H73505 Hs.1 17874 
086961 Hs.79299 
NM_006456Hs.288215 



,TM.7tmJ, 

.SSJM.EGF.WLrecepLa.ldLrecepUEGF 
.SS.AAA, 

,SS,Glycos_transL2, 
SS 

GlycosJransf^.RidnJUectin.SS.GIycos 

npccaSn.SS.ripocarnjipocalin.fcrrltj 

.SS.Pribosyitran.SuIfatase 

PHD.pkhase.SS 

SS 

SS.TM.fn3.SS.TM.fti3. 
.SS.TM.elhand.ionJrans 



SSJM t aJB l F5_F8^type„C,^ l SSJM.CUB l F5_neuropiIin1 



ESTs 

ESTs, Weakly simOar to ALMJ-iUMAN ALU S 

ESTs 

ESTs 

ESTs 

UOP-N-acetyl-a!pha-0-galactosamine:potyp 
laity acid binding protein 4, adipocyte 
DNA replication factor 
KlAA0215gene product 

ESTs, Weakly similar b YZ28 JiUMAN HYPOT 
prolactin receptor 

secreted phosphoproteln 1 (osteopontin, 



SS,TM,Desaturase,SS 
SS 

.SS.7tmJ.SPRY, 

,SS,TM,GDA1_CD39 

.SS.fiiament 

SS,homeobox,Ets,SSMneobox, 
SS 

SS.TM.7tmJ, 
SS 

,SS,TM,PMP22LCIaudin, 
.SS,TM,IL8 
,SS,PH,lactamase_B 
,SS,p450.p450 



stearoyi-CoA desaturase {delta-9-desatur 
chord in-Oke 
ESTs 

gb:ye53h05.s1 Soares fetal liver spleen 
TaigetExon 

mesenchyme homeo box 1 
ESTs 

neuropeptide Y receptorYl 
ESTs, Weakly similar to S64329 probable 
ESTs 

small inducible cytokine subfamily A (Cy 
GPl-anchored metastasis-associated prote 
EST s, Weakly similar to A36036 cytochrom 
J^hemopexin.PeptidaseJ^IO.heniopexin.Pehypothetical protein DKFZp761D112 
ig.LRRCT, DKFZP564I1922 protein 

.SS.TM ESTs 

,SS,HMG_box, Homo sapiens, clone IMAGE:3351295. mRNA 

,SS,TM,SH2,PH,pkinase 1 RecepJ^domain,Fun^rb-b2 avian erythroblastic leukemia v 

HER2 receptor tyrosine kinase (c-erW>2 ( 
serum-lndudbie kinase 
hypothetical protein FLJ20093 
EST 

oxidised low density lipoprotein (lectin 
ublquffln specific protease 18 
ESTs 
cycfinBI 

ESTs, Weakly similar to ALU5JWMAN ALU S 
K1AA0990 protein 

Homo sapiens cONA FU 10561 fis, done NT 
parathyroid hormone receptor 2 



pkinase,POtOJbox l SS,pWnase,POlO_box. 
.SS.tg.pkinase.LRRNT.LRRCT, 
,SS,TM,KRAB,SCAN,zf^2H2,ig 
,SS,TM 

SS ( UCH-1.UCH-2 f SS,TM,Gjlu.transpept 
SS 

cydln,SS,TM,cyclin, 

SS 

SS 

SS 

SSJM l 7tnU,SS,TM,7trrL2 



JM,ABC_niembrane^BC_tran,RibosomatS4eCbp/p3WWnte transactivator, wit 
,SS,TM,1ransporU)rot,SWIB,RhoGAP ( DAG_PEglycerd-3-phosphate dehydrogenase 1 (so 
,SS,rrm,PABP,pkinase,14-3-3,rrm poly(A)-binding protein, cytoplasmic 1-1 

,SS,TM.mitojarr,Lysyl_oxidase lysyl ox!dase-Me 2 

SS,START,SS,START,NNMTJ :> NMT_TEMT I steroidogenic acute regulatory protein r 
SS4iistone ( Calc_CGRP - IAPP,ig.MHCJ,SPRY^2A histone family, member L 



SS.HMGJwx, 
histone.SS.histone.BolA 
ATP-synLab.$S,7trrLlATP*ynLab 
filamentSS.filament.filament 
TM 

,SS,TM,RibosomaLL27e, 

.Kunltz^BPTI, 

SS,p450,SS 

SS 

SS.IL8. 

SH2.STAT.SS.STAT 
.pkinase, 

RhoGEF,PH,SS,RhoGEF,PH, 

SAP.SS.FG-GAP.vwa 

SS^ushi.trypsln.vwajrm.fibrinogerLCfn 

,SS,Peptidase - S8P,PeptWase_S8,P 

,SS,TM 

.SS.Pribosyttran, 
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SRY (sex determining region Y)-box 9 (ca 
H2A histone family, member O 
ATPase, H transporting, lysosomal (vacuo 
NM.005557*:Homo sapiens keratin 16 (foca 
retinoic add receptor responder (tazaro 
ESTs 

tissue factor pathway inhibitor 2 
cytochrome P450, subfamSy JIB (phenobar 
ESTs. Weakly similar to S65824 reverse t 
small inducible cytokine subfamily B (Cy 
signal transducer and activator of trans 
COC2-related protein kinase 7 
NMJW3371*:Homo sapiens vav 2 oncogene ( 
protein Inhibitor of activated STAT3 
B-factor, properdin 
ESTs 

Bpoma HMGIC fusion partner-flke 2 
siafyltransferase 
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Hs.21342 
Hs.339665 



Hs.164680 
Hs.280380 
Hs.134981 



Hs.57664 . 
Hs.133483 



445462 AA378776 Hs.288649 
439452 AA918317 Hs.57987 
452017 AF109302 Hs.27495 
409099 AK000725 Hs.50579 
452106 A1141031 
447519 U46258 
426928 AF037062 Hs.172914 
438825 BE327427 Hs.79953 
414575 H11257 Hs.22968 
417837 AL079905 Hs.1103 
422128 AW881145 
445941 AI267371 Hs.172636 
429973 AI423317 
444542 AI161293 
459561 AI547306 
425741 AF052152 Hs.159412 
426501 AW043782 Hs.293616 
455508 AA502764 Hs.123469 
434228 Z42047 Hs.283978 
415752 BE314524 Hs.78776 
400419 AF084545 
439750 AL359053 
423858 AL137326 
428514 AW236861 Hs.193139 
428698 AA852773 Hs.334838 
448988 Y09763 Hs.22785 
432072 N62937 Hs.269109 
417433 BE270266 Hs.82128 
452194 AI694413 Hs.332649 
444051 N48373 Hs.10247 
420042 AW01514O Hs.161723 
457292 AI921270 Hs.334882 
421458 NM_003654Hs.104576 
431104 AW970859 Hs.313503 
443767 BE562136 Hs.9736 
419589 AW973708 Hs.201925 
415447 297171 Hs.78454 
443464 BE548446 Hs.5167 
423431 AA326062 
413278 BE563085 Hs.833 
458451 AW297181 Hs.195922 
440449 AA885430 Hs.201925 
413753 U17760 Hs.75517 
434876 AF160477 Hs.61460 
435575 AF213457 Hs.44234 
415773 R21651 Hs.324725 
446440 AV658411 Hs.42656 
450847 NM_003155Hs.25590 
426075 AW513691 Hs.270149 
452110 T47667 Hs.28005 
439963 AW247529 Hs.6793 
402837 NA 

439451 AF086270 Hs.278554 
406664 L34041 Hs.9739 
417315 A1080042 Hs.336901 
413011 AW068115 Hs.821 
414987 AA524394 Hs^94022 
429197 H24471 Hs.26930 
' 448030 N30714 Hs.325960 
407604 AW191962 Hs.249239 
419092 J05581 Hs.89603 
456672 AK002016 Hs.1 14727 
443171 BE281128 Hs.9030 
452256 AKD00933 Hs^8661 
432201 AI538613 Hs.298241 
406642 AJ245210 
400903 NA 



SS.SS 

ss.ss 

ss 

ss 

ss 

ss 

.SS.adlushortTGF-beta.TGFbjjropeptide 
,SS 1 TM,hIstoneANF.rBceptor l guanylate_cy 
.SS^kinase.ig, 

SS.TGFbjj/opepfide.TGF-bela.SS 
SS 

SS,SS,Iipoxygenase,PLAT 
,SS,T-box,UDPGT 

SS,SS.Peptidase^M1 r EGFJg l tecfirLC l sushI 
SS 

.pkinase, 

SS 

SS 

.ssjMjtmj 

TM 

,SS,Peptidase_M1, 

JM/integiiiuB ( Ricin^B_tectln.mTi 

.SS.TM 

.SS.START.NNMTJ'NMT^TEMT, 
SS 

SS.TM.TM 
.Ssma,^, 

SS,TM,LRRCT,IKRNT,U*R,TM.LRRCT, 
) SS l TM < 7frTL3^NF.receptor l sushi 
.SS.ig, 
SS CUB 

SS.TM.SSJM.G-patch 
SS 

,$ema,ig, 

,SS,PCI,RasGEF.horniorfljrec f zf-C4, 
.FGF, 

SS.OLF.OLF.OLF^Ibosomal^ 

SS,TM.SSF,SS.TM 

.SS.p450.p450 

.SS.TM.ubiquitJn.lamlniruG.laminuuEGF.k 
,SS,RlbosomaLL14 
,FGF, 



hypothetical protein MGC3077 

B-cell CLUlymphoma 1 1B (zinc Onger pro 

prostate cancer associated protein 7 

hypothetical protein FU20718 

ESTs 

ESTs 

retino) dehydrogenase 5 (1 1-cis and 9-d 
ESTs 

Homo sapiens clone IMAGE:451939. mRNA se 
transforming growth factor, beta 1 
gb:QV0OT0033-01040T>182^07 OT0033 Homo 
ESTs 
ESTs 



ESTs 

Homo sapiens clone 24628 mRNA sequence 
ESTs 

ESTs. Weakly similar to AF208855 1 BM-01 
Homo sapiens PR02751 mRNA, complete cds 
putative transmembrane protein 
Target 

Homo sapiens mRNA full length Insert cON 
Homo sapiens mRNA; cDNA DKFZp434B0650 (f 
ESTs 

K1AA1866 protein 

gamn^a-aminobutyric acid (GABA) A recepto 
ESTs 

5T4 oncofetal trophoblasl glycoprotein 
olfactory receptor, family 2, subfamily 
activated leucocyte ceil adhesion motecu 
ESTs 

hypothetical protein FU14251 
carbohydrate (keratan sulfate Gal-6) sul 
ESTs 

proteasome (prosome, macropain) 26S subu 
Homo sapiens cDNA FU13446 fis, done PL 
myocilin, trabecular meshwork inducible 
Homo sapiens mRNA; cDNA DKFZp434F152 (fr 
gb:EST29171 Cerebeflum ll Homo sapiens c 
interferon-stimulated protein, 15 kDa 
ESTs 

Homo sapiens cDNA FU13446 fis, done PL 



SS.iaminh^GFJamlnirU^ltemi.adh_short,S laminin, beta 3 (nicein (125kD), kalinln 



.SS.HLH 
SS.Ig.SS ( TM 

i SS,TM ( RibosomaLS3Ae.G-gamma 
SS 

.SS.homeobox, 
,SS,m3. 

,SS,TM,Activin_recp 1 pklriase 

,TM,p450.Ets 

SS 

.SS.Chromojshadow.chromo, 



lg superfamily receptor LNIR 
triggering receptor expressed on myeloid 
ESTs, Moderately similar to A47582 B«cel 
K1AA1681 protein 
stanniocakan 1 

ESTs, Weakly similar to 2109260A B cell 
Homo sapiens cDNA FU11309 fis. done PL 
platelet-acfivating factor acetyihydrola 
ENSP00000241312*:DJ947L8.1.8 (novel Sush 
heterochromatin-like protein 1 



,SS,TM.transporUmtSWIB,RhoGAP,DAG„PEglycero^ 1 (so 

> SS,RNAj»U,RNAjwLA2.Rto protein S24 



SS.LWUWNT.SS.LRRNT.LRR. 
l SS,connexin^ornwnejec^f-C4,connexin 
.SS.GelsoBn. 
,SS,TM 
,SS,TM.C1q, 
SS.TM.SEA, 
.SS.PK.PK 
SSJM.7tnv.1.rrm.SS 
,TM,GDl,7tm_1. 

SS.TM.trypsin.SS.TM.trefofl.trypsln.tref 

SS 

SS 
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bigtycan 
hypothetical protein RJ14950 
ESTs, Weakly simflar to T20272 hypotheti 
membrai^panning 4 -domains, subfamily A 
collagen, type Vlll, alpha 2 
mucin 1, transmembrane 
Homo sapiens, done MGC:16327, mRNA, com 
TONDU 

Homo sapiens cDNA FU10071 fts, clone HE 
Transmembrane protease, serine 3 
gbflomo sapiens mRNA for immunoglobulin 
Target Exon 
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434408 AI031771 Hs.132586 .SS.GlycoJiydroJ ESTs 3.1 

452994  AW962597  Hs.31305  SS,WD40,SS,WD40  KIAA1547 protein  3.1
445903  AI347487  Hs.132781  fn3,SS,TM,EF1BD  class I cytokine receptor  3.1
424364  AW383226  Hs.201189  SS  ESTs, Weakly similar to G01763 atrophin-  3.1
410196  AI936442  Hs.59838  UBACTjepeatSS,UBACTjepeatThiF_^  3.1
419150  T29618  Hs.89640  TM,pkinase,fn3  TEK tyrosine kinase, endothelial (venous  3.1
433417  AA587773  Hs.8859  SS,SRCR  Homo sapiens, Similar to RIKEN cDNA 5830  3.1
418624  AI734080  Hs.104211  Sema,ig  ESTs  3.1
436291  BE568452  Hs.5101  SS,abhydrolase  protein regulator of cytokinesis 1  3.1
411000  N40449  Hs.201619  SS  ESTs, Weakly similar to S38383 SEB4B pro  3.1
412519  AA196241  Hs.73980  SS,Troponin,Hemagglutinin,SS,TM,C2,Tropo  troponin T1, skeletal, slow  3.1
450223  AA418204  Hs.241493  SS,pro_isomerase  natural killer-tumor recognition sequenc  3.1
422790  AA809875  Hs.25933  TM,histone,Sec1,histone,sugar_tr  ESTs  3.1
424269  AW137691  Hs.199754  SS,TM,7tm_2,GPS  ESTs  3.1
435854  AJ278120  Hs.4996  SS,WD40  putative ankyrin-repeat containing prote  3.1
447388  AW630534  Hs.76277  SS,TM,rrm,oxidored_q6,oxidored_q6  Homo sapiens, clone MGC:9381, mRNA, comp  3.1
451631  R00866  SS  gb:ye79c02.s1 Soares fetal liver spleen  3.0
448105  AW591433  Hs.298241  SS,TM,trefoil,trypsin,trefoil  Transmembrane protease, serine 3  3.0
438637  BE500941  Hs.126730  TM,PH  ESTs, Weakly similar to KIAA1214 protein  3.0
423024  AA593731  Hs.325823  SS,TM,CD36  ESTs, Moderately similar to ALU5_HUMAN A  3.0
456592  R91600  SS,Ran_BP1,LIM,Ran_BP1,GRIP,TPR,LIM  gb:yq10c01.r1 Soares fetal liver spleen  3.0
425920  AL049977  Hs.162209  SS,TM,SS,TM,PMP22_Claudin,PMP22_Claudin  claudin 8  3.0
444670  H58373  Hs.332938  SS,TM  hypothetical protein MGC5370  3.0
401197  arf,arf  ENSP00000229263*:HSPC213.  3.0
437755  AW204256  Hs.291887  wnt  ESTs  3.0
452560  BE077084  Hs.336432  SS,rrm,zf-RanBP,pkinase,C2,pkinase_C,DA  ESTs  3.0
410274  AA381807  Hs.61762  SS,SS  hypoxia-inducible protein 2  3.0
450098  W27249  Hs.8109  SS  hypothetical protein FLJ21080  3.0
404826  SS,TM  Target Exon  3.0
458389  H70284  Hs.160152  SS,RA  ESTs, Weakly similar to FPHU alpha-fetop  3.0
408196  AL034548  Hs.43627  HMG_box,pWnase^CHC^SJM^MG_pox  SRY (sex determining region Y)-box 22  3.0
433675  AW977653  Hs.75319  SS,ribonuc_red_sm  ribonucleotide reductase M2 polypeptide  3.0
418848  AI820961  Hs.193465  death,ZU5,pkinase,Activin_recp  ESTs  3.0
422095  AI868872  Hs.282804  SS,Cu-oxidase,SS,Cu-oxidase,Cu-oxidase  hypothetical protein FLJ22704  3.0
415992  C05837  Hs.145807  SS,TM  hypothetical protein FLJ13593  3.0
424631  AA688021  Hs.179808  SS  ESTs  3.0
409956  AW103364  Hs.727  SS,TGF-beta,TGFb_propeptide,SS,TGF-beta  inhibin, beta A (activin A, activin AB a  3.0
419667  AU077005  Hs.92208  SS,disintegrin,Reprolysin,Pep_M12B_prope  a disintegrin and metalloproteinase doma  3.0
450946  AA374569  Hs.127698  SS  ESTs, Moderately similar to 2109260A B c  3.0
447770  AB032417  Hs.19545  Frizzled,Fz,SS,TM,Frizzled,Fz  frizzled (Drosophila) homolog 4  3.0
423826  U20325  Hs.1707  SS,SS  cocaine- and amphetamine-regulated trans  3.0
418838  AW385224  Hs.35198  SS,TM,Phosphodiest  ectonucleotide pyrophosphatase/phosphodi  3.0
442804  AW300118  Hs.131257  SS,TM,G-gamma  ESTs  3.0
432284  AA532807  Hs.105822  SS,TM,pkinase  ESTs  3.0
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TABLE 20A 

Table 20A shows the accession numbers for those pkeys lacking UniGene IDs for Table 20.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.




Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers


Pkey    CAT number  Accessions
410785  1221055_1   AW803341 AW803265 AW803403 AW803466 AW803402 AW803413 AW803268 AW803396 AW803334 AW803355
411667  1253334_1   BE160198 AW935898 T11520 AW935930 AW856073 AW861034
418636  177402_1    AW749855 AA225995 AW750208 AW750206
420854  197072_1    AW296927 AJ684514 AI263168 AA281079
422128  211994_1    AW881145 AA490718 M85637 AA304575 T06067 AA331991
423431  228162_1    AA326062 AA325758 AW962182
423945  233566_1    AA410943 AW948953 AA334202 AA332882
426878  273265_1    BE069341 AW748403 AL044891 AI908240 AA393080
433687  373061_1    AA743991 AA604852 AW272737
447197  711623_1    R36075 AI366546 R36167
451631  878098_1    R00866 R01523 AI806815
456207  165078_-1   AA193450
456592  202684_1    R91600 T87079 AA291455
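
The Pkey-to-cluster mapping of Table 20A can be loaded into a lookup structure. The following is a minimal sketch, not part of the original disclosure. It assumes each record is written on one line as Pkey, CAT number, then one or more accessions separated by whitespace, and it assumes the CAT numbers read as `177402_1` and `228162_1` (the source scan is garbled at that position). The function names `parse_cluster_rows` and `pkeys_for_accession` are hypothetical.

```python
def parse_cluster_rows(rows):
    """Map each Pkey to its CAT number and its list of Genbank accessions.

    Assumes one record per line: Pkey, CAT number, then the accessions.
    """
    table = {}
    for row in rows:
        fields = row.split()
        if len(fields) < 3:
            continue  # skip blank or incomplete rows
        pkey, cat_number, *accessions = fields
        table[pkey] = {"cat": cat_number, "accessions": accessions}
    return table


def pkeys_for_accession(table, accession):
    """Return the sorted Pkeys whose cluster contains the given accession."""
    return sorted(p for p, rec in table.items() if accession in rec["accessions"])


# Two illustrative rows (CAT numbers are an assumed reading of the scan).
rows = [
    "418636 177402_1 AW749855 AA225995 AW750208 AW750206",
    "423431 228162_1 AA326062 AA325758 AW962182",
]
clusters = parse_cluster_rows(rows)
```

With the two rows above, `clusters["423431"]["accessions"]` holds the three accessions of that cluster, and `pkeys_for_accession(clusters, "AA225995")` finds the probeset designed from the cluster containing that EST.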
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TABLE 20B 



Table 20B shows the genomic positioning for those pkeys lacking UniGene IDs and
accession numbers in Table 20. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also
listed.

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.



Pkey    Ref      Strand
400608  9887666  Minus
400903  2911732  Plus
401045  8117619  Plus
401093  8516137  Minus
401197  9719705  Plus
401747  9789672  Minus
401780  7249190  Minus
401781  7249190  Minus
401866  8018106  Plus
402230  9966312  Minus
402408  9796239  Minus
402578  9884928  Plus
402606  9909429  Minus
402837  9369121  Minus
403199  9958183  Minus
403329  8516120  Plus
403593  6862650  Minus
403943  7711864  Plus
404091  7684554  Minus
404347  9838195  Plus
404826  6572184  Plus
405366  2182280  Plus
405654  4895155  Minus

Nt_position



96756-97558 
59112-59228 

90044-90184,91111-91345 
22335-23166 

118591^ 

131932,132451-132575,133580-134011 

28397-28617,28920-29045,29135-29296,29411-29567,29705-29787,30224-30573

83215-83435,83531-83656,83740-83901,84237-84393,84955-85037,86290-86814

73126-73623 

29782-29932 

110326-110491 

66350-66496 

81747-82094 

201^2186.9570-9758,11136-11^19429-19677^^ 



96450-96598 

62554-62712,69449-69602 

100742-100904,101322-101503 

82121-83229 

74493-74829 

47726-48046 

22478-22632 

53624-53759 
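
Each Nt_position entry above is a comma-separated list of start-end ranges, one range per predicted exon. As a minimal sketch (an illustration, not part of the original disclosure), such an entry can be parsed into coordinate pairs and a total exon length. The helper names are hypothetical, and treating both endpoints as inclusive is an assumption, since the table does not state the coordinate convention.

```python
def parse_nt_position(text):
    """Parse an entry such as '90044-90184,91111-91345' into (start, end) pairs."""
    exons = []
    for part in text.split(","):
        start, end = part.strip().split("-")
        exons.append((int(start), int(end)))
    return exons


def total_exon_length(exons):
    """Sum exon lengths, assuming both endpoints are inclusive (an assumption)."""
    return sum(end - start + 1 for start, end in exons)


exons = parse_nt_position("90044-90184,91111-91345")
length = total_exon_length(exons)
```

Entries that are truncated or garbled in the scan will raise a `ValueError` here rather than being silently guessed, which seems the safer behaviour for damaged coordinates.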






WO 02/059377 



PCT/US02/02242 



TABLE 21: 210 GENES UP-REGULATED IN BREAST CANCER
COMPARED TO NORMAL ADULT TISSUES THAT ARE
LIKELY TO ENCODE EITHER ENZYMES OR PROTEINS
AMENABLE TO MODULATION BY SMALL MOLECULES

Table 21 shows 210 genes up-regulated in breast cancer compared to normal adult tissues
that are likely to encode either enzymes or proteins amenable to modulation by small
molecules. These were selected as for Table 19, except that the ratio was greater than or equal
to 3.0, the "average" normal adult tissue level was set to the 85th percentile value amongst
144 non-malignant tissues, the 96th percentile value amongst 73 breast cancers was
greater than or equal to 80 units, and the predicted protein contained a structural domain that is
indicative of enzymatic function or of being modulatable by small molecules (e.g. pkinase,
peptidase, phosphatase, ATPase, or ion_transporter domains). The predicted protein domains
are noted.
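
The selection criteria stated above can be sketched as a simple filter. This is an illustration under stated assumptions, not the procedure actually used: the Table 19 criteria referred to are not reproduced here, the nearest-rank percentile definition is assumed, and domain matching by substring is assumed. The tumor percentile is left as a parameter because the paragraph above gives the 96th percentile while the R1 legend names the 93rd. The names `percentile` and `select_probeset` are hypothetical.

```python
import math


def percentile(values, pct):
    """Nearest-rank percentile of a non-empty sequence (pct between 0 and 100)."""
    ordered = sorted(values)
    rank = max(1, math.ceil(pct / 100 * len(ordered)))
    return ordered[rank - 1]


def select_probeset(tumor_levels, normal_levels, domains,
                    tumor_pct=96, normal_pct=85,
                    min_ratio=3.0, min_tumor_level=80.0,
                    druggable=("pkinase", "peptidase", "phosphatase",
                               "atpase", "ion_trans")):
    """Return (selected, ratio) for one probeset.

    tumor_levels / normal_levels: expression values per sample.
    domains: predicted protein domain names for the probeset.
    """
    tumor_high = percentile(tumor_levels, tumor_pct)
    normal_ref = percentile(normal_levels, normal_pct)
    if normal_ref <= 0:
        return False, None  # ratio undefined; assumed to be excluded
    ratio = tumor_high / normal_ref
    # Assumed rule: a domain qualifies if its name contains one of the keywords.
    has_domain = any(key in d.lower() for d in domains for key in druggable)
    selected = (ratio >= min_ratio
                and tumor_high >= min_tumor_level
                and has_domain)
    return selected, ratio


# Synthetic illustration only: 73 tumor samples and 144 normal samples.
tumor = [10.0 * i for i in range(1, 74)]
normal = [50.0] * 144
selected, ratio = select_probeset(tumor, normal, ["SS", "pkinase"])
```

The three tests (ratio, absolute tumor level, domain class) are kept separate so that any one threshold can be changed without touching the others.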



Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: Ratio of 93rd percentile tumor to 85th percentile of normal body tissue
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30 






Pkey ExAccn UnigeneID Predicted Protein Domains



449746 
400292 
424735 
407178 
408045 
450375 
429170 
445730 
424634 
420757 
424399 
447350 
456207 
456938 
402578 
425692 
424001 
418007 
421727 
411869 
400289 
443348 
424086 
400295 
408771 
421155 
424905 
438167 
459583 
423945 
445263 



AI668594 Hs.176588 
AA250737 Hs.72472 
U31875 Hs.272499 
AA195651 Hs,104106 
AW138959 Hs.245123 
AA009647 Hs.8850 
NM_001394Hs.2359 
A1624342 Hs.170042 
NM_003613Hs.151407 
X78592 Hs.99915 
A1905687 Hs.2533 
AI375572 Hs.172634 
AA1 93450 

X52509 Hs.161640 

D90041 Hs.155956 
W67883 Hs.137476 
M13509 Hs.83169 
Y13153 Hs.107313 
W20027 Hs.23439 
X07820 Hs.2258 
AW873596 Hs.182278 
AI351010 Hs.102267 
W72838 Hs.2533 
AW732573 Hs.47584 
H87879 Hs.102267 
NMJX)2497Hs.153704 
R28363 Hs.24286 
AI907673 
AA410943 

H57646 Hs.42586 



SS.D450 

deafo,ZU5JM,Activin_recp,pkinase l 
SS.TM 

SS.Dihydroorotase, 
PhosphodlestSomatomediruB, 

SS,TM l disintegrin,Pep^M12Bjjropep,Reprol a dlsintegrin and metafloproteinase doma 



UnigeneTitle 

ESTs, Weakly similar to CP4Y_HUMAN CYTOC 
BMP-R1B 

short-chain alcohol dehydrogenase family 
ESTs 
ESTs 



DSPcRhodanese, 
SSJM.Catfon^enlux 
fc,tspJ,S$,AAA 

hormon8_recAndrogen_recep,zf-C4 l 
SS 

pkinase, 

SSJM.p450.p450 



dual specificity phosphatase 4 
ESTs 



nu 



androgen receptor (dihydrotestosterone r 
aldehyde dehydrogenase 9 family, member 
ESTs 

gb2r40e07j1 Soares_NhHMPu_S1 Homo sapi 



SSJM,ardnotranJ_2,Cadr^nj;jerm ) cadhtyro^^ 
SS.p450.SSJM.p450 C1001134:gi|2117372|pir||l65981 fatty ac 

SS^cetyltransG, N^cetyltransferase 1 (arylamine N-acety 

Pkinase. paternally expressed 10 

SS,hemopexin,Pepfidase^M10,SS l PepMasejriatnx metafloproteinase 1 (MMP1; Inters 

kynurenine 3-monooxygenase (kynurenine 3 
SS,Pepiidase_M1 I ESTs 

herropexin,PeptHaseJrf10 f SS^^ melalloproteinase 10 (MMP10; str 

SS.DENN caimodurin 2 (phosphorylase kinase, delt 

SS.LysyLoxidase lysyl oxidase 

aldehyde dehydrogenase 9 family, member 
TM,K_tetrajon_trans,SS JMX_tetra,ion_t potassium voltage-gated channel, delayed 
SS,LysyLcxtoe>Udose_epim,Epinierase,S fysyl oxidase 
pWnase.SSJM.pldnase.polyprenyLsynt NIMA (never in mitosis gene ablated k 
SSJM.7tnu1,p450,rrm ESTs 

P*"* 8 * gb:ILeT152-08G399404 BT152 Homo sapien 

toth2U5.TM^ctivin_recp i pkinase ) gb:zt32hD3.r1 Soares ovary tumor NbHOT H 
SS^cyltransferase, KIAA1560pratefo 
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R1 

65.7 
55.9 
53.8 
39.3
34.5
25.7 
24.5
24.1 
21.7 
21.7 
20.3
19.2 
18.3 
18.1 
17.8 
16.7 
16.5 
15.7 
15.3 
13.9 
13.5 
13.0 
12.8 
12.7 
11.9 

m 

11.7 

11.5

11.5 
11.4 
11.2 
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423887 AL080207 Hs.134585 
429859 NM_007050Hs.225952 
428368 BE440042 Hs.83326 
418912 NH.000685HS.89472 
451952 AL120173 Hs.301663 
.402408 NA 

.415539 AI733881 Ks.72472 
406687 M31126 Hs.272620 
419948 AB041035 Hs.93847 
400235 NA 

408380 AF123050 HsM532 
409203 AA780473 Hs.687 
424902 NM.003866HS.153687 
431725 X65724 Hs.2839 
418092 R45154 Hs.106604 
427811 M81057 Hs.180884 
423554 M90516 Hs.1674 
426261 AW242243 Hs.168670 
413374 NH_001034Hs.75319 
432677 NWL004482HS.278611 
456986 D38299 ' Hs.170917 
453331 AI240665 
407721 Y12735 
418004 U37519 
410555 U92649 
443695 AW204099 Hs.337720 
423545 AP000692 Hs.129781 
401045 

C11001883 # gil6753278|refINPJ33938. , 1| c 
442082 R41823 HsJ413 TM,EPrlMpWnase,SAM,m3. 



Hs.38018 
Hs.87539 
Hs.64311 



418836 AI655499 Hs.161712 
404091 NA 

450865 AI248013 Hs.106532 
424085 NH.002914HS.139226 
449650 AF055575 Hs.23838 
432304 AA932186 Hs.69297 
415669 NML005025HS.78589 
442942 AW167087 Hs.131562 
428795 R45503 Hs.97469 
426479 Y00272 Hs.184572 
400300 X03363 
447359 NM_012093Hs.18268 
402230 NA 

424687 J05070 Hs.151738 
432328 AI572739 Hs.195471 
432239 X81334 Hs.2936 
400286 NA 

425247 NM005940Hs.155324 
434737 AA828246 Hs.291884 
439310 AF086120 Hs.102793 
441111 A1806867 Hs.126594 
452355 N54926 Hs.29202 
427711 M31659 Hs.180408 
429353 AL1 17406 Hs.200102 
425325 X52730 Hs.1692 

phenyieffianobmine N-metnyitransferase 
448706 AW291095 Hs.21814 SSJM/pkinase, 
403593 NA . 

432777 AA564991 Hs.269477 
446232 AI281848 Hs.194691 
411165 NM_000169Hs.69089 
414117 W88559 Hs.1787 
411096 U80034 Hs.68583 
450506 ML0Q4460HS.418 
417975 AA641836 Hs.30085 
447752 M73700 Hs.105938 
427122 AW057736 Hs.323910 



SSJMBRCT.ank.ABCJfan.ABCJ/an DKFZP434G232 protein 
SS,TM,Y_phosphalase,MAM,fn3, protein tyrosine phosphatase, receptor t 

SS.PepUdase_M10,r«mopexin,SSPeptidase^rnatrix metalioproteinase 3 (stromelvsin 
SS.TM.7tnu1 l SSJM,7tm_1, angiotensin receptor 1 

SS,pktnase, ESTs 

f^^ n ^ rase . NH.030920*:Homo sapiens hypothetical pro 

deaft,ZU5 l TM,Adivin_recp,pKinase, BMP-R1B 
SS.PeptWase_M10 # hemopexin,SS l Peptidasejnatrix metalioproteinase 11 (MMP11; stro 
Ferricjeduct, Werrfc_reduct, NMJ>16931:Homo sapiens NADPH oxidase 4 1 

TM^BC.tranABCjnembrane, Eos Control 

SSJM,ubiquifin7trrL3^Fjeceptowushi dlubkjuitfn 
SS.p450,SS,p450 cytDchrome P450, subfamily IVB, polypept 

fS.SS inositol polyphosprate^-prwsphatase, ty 

SS,Cys_knot,SS Nome disease (pseudogEoma) 

death,ZU5,pkinase^ctivin,recp 1 ESTs 
SS,ZrLcaroOpept.PropepJM,SS,Pro^ B1 (tissue) 

GATase_2,SlS, g!utamine4ructose^6-phosphate transamln 

SS,TM.WD40,ubiquitIn,E1-E2 - ATPase.Cation peroxisomal famesytated protein 
S$ ribonucleotide reductase M2 polypeptide 

SSJM,Glycos_transURIdn_BJedin t DPP UDP^-acetyt-alpha-D^alactosamine^olyp 
^'3^f m - 1, prostaglandin E receptor3 (subtype EP3) 

SS.TM # dlslntegrln,Pep_M12B_propep.Reprol ESTs 

P^f- dual-specificity tyrosine-(YhJhosphoryl 

SS.aldedh^.akJedh, aldehyde dehydrogenase 3 family, member 

TM^disinlegrHReprolysin, a disintegrin and metalioproteinase doma 

„ Af? ESTs. Weakly similar to AF126780 1 refin 

GAF,PDEase chromosome 21 open reading frame 5 

ICEj)20,SSJCE4)10,lCE_p20,ICEj)10 l ICE_p 

ESTs; caisyntentn-2 
ESTs 

Target Exon 

ESTs, Weakly similar to 138588 reverse t 
replication factor C (activator 1) 2 (40 
calcium channel, voltage-dependent, L ly 
ESTs 

serine (or cysteine) proteinase inhibito 
ESTs 

ESTs, Highly similar to A39769 N^acetyli 
ceD division cycle 2, G1 to S and G2 to 



TM,Activin_recp,pkinase,death,ZU5, 
TM,7tm_3 ( ANF - receptor, 
zf-02H2 

SSMA.VffaLheiicase1.rrm, 
TM4on_trans,SS.TM,ion_trans l 
TM,7tmJ, 
SS^erpin, 

SS.ig.Sema.pkinase, 
SS.TM 

SS,pkinase,pkinase 



oo.p™ idae.pwnasa ceu division cycle 2, G1 to S and G2 to 

SS,TM,pWnase.Re(»p.L_domaln l SH2PH,FuriHER2 receptor tyrosine kinase (c-erb-b2. 
SS,adenylatekinase, adenylate kinase 5 

SS ( TM,p450, TargetExon 
SS < PeptWase_M10,fr^ l temopexin l SS.TM,Pepmatrix metalioproteinase 9 (gelatinase B 
6PF2K.PGAM. e^hosptofro^-Wnase/frw^^.e-bi 
SS.PeptidaseJ^IO.rtemopexin.SS.Peptidasejnatrix metalioproteinase 13 (cotlagenase 
SS,TM^BC_tranABC_membrane,SS CI 6000922^1)74991 03b3lrllT20903 hypothe 
^Peptidase _M10,hemopexin,SS matrix metafloprotefoase 1 1 (MMP11; stro 

UCH-1,pkiriase l OPR,Rhodanese^4)indlng,ESTs 
SS,TM,UDPGT,caseinJappa ESTs 
SS.TM^hosphodiest, ESTs 
TM,7tm_1,TM G protein-coupled receptor 34 

2-«™ solute carrier family 25 (mitochondrial 

SS,TM,ABCJraji,ABC jnembrane, ATP-binding cassette transporter MRP8 

SS,NNMT_PNMT_TEMr,SS^NMT_PNMT TEMT.STAR 

5.1 

r : v 7 - - — . interieukin 20 receptor, alpha 

CIDE-N,pkinase TargetExon 
alpha^amylase. ESTs 
SS,TM ( 7trrL.3,RIbosomaLL13 retinoic add Induced 3 

SS,Melibfase,BTK,PH,pkmase,SH2,SH3,Ribo galactosidase, alpha 
TM.^t/ans.KJetra, proteoBpid protein 1 (Pelizaeus-Merzbac 

oo^fr^' mrtochondrial Intermediate peptidase 

SS l DPPJV^term,Peptkfase - S9,SS,DPPJV^&roblast activation protein, alpha 
f |.JWsIn hypothetical protein FU23186 

SS,transfemn < 7tm_1 .transferrin, lactotransferrin 
SS l T^pkinase^ecep^o^rr^ l SH2,PH/unT€R2 receptor tyrosine kinase (c-erb-b2, 
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10.9 

10.4 

10.3 

10.3 

10.3 

9.8 

9.4 

9.1 

9.1 

8.8 

8.6 

8.3 

8.3 

8.3 

8.3 

8.2 

8.1 

7.8 

7.6 

7.6 

7.5 

7.3 

7.2

7.2 

7.1 

6.9 

6.8 



6.7 
6.6 
6.6 
6.5 
6.5 
6.4 
6.2 
6.2 
6.2 
6.1 
6.1 
6.1 
6.1 
6.1 
5.8 
5.8 
5.6 
5.6 
5.4 
5.4 
5.2 
5.2 
5.2 
5.2 
5.1 



5.1 
5.1 
5.0 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.8 
4.8 
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400181 NA S$,TM,3Beta_HSD, ENSP00000171555:CDNA FU10727 8s, clone 4.6 

452093  AA447453  Hs.27860  SS,TM,7tm_1  Homo sapiens mRNA; cDNA DKFZp586M0723 (f  4.6
435542  AA687376  Hs.269533  SS,pkinase,RhoGEF,ig,PH,SH3  ESTs  4.6
417576  AA339449  Hs.82285  AIRS,formyl_transf,GARS,SS,GARS,AIRS,rorpto^  4.6
446089  AI860021  Hs.270651  pkinase  ESTs, Moderately similar to A47582 B-cel  4.6
424420  BE614743  Hs.146688  SS,TM,MAPEG  prostaglandin E synthase  4.5
452190  H26735  Hs.91668  SS,TM,PH,SH2,Furin-like,pkinase,Recep_L_  Homo sapiens clone PP1498 unknown mRNA  4.5
419986  AI345455  HsJ8915  pkinase,OPR  GA-binding protein transcription factor,  4.5
421582  AI910275  Hs.1406  SS,trefoil,SS,TM,ldl_recept_a,SRCR,tryps  trefoil factor 1 (pS2)  4.5
446733  AA863360  Hs.26040  SS,TM,p450  ESTs, Weakly similar to fatty acid omega  4.4
453060  AW294092  Hs.21594  SS,ras,Y_phosphatase,ras  hypothetical protein MGC15754  4.4
400205  NA  NM_006265*:Homo sapiens RAD21 (S. pombe)  4.4
420854  AW296927  SS,TM,Peptidase_M1  gb."Ul-H-BWr>ajc-(>O7-04Ji.s1 NCLCGAP^Su  4.3
432690  AF181490  Hs.278627  SS,pyr_redox,SS,Ribosomal_L39  prenylcysteine lyase  4.3
441560  F13386  Hs.7888  pkinase  Homo sapiens clone 23736 mRNA sequence  4.3
416445  AL043004  Hs.79337  SS,pkinase  KIAA0135 protein  4.3
439024  R96696  Hs.35598  SS,TM,trypsin,vwd,ig  ESTs  4.3
432882  NM_013257  Hs.279696  pkinase,pkinase_C  serum/glucocorticoid regulated kinase-li  4.2
447754  AW073310  Hs.163533  pkinase  Homo sapiens cDNA FLJ14142 fis, clone MA  4.2
453775  NM_002916  Hs.35120  SS,AAA,PI3_PI4_kinase,PI3Ka,PI3K_rbd,PI3  replication factor C (activator 1) 4 (37  4.2
431657  AI345227  Hs.105448  SS,TM,pkinase  ESTs, Weakly similar to B34087 hypotheti  4.1
427899  AA829286  Hs.332053  SS,SAA_proteins,ABC_membrane,ABC_tran  serum amyloid A1  4.1
422938  NM_001809  Hs.1594  SS,TM,thiolase  centromere protein A (17kD)  4.1
418478  U38945  Hs.1174  ank,ank  cyclin-dependent kinase inhibitor 2A (me  4.1
406922  S70284  SS,TM,Desaturase,SS  gb:stearoyl-CoA desaturase [human, adipo  4.1
439285  AL133916  Hs.172572  SS,ig,pkinase,LRRNT,LRRCT  hypothetical protein FLJ20093  4.1
429922  Z97630  Hs.226117  SS,linker_histone,7tm_1  H1 histone family, member 0  4.1
420139  NM_005357  Hs.95351  SS,TM,p450  lipase, hormone-sensitive  4.0
425071  NM_013989  Hs.154424  SS,T4_deiodinase,T4_deiodinase  deiodinase, iodothyronine, type II  4.0
424511  BE300512  Hs.193557  SS,Y_phosphatase,Band_41  ESTs, Moderately similar to ALU7_HUMAN A  4.0
428722  U76456  Hs.190787  SS,TIMP  tissue inhibitor of metalloproteinase 4  3.9
428330  L22524  Hs.2256  SS,Peptidase_M10,SS,Peptidase_M10,hemo  3.9
414831  M31158  Hs.77439  SS,cNMP_binding,RIIa,HMG_box  protein kinase, cAMP-dependent, regulato  3.9
413589  AW452631  Hs.313803  SS,abhydrolase  ESTs, Highly similar to AF157833 1 nond  3.8
418629  BE247550  Hs.86859  SS,SH2,PH,SS,TM,PH,SH2,Furin-like,pkinas  growth factor receptor-bound protein 7  3.8
413453  AA129640  Hs.128065  SS,Peptidase_C1,gpdh  ESTs  3.8
403943  p450,SS,p450  C5000355:gi|4503225|ref|NP_000765.1| cyt  3.8
444618  AV653785  Hs.173334  ELL-RELATED RNA POLYMERASE II, ELONGATIO  3.8
408761  AA057264  Hs.238936  SS,TM,7tm_1  ESTs, Weakly similar to (defline not ava  3.8
427609  M26380  Hs.180876  SS,lipase,PLAT,Sec7,PH  lipoprotein lipase  3.7
418203  X54942  Hs.83758  CKS,SS,CKS  CDC28 protein kinase 2  3.7
454071  AI041793  Hs.42502  TM,7tm_1  ESTs  3.7
424676  Y08565  Hs.151678  Glycos_transf_2,Ricin_B_lectin,SS,Glycos  UDP-N-acetyl-alpha-D-galactosamine:polyp  3.7
457465  AW301344  Hs.122908  SS,Pribosyltran,Sulfatase  DNA replication factor  3.7
417601  NM_014735  Hs.82292  PHD,pkinase,SS  KIAA0215 gene product  3.7
446619  AU076643  Hs.313  SS,TM,efhand,ion_trans  secreted phosphoprotein 1 (osteopontin,  3.7
406625  Y13647  Hs.119597  SS,TM,Desaturase,SS  stearoyl-CoA desaturase (delta-9-desatur  3.6
428769  AW207175  Hs.106771  SS,7tm_1,SPRY  ESTs  3.6
426310  NM_000909  Hs.169266  SS,TM,7tm_1  neuropeptide Y receptor Y1  3.6
417531  NM_003157  Hs.1087  SS,pkinase,vwa,vwa,Glyco_transfJ3  serine/threonine kinase 2  3.6
444781  NM_014400  Hs.11950  SS,PH,lactamase_B  GPI-anchored metastasis-associated prote  3.6
431493  AI791493  Hs.129873  SS,p450,p450  ESTs, Weakly similar to A36036 cytochrom  3.6
428966  AF059214  Hs.194687  cholesterol 25-hydroxylase  3.6
414175  AI308876  Hs.103849  TM,hemopexin,Peptidase_M10,hem  3.6
455325  AW895719  TM,ion_trans,K_tetra  gb:QV4^N0039-290300-154-fi)6 NN0039 Homo  3.6
429597  NM_003816  Hs.2442  a disintegrin and metalloproteinase doma  3.6
425320  U29344  Hs.83190  Acyl_transf,adh_zinc,ketoacyl-synt,pp  fatty acid synthase  3.5
431854  AA383550  Hs.271699  IMS,SS  polymerase (DNA directed) iota  3.5
407104  S57296  Hs.323910  SS,TM,SH2,PH,pkinase,Recep_L_domain,Fu  v-erb-b2 avian erythroblastic leukemia v  3.5
449051  AW961400  Hs.333526  SS  HER2 receptor tyrosine kinase (c-erb-b2,  3.5
434398  AA121098  Hs.3838  pkinase,POLO_box,SS,pkinase,POLO_box  serum-inducible kinase  3.5
454042  H22570  Hs.172572  SS,ig,pkinase,LRRNT,LRRCT  hypothetical protein FLJ20093  3.5
407756  AA116021  Hs.38260  SS,UCH-1,UCH-2,SS,TM,G_glu_transpept  ubiquitin specific protease 18  3.5
401464  AF039241  Hs.9028  Peptidase_M24  histone deacetylase 5  3.5
412970  AB026436  Hs.177534  DSP^KtyosirUaO  dual specificity phosphatase 10  3.5
412049  N53437  Hs.18268  SS,adenylatekinase  adenylate kinase 5  3.5
425776  U25128  Hs.159499  SS,TM,7tm_2,SS,TM,7tm_2  parathyroid hormone receptor 2  3.4
407846 
406925 
445873 
418054 
406815 
410530 
407021 
421168 
431473 
408101 
422083 
411393 
435767 
433068 
426928 
414575 
445941 
444542
425741 
434228 
433264 
400419 
439750 
417757 
452194 
421458 
443767 
422648 
423431 
451264 
452110 
439963 
453941 
406654 



AA426202 Hs.40403 
L34041 Hs.9739 
AA250970 Hs.251946 
NM_002318 Hs.83354
AA833930 Hs.288036 
M25809 Hs.64173 
U52077 

AF182277 Hs.330780 
AA825686 Hs.321176 
AW968504 Hs.123073 
NM_001141 Hs.111256
AW797437 Hs.69771 
H73505 Hs.117874 
NM_006456 Hs.288215
AF037062 Hs.172914 
H11257 Hs.22968 
AI267371 Hs.172636 
AI161293 Hs.280380 
AF052152 Hs.159412 
Z42047 Hs.283978 
D85782 Hs.3229 
AF084545 

AL359053 Hs.57664 



TM,ABCjru2rnbrane,ABCJran,Rto^ trarcactivator, wit 

SS,TMtnwsporU>rot,SWIB,RhoGAP l ^ dehydrogenase 1 (so 

/na m nn _i ■_ * a a o i — mnh«f AVkinriirMi nmfaln rvfrmlnsmir. 1 -I 



R19897 
AI694413 



Hs.106604 
Hs.332649 



NM_003654 Hs.104576
BE562136 Hs.9736 
D86983 Hs.118893 
AA326062 
AI768235 

T47667 Hs.28005 
AW247529 Hs.6793 
U39817 Hs.36820 
L34041 



Hs.9739 

3 5 phosphate dehydrogenase 1 (so 
- R31770 Hs.23540 
U77413 Hs.100293 



453487 
420911 
443171 
452256 
432201 
419150 
444443 
426283 
436291 
450223 
424269 
448105 
452560 



SS,mn,PABP,pkinase,14-3-3,rrm 
SS t TM.mito_carr,LysyLoxIdase 
SS.IPPT, 

ATP-synLab.SS.7tfiLUTP-synLab 

SS,p450,SS 
SS 

pkinase, 

lipoxygenase.PLAT, 

SS,sushI,t]ypsin,vwa.frm,fibf1nogen_C l fn 

SS,Peptidase.S8.P l Peptidase - S8,P 

SS,Pribosyltran, 

SS,adh_shortTGF^)ela ( TGFbj)ropeptlde 

SS.ptdnaseJg, 

SS.SS.fipoxygenase.PLAT 

SS,SS.Pep0dase_M1 pEGF.gJecfin.CSUshl 

pkinase, 

SS,TM,7tmJ 



BE281128 Hs.9030 
AK000933 Hs.28661 
AJ538613 Hs.298241 
T29618 Hs.89640 
AI149286 Hs.55099 
NM_003937 Hs.169139
BE568452 Hs.5101
AA418204 Hs.241493
AW137691 Hs.199754 
AW591433 Hs.298241 
BEG77084 Hs^36432 



SS.Peptidase.M1, 
TM.lntegfin_B l Ricln_BJectln,n7n 
death,ZU5,pkinase,Actwin_recp 1 
SS,WtmJ,ANF_receptor.sushi 
SS 

SS,PCI.RasGEF,rw>rmonejec,zf-C4, 
peroxidase,LRRCT, 
SS,p450,p450 
SSJrehalase 

SS,TM,Activin_recp,pkinase 
TM.p450.Ets 

SS,DEAD,HRDC,helicase_C, 
SS,TM.transpoitJ>rotSWlBWioGAP,DAG.PE 
3.1 

TM,7tmJ. 



poly(A)-binding protein, cytoplasmic 1-1
lysyl oxidase-like 2
tRNA isopentenylpyrophosphate transferas
ATPase, H transporting, lysosomal (vacuo
gb:Human mariner1 transposase gene, comp
cytochrome P450, subfamily IIB (phenobar
ESTs, Weakly similar to S65824 reverse t
CDC2-related protein kinase 7
arachidonate 15-lipoxygenase, second typ
B-factor, properdin
ESTs

sialyltransferase

retinol dehydrogenase 5 (11-cis and 9-ci
Homo sapiens clone IMAGE:451939, mRNA se
ESTs

aminopeptidase

Homo sapiens clone 24628 mRNA sequence
Homo sapiens PRO2751 mRNA, complete cds
cysteine dioxygenase, type I
Target

Homo sapiens mRNA full length insert cDN
ESTs

olfactory receptor, family 2, subfamily
carbohydrate (keratan sulfate Gal-6) sul
proteasome (prosome, macropain) 26S subu
Melanoma associated gene
gb:EST29171 Cerebellum II Homo sapiens c
gb:wg82g08.x1 Soares_NSF_F8_9W_OT_PA_P_S
Homo sapiens cDNA FLJ11309 fis, clone PL
platelet-activating factor acetylhydrola
Bloom syndrome



SS.TM.7tnu1.rrm.SS 
TM,GDI,7tmJ, 

SS.TM.trypsin.SS.TM.trefoB.trypsln.tref 
TM.pkinase.fn3. 
SS 

SS.abhydrolase, 
SS.proJsomerase, 
SS.TM.7(nU,GPS 
SS,TM.trefoil.trypsin,trefoil 
SS.rrm > 2^anBP,pWnase,C2 l pldnase_C^AG ESTs 



ESTs

O-linked N-acetylglucosamine (GlcNAc) tr
TONDU

Homo sapiens cDNA FLJ10071 fis, clone HE
Transmembrane protease, serine 3
TEK tyrosine kinase, endothelial (venous
rab6 GTPase activating protein (GAP and
kynureninase (L-kynurenine hydrolase)
protein regulator of cytokinesis 1
natural killer-tumor recognition sequenc
ESTs

Transmembrane protease, serine 3
TABLE 21A

Table 21A shows the accession numbers for those Pkeys lacking UniGene IDs in Table 21.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey CAT number Accessions

420854 197072.1 AW296927 AI684514 AI263168 AA281079
423431 228162.1 AA326062 AA325758 AW962182
423945 233566.1 AA410943 AW948953 AA334202 AA332882
451264 863988.1 AI768235 R31400 H29082 H23107
455325 1279475.1 AW895719 N31451 N41451
456207 165078.-1 AA193450
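As an illustration of how the Pkey, CAT number, and accession columns relate, the rows of Table 21A can be read into a lookup keyed by Pkey. The following is only a minimal sketch: the function name `parse_table_21a` and the whitespace-delimited row layout are assumptions made for this example, not part of the original disclosure.

```python
def parse_table_21a(lines):
    """Map each Pkey to its CAT number and its list of Genbank accessions.

    Assumes one row per line: Pkey, CAT number, then one or more accessions,
    all separated by whitespace (hypothetical layout for illustration).
    """
    table = {}
    for line in lines:
        fields = line.split()
        # Skip blank lines, headers, and anything not starting with a numeric Pkey.
        if len(fields) < 3 or not fields[0].isdigit():
            continue
        pkey, cat_number, *accessions = fields
        table[int(pkey)] = {"cat": cat_number, "accessions": accessions}
    return table


rows = [
    "Pkey CAT number Accessions",
    "420854 197072.1 AW296927 AI684514 AI263168 AA281079",
    "423945 233566.1 AA410943 AW948953 AA334202 AA332882",
]
lookup = parse_table_21a(rows)
```

With such a lookup, a Pkey that has no UniGene ID in Table 21 can still be traced to the sequences of its cluster, for example `lookup[423945]["accessions"]`.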
TABLE 21B

Table 21B shows the genomic positioning for those Pkeys lacking UniGene IDs and
accession numbers in Table 21. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also
listed.

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication
entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position

401045 8117619 Plus 90044-90184,91111-91345
402230 9966312 Minus 29782-29932
402408 9796239 Minus 110326-110491
402578 9884928 Plus 66350-66496
403593 6862650 Minus 62554-62712,69449-69602
403943 7711864 Plus 100742-100904,101322-101503
404091 7684554 Minus 82121-63229
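The nucleotide-position column of Table 21B packs one or more predicted exons into a single field, with exons separated by commas and each exon given as a start-end pair. A minimal sketch of how such a field could be unpacked is shown below; the function names are hypothetical, and treating both coordinates as inclusive is an assumption made here, since the table does not state the coordinate convention.

```python
def parse_exons(nt_position):
    """Split a field such as '90044-90184,91111-91345' into (start, end) pairs."""
    exons = []
    for span in nt_position.split(","):
        start, end = span.split("-")
        exons.append((int(start), int(end)))
    return exons


def total_exon_length(exons):
    """Sum exon lengths, assuming inclusive coordinates (an assumption)."""
    return sum(abs(end - start) + 1 for start, end in exons)


# Row for Pkey 401045 (Ref 8117619, Plus strand) from Table 21B.
exons_401045 = parse_exons("90044-90184,91111-91345")
length_401045 = total_exon_length(exons_401045)
```

Under the inclusive-coordinate assumption, the two predicted exons for Pkey 401045 span 141 and 235 nucleotides, 376 in total.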
TABLE 22: 739 GENES UP-REGULATED IN BREAST CANCER
COMPARED TO NORMAL ADULT BREAST

Table 22 shows 739 genes up-regulated in breast cancer compared to normal adult breast.
These were selected as for Table 19, except that the ratio was greater than or equal to 3.0, the
denominator was the 85th percentile value for 12 non-malignant breast specimens, and the
96th percentile value amongst the 73 breast cancers was greater than or equal to 100 units.
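The selection rule for Table 22 combines a ratio threshold with a minimum expression level. The sketch below shows one way such a filter could be computed. It is not the procedure actually used to build the table: the function names, the linear-interpolation percentile, and the parameter `floor_pct` (the tumor percentile that must reach 100 units, given its stated value as the default) are assumptions for illustration. The 90th percentile numerator is taken from the R1 column definition.

```python
def percentile(values, pct):
    """Percentile with linear interpolation between ranks (assumed convention)."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("percentile() needs at least one value")
    rank = (len(ordered) - 1) * pct / 100.0
    low = int(rank)
    high = min(low + 1, len(ordered) - 1)
    return ordered[low] + (ordered[high] - ordered[low]) * (rank - low)


def r1_ratio(tumor, normal, tumor_pct=90, normal_pct=85):
    """R1: tumor percentile divided by normal-breast percentile."""
    return percentile(tumor, tumor_pct) / percentile(normal, normal_pct)


def is_selected(tumor, normal, min_ratio=3.0, min_units=100.0, floor_pct=96):
    """Apply both criteria: ratio >= min_ratio and tumor level >= min_units."""
    if percentile(normal, 85) <= 0:
        return False  # ratio undefined; treated as not selected in this sketch
    return (r1_ratio(tumor, normal) >= min_ratio
            and percentile(tumor, floor_pct) >= min_units)


# Hypothetical expression values for one probeset (not data from the tables).
tumor_values = [0, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000]
normal_values = [10] * 12
selected = is_selected(tumor_values, normal_values)
```

Using a high tumor percentile in the numerator, rather than the mean, favors genes that are strongly expressed in a subset of tumors even when most tumors show little signal.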

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of 90th percentile tumor to 85th percentile normal breast tissue

Pkey ExAccn UnigeneID UnigeneTitle R1

400292 AA250737 Hs.72472 BMP-R1B 51.5
424735 U31875 Hs.272499 short-chain alcohol dehydrogenase family 38.3
400297 AI127076 Hs.334473 hypothetical protein DKFZp564O1278 29.9
431448 AL137517 Hs.334473 hypothetical protein DKFZp564O1278 26.9
451110 AI955040 Hs.265398 ESTs, Weakly similar to transformation-r 25.8
431211 M86849 Hs.323733 gap junction protein, beta 2, 26kD (conn 23.2
418203 X54942 Hs.83758 CDC28 protein kinase 2 22.6
407980 AA046309 gb:2f12fOU1 Soares_fetal_heart_NbHH19W 19.8
414646 AA353776 Hs.901 CD48 antigen (B-cell membrane protein) 18.9
446921 AB012113 Hs.16530 small inducible cytokine subfamily A (Cy 18.0
409041 AB033025 Hs.50081 KIAA1199 protein 17.6
412140 AA219691 Hs.73625 RAB6 interacting, kinesin-like (rabkines 17.6
407824 AA147884 Hs.9812 Homo sapiens cDNA FLJ14388 fis, clone HE 17.1
453160 AI263307 Hs.239884 H2B histone family, member L 17.0
407137 T97307 gb:ye53h05.s1 Soares fetal liver spleen 16.1
425692 D90041 Hs.155956 N-acetyltransferase 1 (arylamine N-acety 16.1
438533 AI440266 Hs.170673 ESTs, Weakly similar to T24832 hypotheti 16.0
428227 AA321649 Hs.2248 small inducible cytokine subfamily B (CX 15.5
444342 NM_014398 Hs.10887 similar to lysosome-associated membrane 15.1
422505 AL120862 Hs.124165 programmed cell death 9 (PDCD9) 14.9
430515 AA746503 Hs.283313 ESTs 14.7
417308 H60720 Hs.81892 KIAA0101 gene product 14.4
452744 AI267652 Hs.30504 Homo sapiens mRNA; cDNA DKFZp434E082 (fr 14.4
412446 AI768015 Hs.92127 ESTs 14.2
415539 AI733881 Hs.72472 BMP-R1B 14.1
435496 AW840171 Hs.265398 ESTs, Weakly similar to transformation-r 13.8
438209 AL120659 Hs.6111 aryl hydrocarbon receptor nuclear transl 13.8
400205 NA NM_006265*:Homo sapiens RAD21 (S. pombe) 13.5
430965 M489732 Hs.154918 ESTs 13.4
415263 AA948033 Hs.130853 ESTs 13.3
451952 AL120173 Hs.301663 ESTs 13.2
449722 BE280074 Hs.23960 cyclin B1 13.2
406685 M18728 gb:Human nonspecific crossreacting antig 13.0
406690 M29540 Hs.220529 carcinoembryonic antigen-related cell ad 12.8
429925 NM_000786 Hs.226213 cytochrome P450, 51 (lanosterol 14-alpha 12.8
416498 U33632 Hs.79351 potassium channel, subfamily K, member 1 12.7
432378 AJ493046 Hs.146133 ESTs 12.5
441377 BE218239 Hs.202656 ESTs 12.5
456207 AA193450 gb:zr40e07.r1 Soares_NhHMPu_S1 Homo sapi 12.4
422805 AA436989 Hs.121017 H2A histone family, member A 12.2
407811 AW190902 Hs.40098 cysteine knot superfamily 1, BMP antagon 12.2
407178 AA195651 Hs.104106 ESTs 112
420931 AF044197 Hs.100431 small inducible cytokine B subfamily (Cy 12.1
421727 Y13153 Hs.107318 kynurenine 3-monooxygenase (kynurenine 3 12.0
434408 AI031771 Hs.132586 ESTs 12.0
446591 H44186 Hs.15456 PDZ domain containing 1 11*
431385 BE178536 Hs.11090 membrane-spanning 4-domains, subfamily A 11.8
443348 AW873596 Hs.182278 calmodulin 2 (phosphorylase kinase, delt 11.7
416602 NM_006159 Hs.79389 nel (chicken)-like 2 11.7
433365 AF026944 Hs.293797 ESTs JJ*
437866 AA156781 Hs.74170 metallothionein 1E (functional) 11.5
412472 AW975398 Hs.293836 ESTs ]]*
416030 H15261 Hs.21948 ESTs 11.3
439979 AW600291 Hs.6823 hypothetical protein FLJ10430 11.3
420757 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 11.3
411598 BE336654 Hs.70937 H3 histone family, member A 11.2
423600 AI633559 Hs.310359 ESTs 11.2
430770 AA765694 Hs.123296 ESTs 11.0
421037 AI684808 Hs.197653 programmed cell death 9 (PDCD9) 105
452461 N78223 Hs.108106 transcription factor 10.7
409269 AA576953 Hs.22972 hypothetical protein FLJ13352 10.6
417791 AW965339 Hs.111471 ESTs 1M
447268 AI370413 Hs.36563 hypothetical protein FLJ22418 10.4
424001 W67883 Hs.137476 paternally expressed 10 10.4
447342 AI199268 Hs.19322 Homo sapiens, Similar to RIKEN cDNA 2010 10.4
424905 NM_002497 Hs.153704 NIMA (never in mitosis gene a)-related k 10.1
453619 H87648 Hs.33922 Homo sapiens, clone MGC:9084, mRNA, comp 10.1
442942 AW167087 Hs.131562 ESTs 10.1
434377 AW137148 Hs.306593 Homo sapiens cDNA FLJ11382 fis, clone HE 10.1
427217 AA399272 Hs.144341 ESTs 10.1
445730 AI624342 Hs.170042 ESTs JW
432887 AI926047 Hs.162859 ESTs
452243 AL355715 Hs.28555 programmed cell death 9 9.9
424590 AW966399 Hs.46821 hypothetical protein FLJ20086 9.9
432169 Y00971 Hs.2910 phosphoribosyl pyrophosphate synthetase 9.9
438950 H23789 Hs.144530 EST 9.9
418836 AI655499 Hs.161712 ESTs 9.8
430291 AV660345 Hs.238126 CGI-49 protein 9.8
444665 BE613126 Hs.47783 B aggressive lymphoma gene 9.7
407377 C16391 gb:C16391 Clontech human aorta polyA mRN 9.7
445413 AA151342 Hs.12677 CGM47 protein 9.7
443462 AI064690 Hs.171176 ESTs 9.7
442145 AI022650 Hs.8117 erbb2-interacting protein ERBIN 9.7
435570 AF212222 Hs.177812 uncharacterized bone marrow protein BM04 9.7
439820 AL360204 Hs.283853 Homo sapiens mRNA full length insert cDN 9.6
428966 AF059214 Hs.194687 cholesterol 25-hydroxylase 9.6
449448 D60730 Hs.57471 ESTs 9.6
433929 AI375499 Hs.27379 ESTs 9.5
432731 R31178 Hs.287820 fibronectin 1 9.3
411815 AA156679 Hs.125790 leucine-rich repeat-containing 2 9.3
415385 R17798 Hs.7535 COBW-like protein 9.3
422026 U80736 Hs.110826 trinucleotide repeat containing 9 9.2
432596 AJ224741 Hs.278461 matrilin 3
439451 AF086270 Hs.278554 heterochromatin-like protein 1 9.2
423945 AA410943 gb:zt32h03.fi Soares ovary tumor NbHOT H 9.1
442432 BE093589 Hs.38178 hypothetical protein FLJ23468 9.1
446715 AI337735 Hs.173919 ESTs, Moderately similar to ZN91_HUMAN Z 9.0
408771 AW732573 Hs.47584 potassium voltage-gated channel, delayed 9.0
437021 AI076089 Hs.292239 ESTs 9.0
428479 Y00272 Hs.184572 cell division cycle 2, G1 to S and G2 to 8.9
428839 AI767756 Hs.82302 Homo sapiens cDNA FLJ14814 fis, clone NT 8.9
402408 NA NM_030920*:Homo sapiens hypothetical pro 8.9
418601 AA279490 Hs.86368 calmegin M
426327 W03242 Hs.44898 Homo sapiens clone TCCCTA00151 mRNA sequ 8.8
419519 AI198719 Hs.176376 ESTs 8.8
440621 AW296024 Hs.150434 ESTs 8.8
446142 AI754693 Hs.145968 ESTs 8.8
418196 AI745649 Hs.26549 KIAA1708 protein 8.7
447178 AW594641 Hs.192417 ESTs 8.7
427585 D31152 Hs.179729 collagen, type X, alpha 1 (Schmid metaph 8.6
415857 AA866115 Hs.127797 Homo sapiens cDNA FLJ11381 fis, clone HE 8.6
435061 AI651474 Hs.163944 ESTs 8.6
431374 BE258532 Hs.251871 CTP synthase 8.4
417866 AW067903 Hs.82772 collagen, type XI, alpha 1 8.4
437211 AA382207 Hs.5509 ecotropic viral integration site 2B 8.3
437751 AA767373 Hs.35669 ESTs, Moderately similar to ALU1_HUMAN A 8.3
423887 AL080207 Hs.134585 DKFZP434G232 protein 8.2
440941 BE268362 Hs.7535 COBW-like protein 8.2
429859 NM_007050 Hs.225952 protein tyrosine phosphatase, receptor t 8.2
410193 AJ132592 Hs.59757 zinc finger protein 281 8.2
431725 X65724 Hs.2839 Norrie disease (pseudoglioma) 8.1
446258 AI283476 Hs.263478 ESTs 8.1
416747 AW876523 Hs.15929 hypothetical protein FLJ12910 8.1
434424 AI811202 Hs.325335 Homo sapiens cDNA: FLJ23523 fis, clone L 8.1
421650 AA781795 Hs.122587 ESTs 8.0
429534 AW976987 Hs.163327 ESTs, Weakly similar to 2109260A B cell 8.0
457465 AW301344 Hs.122908 DNA replication factor 8.0
427961 AW293165 Hs.143134 ESTs 8.0
436481 AA379597 Hs.5199 HSPC150 protein similar to ubiquitin-con 8.0
418216 AA662240 Hs.283099 AF15q14 protein 8.0
418250 U29926 Hs.83918 adenosine monophosphate deaminase (isofo 7.9
400285 NA Eos Control 7.9
401464 AF039241 Hs.9028 histone deacetylase 5 7.9
407242 M18728 gb:Human nonspecific crossreacting antig 7.8
422232 D43945 Hs.113274 transcription factor EC 7.8
454024 AA993527 Hs.293907 hypothetical protein FLJ23403 7.8
444542 AI161293 Hs.280380 aminopeptidase 7.8
436396 AI683487 Hs.152213 wingless-type MMTV integration site fami 7.7
437204 AL110216 Hs.12285 ESTs, Weakly similar to I55214 salivary 7.6
408805 H69912 Hs.48269 vaccinia related kinase 1 7.6
437207 T27503 Hs.15929 hypothetical protein FLJ12910 7.6
442818 AK001741 Hs.8739 hypothetical protein FLJ10879 7.6
426283 NM_003937 Hs.169139 kynureninase (L-kynurenine hydrolase) 7.5
424687 J05070 Hs.151738 matrix metalloproteinase 9 (gelatinase B 7.5
446315 NM_016293 Hs.14770 bridging integrator 2 7.5
433426 H69125 Hs.133525 ESTs 7.5
406639 M97711 gb:Human T-cell receptor (V beta 18.1, J 7.5
420077 AW512260 Hs.87767 ESTs 7.4
457332 AA961694 Hs.105187 kinesin protein 9 gene 7.4
422938 NM_001809 Hs.1594 centromere protein A (17kD) 7.4
447555 AI391662 Hs.160963 Homo sapiens, clone MGC:12318, mRNA, com 7.4
444618 AV653785 Hs.173334 ELL-RELATED RNA POLYMERASE II, ELONGATIO 7.3
410361 BE391804 Hs.62661 guanylate binding protein 1, interferon- 7.3
400268 NA NM_003292*:Homo sapiens translocated prom 7.3
439509 AF086332 Hs.58314 ESTs 7.3
407771 AL138272 Hs.62713 ESTs 7.3
407202 N58172 Hs.109370 ESTs 7.3
433096 AU076803 Hs.282975 carboxylesterase 2 (intestine, liver) 7.2
422094 AF129535 Hs.272027 F-box only protein 5 7.1
430832 AI073913 Hs.100686 ESTs, Weakly similar to JE0350 Anterior 7.1
430287 AW182459 Hs.125759 ESTs, Weakly similar to LEU5_HUMAN LEUKE 7.0
423739 AA398155 Hs.97600 ESTs 7.0
448212 AJ475858 gb:tc87d07.x1 NCLCGAPJXL1 Homo sapiens 7.0
407277 AW170035 Hs.326736 Homo sapiens breast cancer antigen NY-BR 7.0
454440 BE062906 Hs.28338 KIAA1546 protein 7.0
444783 AK001468 Hs.62180 anillin (Drosophila Scraps homolog), act 7.0
421373 AA608229 Hs.167771 ESTs 6.9
431960 AW241821 Hs.301927 06.IA 6.9
424704 AI263293 Hs.152096 cytochrome P450, subfamily IIJ (arachido 6.8
449517 AW500106 Hs.23643 serine/threonine protein kinase MASK 6.8
439840 AW449211 Hs.105445 GDNF family receptor alpha 1 6.8
414080 AA135257 Hs.47783 B aggressive lymphoma gene 6.8
441243 AI767056 Hs.193002 ESTs 6.7
408380 AF123050 Hs.44532 diubiquitin 6.7
422956 BE545072 Hs.122579 hypothetical protein FLJ10461 6.7
446651 AA393907 Hs.97179 ESTs 6.7
419839 U24577 Hs.93304 phospholipase A2, group VII (platelet-ac 6.7
437740 M810265 Hs.122915 ESTs 6.7
421582 AI910275 Hs.1406 trefoil factor 1 (pS2) 6.7
427356 AW023482 Hs.97849 ESTs 6.6
429597 NM_003816 Hs.2442 a disintegrin and metalloproteinase doma 6.6
422634 NM_016010 Hs.118821 CGI-62 protein 6.6
421072 AI215069 Hs.89113 ESTs 6.5
427718 AI798680 Hs.25933 ESTs 6.5
411000 N40449 Hs.201619 ESTs, Weakly similar to S38383 SEB4B pro 6.5
449343 AI151418 Hs.272458 protein phosphatase 3 (formerly 2B), cat 6.4
409757 NM_001898 Hs.123114 cystatin SN 6.4
447164 AF026941 Hs.17518 Homo sapiens cig5 mRNA, partial sequence 6.4
456938 X52509 Hs.161640 tyrosine aminotransferase 6.4
418848 AI820961 Hs.193465 ESTs 6.4
424902 NM_003866 Hs.153687 inositol polyphosphate-4-phosphatase, ty 6.4
452838 U65011 Hs.30743 preferentially expressed antigen in mela 6.4
439452 AA918317 Hs.57987 B-cell CLL/lymphoma 11B (zinc finger pro 6.4
407266 AJ235664 gb:Homo sapiens mRNA for immunoglobulin 6.3
411078 AI222020 Hs.182364 CocoaCrisp 6.3
433001 AF217513 Hs.279905 clone HQ0310 PRO0310p1 6.3
434340 AI193043 Hs.128685 ESTs, Weakly similar to T17226 hypotheti 6.2
429503 AA394183 Hs.26873 ESTs 6.2
402578 C1001134:gi|2117372|pir||I65981 fatty ac 6.2
409646 AW161391 Hs.709 deoxycytidine kinase 6.1
430447 W17064 Hs.332848 SWI/SNF related, matrix associated, acti 6.1
432415 T16971 Hs.289014 ESTs, Weakly similar to A43932 mucin 2 p 6.1
443709 AI082692 Hs.134662 ESTs 6.1
420929 AI694143 Hs.296251 programmed cell death 4 6.1
428368 BE440042 Hs.83326 matrix metalloproteinase 3 (stromelysin 6.1
428248 AI126772 Hs.40479 ESTs 6.0
420344 BE463721 Hs.97101 putative G protein-coupled receptor 6.0
453392 U23752 Hs.32964 SRY (sex determining region Y)-box 11 6.0
425397 J04088 Hs.156346 topoisomerase (DNA) II alpha (170kD) 6.0
418007 M13509 Hs.83169 matrix metalloproteinase 1 (MMP1; inters 6.0
428585 AB007863 Hs.185140 KIAA0403 protein 6.0
437608 AA761605 Hs.292308 ESTs, Weakly similar to ALU1_HUMAN ALU S 6.0
427408 AA583206 Hs.2156 RAR-related orphan receptor A 6.0
406687 M31126 Hs.272620 matrix metalloproteinase 11 (MMP11; stro 6.0
418092 R45154 Hs.106604 ESTs 6.0
447051 AW139130 Hs.160951 ESTs, Weakly similar to Con1 [H.sapiens] 6.0
441233 AA972965 Hs.135568 ESTs 6.0
432239 X81334 Hs.2936 matrix metalloproteinase 13 (collagenase 6.0
435106 AA100847 Hs.193380 ESTs, Highly similar to AF174600 1 F-box 5.9
435525 AI831297 Hs.123310 ESTs 5.9
458809 AW972512 Hs.20985 sin3-associated polypeptide, 30kD 5.9
410785 AW803341 gb;IL24ilW0079-090300-050-O03 UM0079 Homo 5.9
422576 BE548555 Hs.118554 CGI-83 protein 5.9
451398 AI793124 Hs.144479 ESTs 5.9
441881 AW968904 Hs.179566 hypothetical protein FLJ22624 5.8
412022 AI005043 Hs^4143 Wiskott-Aldrich syndrome protein interac 5.8
416636 N32536 Hs.42645 solute carrier family 16 (monocarboxylic 5.8
447350 AI375572 Hs.172634 ESTs 5.8
434094 AA305599 Hs.238205 hypothetical protein PRO2013 5.8
409151 AA306105 Hs.50785 SEC22, vesicle trafficking protein (S. c 5.8
448807 AI571940 Hs.7549 ESTs 5.8
452281 T93500 Hs.28792 Homo sapiens cDNA FLJ11041 fis, clone PL 5.8
421281 AI299139 Hs.17517 ESTs 5.8
430361 AI033965 Hs.239926 sterol-C4-methyl oxidase-like 5.8
400289 X07820 Hs.2258 matrix metalloproteinase 10 (MMP10; str 5.7
440527 AV657117 Hs.184164 ESTs, Moderately similar to S65657 alpha 5.7
434674 AA831879 Hs.136985 ESTs 5.7
426320 W47595 Hs.169300 transforming growth factor, beta 2 5.7
452401 NM_007115 Hs.29352 tumor necrosis factor, alpha-induced pro 5.7
448663 BE614599 Hs.105823 hypothetical protein MGC14797 5.7
438199 AW016531 Hs.122147 ESTs 5.7
446203 Z47553 Hs.14286 flavin containing monooxygenase 5 5.7
428336 AA503115 Hs.183752 microseminoprotein, beta- 5.6
430379 AF134149 Hs.240395 potassium channel, subfamily K, member 6 5.6
422835 BE218705 Hs.121378 metallothionein-like 5, testis-specific 5.6
444758 AL044878 Hs.11899 3-hydroxy-3-methylglutaryl-Coenzyme A re 5.6
443426 AF098158 Hs.9329 chromosome 20 open reading frame 1 5.6
400301 X03635 Hs.1657 estrogen receptor 1 5.6
447078 AW885727 Hs.301570 ESTs 5.6
432015 AL157504 Hs.159115 Homo sapiens mRNA; cDNA DKFZp586O0724 (f 5.5
438691 AA906288 Hs.212184 ESTs 5.5
439809 R41395 Hs.101774 hypothetical protein FLJ23045 5.5
415786 AW419195 Hs.257924 hypothetical protein FLJ13782 5.5
456373 BE247706 Hs.89751 membrane-spanning 4-domains, subfamily A 5.5
401645 NA C16001440*:gi|12330704|gb|AAG52890.1|AF3 5.5
437967 BE277414 Hs.5947 mel transforming oncogene (derived from 5.5
445885 AI734009 Hs.127699 KIAA1603 protein 5.4
439138 AI742605 Hs.193696 ESTs 5.4
440270 NM_015985 Hs.7120 cytokine receptor-like molecule 9 5.4
437536 X91221 Hs.144465 ESTs 5.4
438167 R28363 Hs.24286 ESTs 5.4
452741 BE392914 Hs.30503 Homo sapiens cDNA FLJ11344 fis, clone PL 5.4
426214 H59846 Hs.128355 ESTs, Moderately similar to ALU7_HUMAN A 5.4
413554 AA319146 Hs.75426 secretogranin II (chromogranin C) 5.4
422867 L32137 Hs.1584 cartilage oligomeric matrix protein (COM 5.4
434263 N34895 Hs.44648 ESTs 5.4
446382 AW205168 Hs.150823 ESTs 5.4
422406 AF025441 Hs.116206 Opa-interacting protein 5 5.3
438321 AA576635 Hs.6153 CGI-48 protein 5.3
418310 AA814100 Hs.86693 ESTs 5.3
419625 U91616 Hs.91640 nuclear factor of kappa light polypeptid 5.3
450701 H39960 Hs^88467 Homo sapiens cDNA FLJ12280 fis, clone MA 5.3
445900 AF070526 Hs.13429 Homo sapiens clone 24787 mRNA sequence 5.2
449051 AW961400 Hs.333526 HER2 receptor tyrosine kinase (c-erb-b2, 5.2
418758 AW959311 Hs.172012 hypothetical protein DKFZp434J037 5.2
431070 AW408164 Hs.249184 transcription factor 19 (SC1) 5.2
417079 U65590 Hs.81134 interleukin 1 receptor antagonist 5.2
421928 AF013758 Hs.109643 polyadenylate binding protein-interactin 5.2
428804 AK000713 Hs.193736 hypothetical protein FLJ20706 5.2
427427 AF077345 Hs.177936 ESTs 5.2
403485 C3001813*:gi|12737279|ref|XP_012163.1|k 5.2
422168 AA586894 Hs.112408 S100 calcium-binding protein A7 (psorias 5.1
421937 AI878857 Hs.109706 hematological and neurological expressed 5.1
426752 X69490 Hs.172004 titin 5.1
453310 X70697 Hs.553 solute carrier family 6 (neurotransmitte 5.1
423198 M81933 Hs.1634 cell division cycle 25A 5.1
412281 AI810054 Hs.14119 ESTs 5.1
447513 AW955776 Hs.313500 ESTs, Moderately similar to ALU7_HUMAN A 5.1
453931 AL121278 Hs.25144 ESTs 5.1
404347 Target Exon 5.1
431808 M30703 Hs.270833 amphiregulin (schwannoma-derived growth 5.1
429113 D28235 Hs.198384 prostaglandin-endoperoxide synthase 2 (p 5.1
436291 BE568452 Hs.5101 protein regulator of cytokinesis 1 5.1
450603 R43646 Hs.12422 ESTs 5.1
434725 AK000796 Hs.4104 hypothetical protein 5.0
435981 H74319 Hs.188620 ESTs 5.0
407376 AA993138 Hs.142287 ESTs, Weakly similar to ALUF_HUMAN !!! 5.0
431689 AA305688 Hs.267695 UDP^GatbetaGlcNAcbeta1>galactosyllr 5.0
405348 NA C7001664*:gi|12B98061|dbj|BAB21849.1| (AB 5.0
436196 AK001084 Hs.333498 Homo sapiens cDNA FLJ10222 fis, clone HE 5.0
437065 AL036450 Hs.103238 ESTs 5.0
410196 AI936442 Hs.59838 hypothetical protein FLJ10808 5.0
429412 NM_006235 Hs.2407 POU domain, class 2, associating factor 5.0
446619 AU076643 Hs.313 secreted phosphoprotein 1 (osteopontin, 4.9
403329 NA Target Exon 4.9
442875 BE623003 Hs.23625 Homo sapiens clone TCCCTA00142 mRNA sequ 4.9
442441 AI820662 Hs.129598 ESTs 4.9
430375 AW371048 Hs.93758 H4 histone family, member H 4.9
424128 AW966163 gb:EST378236 MAGE resequences, MAGI Homo 4.9
408873 AL046017 Hs.182278 calmodulin 2 (phosphorylase kinase, delt 4.9
407910 AA650274 Hs.41296 fibronectin leucine rich transmembrane p 4.9
432606 NM_002104 Hs.3066 granzyme K (serine protease, granzyme 3; 4.9
453204 R10799 Hs.191990 ESTs 4.8
452020 AA722012 Hs.255757 ESTs, Weakly similar to AT2A_HUMAN POTEN 4.8
449048 Z45051 Hs.22920 similar to S68401 (cattle) glucose induc 4.8
408369 R38438 Hs.182575 solute carrier family 15 (H??? transport 4.8
431645 AF078849 Hs.266483 dynein light chain-A 4.8
423575 C18863 Hs.163443 Homo sapiens cDNA FLJ11576 fis, clone HE 4.8
444246 H93281 Hs.10710 hypothetical protein FLJ20417 4.8
421524 AA312082 Hs.105445 GDNF family receptor alpha 1 4.8
452827 AI571835 Hs.55468 ESTs 4.8
414222 AL135173 Hs.878 sorbitol dehydrogenase 4.8
456086 AL161999 Hs.77324 eukaryotic translation termination facto 4.8
419078 M93119 Hs.89584 insulinoma-associated 1 4.8
418973 AA233056 Hs.191518 ESTs 4.8
447033 AI357412 Hs.157601 ESTs 4.8
451621 AI879148 Hs.26770 fatty acid binding protein 7, brain 4.7
419968 X04430 Hs.93913 interleukin 6 (interferon, beta 2) 4.7
424326 NM_014479 Hs.145296 disintegrin protease 4.7
431585 BE242803 Hs.262823 hypothetical protein FLJ10326 4.7
429294 AA095971 Hs.198793 Homo sapiens cDNA: FLJ22463 fis, clone H 4.7
416814 AW192307 Hs.80042 dolichyl-P-Glc:Man9GlcNAc2-PP-dolichylgl 4.7
439897 NM_015310 Hs.6763 KIAA0942 protein 4.7
429687 AI675749 Hs.211608 nucleoporin 153kD 4.7
422880 AF228704 Hs.121524 glutathione reductase 4.7
405801 NM_000390*:Homo sapiens choroideremia (Ra 4.6
432435 BE218886 Hs.282070 ESTs 4.6
439544 W26354 Hs.28891 hypothetical protein FLJ11360; artemis p 4.6
425354 U62027 Hs.155935 complement component 3a receptor 1 4.6
436027 AI864053 Hs.39972 ESTs, Weakly similar to I38588 reverse t 4.6
424623 AW963062 Hs.337404 ESTs 4.6
403366 NA Target Exon 4.6
402542 Target Exon 4.6
450193 AI916071 Hs.15607 Homo sapiens Fanconi anemia complementat 4.6
411678 AI907114 Hs.71465 squalene epoxidase 4.6
456844 AI264155 Hs.152981 CDP-diacylglycerol synthase (phosphatida 4.6
448072 AM59306 Hs.24908 ESTs 4.5
408045 AW138959 Hs.245123 ESTs 4.5
423782 AI472209 Hs.323117 ESTs 4.5
447388 AW630534 Hs.76277 Homo sapiens, clone MGC:9381, mRNA, comp 4.5
448140 AF146761 Hs.20450 BCM-like membrane protein precursor 4.5
452561 AI692181 Hs.49169 KIAA1634 protein 4.5
425331 AW962128 gb:EST374201 MAGE resequences, MAGG Homo 4.5
428801 AW277121 Hs.254881 ESTs 4.5
428500 AI815395 Hs.184641 fatty acid desaturase 2 4.5
426075 AW513691 Hs.270149 ESTs, Weakly similar to 2109260A B cell 4.4
437259 AI377755 Hs.120695 ESTs 4.4
400409 AF153341 Hs.283954 Homo sapiens winged helix/forkhead trans 4.4
412863 AA121673 Hs.59757 zinc finger protein 281 4.4
426989 AI815206 Hs.99395 ESTs 4.4
4018^6 Target Exon 4.4
418819 AA228776 Hs.191721 ESTs 4.4
406348 Target Exon 4.4
412138 AW895387 gb:QV4-NN0038-300300-157-c10 NN0038 Homo 4.4
428550 AW297880 Hs.98661 ESTs 4.4
411743 AW862214 gb:QV4-CT0361-301299-074-b05 CT0361 Homo 4.4
429966 BE081342 Hs.283037 HSPC039 protein 4.4
423291 NM_004129 Hs.126590 guanylate cyclase 1, soluble, beta 2 4.4
423456 AL110151 Hs.128797 DKFZP586D0824 protein 4.4
452190 H26735 Hs.91668 Homo sapiens clone PP1498 unknown mRNA 4.4
424871 NM_004525 Hs.153595 low density lipoprotein-related protein 4 J
429575 AA706003 Hs.99387 ESTs 4.3
429922 Z97630 Hs.226117 H1 histone family, member 0 4.3
421379 Y15221 Hs.103982 small inducible cytokine subfamily B (Cy 4.3
400300 X03363 HER2 receptor tyrosine kinase (c-erb-b2, 4.3
437258 AL041243 Hs.174104 ESTs 4.3
446595 T57448 Hs.15467 hypothetical protein FLJ20725 4.3
403011 NA ENSP00000215330*:Probable serine/threoni 4.3
419055 AI365384 Hs.11571 Homo sapiens cDNA FLJ11570 fis, clone HE 4.3
418661 NM_001949 Hs.1189 E2F transcription factor 3 4.3
407786 AA687538 Hs.38972 tetraspan 1 4.3
429183 AB014604 Hs.197955 KIAA0704 protein 4.3
442914 AW188551 Hs.99519 hypothetical protein FLJ14007 4.3
441029 AI091795 Hs.179246 ESTs 4.3
452194 AI694413 Hs.332649 olfactory receptor, family 2, subfamily 4.3
414821 M63835 Hs.77424 Fc fragment of IgG, high affinity Ia, re 4.2
410102 AW248508 Hs.279727 Homo sapiens cDNA FLJ14035 fis, clone HE 4.2
452110 T47667 Hs.28005 Homo sapiens cDNA FLJ11309 fis, clone PL 4.2
442007 AA301116 Hs.142838 nucleolar phosphoprotein Nopp34 4.2
417318 AW953937 Hs.12891 ESTs 4.2
431818 AW510444 Hs.191705 ESTs, Weakly similar to T47184 hypotheti 4.2
443646 AI085198 Hs.164226 ESTs 4.2
419169 AW851980 Hs.262346 ESTs, Weakly similar to S72482 hypotheti 4.2
446839 BE091926 Hs.16244 mitotic spindle coiled-coil related prot 4.2
423242 AL039402 Hs.125783 DEME-6 protein 4.2
432116 AA902953 Hs.308538 ESTs 4.2
409038 T97490 Hs.50002 small inducible cytokine subfamily A (Cy 4.2
445625 BE246743 Hs.288529 hypothetical protein FLJ22635 4.2
425139 AW630488 Hs.325820 protease, serine, 23 4.2
447397 BE247676 Hs.18442 E-1 enzyme 4.2
410166 AK001376 Hs.59346 hypothetical protein FLJ10514 4.1
437295 AW779318 Hs.88417 ESTs 4.1
430486 BE062109 Hs.241551 chloride channel, calcium activated, fam 4.1
441790 AW294909 Hs.132208 ESTs 4.1
410129 BE244074 Hs.58831 regulator of Fas-induced apoptosis 4.1
427521 AW973352 Hs.290585 ESTs 4.1
425247 NM_005940 Hs.155324 matrix metalloproteinase 11 (MMP11; stro 4.1
412886 AF041163 Hs.74647 Human T-cell receptor active alpha-chain 4.1
441153 BE562826 gb:601336534F1 NIH_MGCJ4 Homo sapiens c 4.1
444301 AK000136 Hs.10760 asporin (LRR class 1) 4.1
426711 AA383471 Hs.180669 conserved gene amplified in osteosarcoma 4.1
405850 NA Target Exon 4.1
440283 AI732892 Hs.190489 ESTs 4.0
432441 AW292425 Hs.163484 ESTs 4.0
400284 NA estrogen receptor 1 4.0
417341 N91453 Hs.102987 ESTs 4.0
429732 U20158 Hs.2488 lymphocyte cytosolic protein 2 (SH2 doma 4.0
411393 AW797437 Hs.69771 B-factor, properdin 4.0
425704 U79293 Hs.159264 Human clone 23948 mRNA sequence 4.0
419594 AA013051 Hs.91417 topoisomerase (DNA) II binding protein 4.0
419092 J05581 Hs.89603 mucin 1, transmembrane 4.0
443147 AI034351 Hs.19030 ESTs 4.0
408633 AW963372 Hs.46677 PRO2000 protein 4.0
433404 T32982 Hs.102720 ESTs 4.0
421506 BE302796 Hs.105097 thymidine kinase 1, soluble 4.0
417900 BE250127 Hs.82906 CDC20 (cell division cycle 20, S. cerevi 3.9
414602 AW630088 Hs.76550 Homo sapiens mRNA; cDNA DKFZp564B1264 (f 3.9
413762 AW411479 Hs.846 FK506-binding protein 4 (59kD) 3.9
404580 NM_014112*:Homo sapiens trichorhinophala 3.9
452046 AB018345 Hs.27657 KIAA0802 protein 3.9
459587 AA031956 gb:zk15e04.s1 Soares_pregnant_uterus_NbH 3.9
416658 U03272 Hs.79432 fibrillin 2 (congenital contractural ara 3.9
426647 AA243464 Hs.294101 pre-B-cell leukemia transcription factor 3.9
429353 AL117406 Hs.200102 ATP-binding cassette transporter MRP8 3.9
419038 AW134924 Hs.190325 ESTs 3.9
418918 X07871 Hs.89476 CD2 antigen (p50), sheep red blood cell 3.9
421977 W94197 Hs.110165 ribosomal protein L26 homolog 3.9
442567 AI201183 Hs.130251 ESTs 3.9
421168 AF182277 Hs.330780 cytochrome P450, subfamily IIB (phenobar 3.9
431701 AW935490 Hs.14658 Human chromosome 5q13.1 clone 5G8 mRNA 3.9
418526 BE019020 Hs.85838 solute carrier family 16 (monocarboxylic 3.9
414998 NM_002543 Hs.77729 oxidised low density lipoprotein (lectin 3.9
422790 AA809875 Hs.25933 ESTs 3.9
419741 NM_007019 Hs.93002 ubiquitin carrier protein E2-C 3.9
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430017 AA263172 Hs.35 protein tyrosine phosphatase, non-recept 3.9 

458814 AI498957 Hs.170861 ESTs, Weakly similar to Z195_HUMAN ZINC 3.8

428514 AW236861 Hs.193139 ESTs 3.8

434521 NM_002267 Hs.3886 karyopherin alpha 3 (importin alpha 4) 3.8

409425 U40462 Hs.54452 zinc finger protein, subfamily 1A, 1 (Ik 3.8

439560 BE565647 Hs.74899 hypothetical protein FLJ12820 3.8

424023 AF055084 Hs.153892 Homo sapiens cDNA FLJ14354 fis, clone Y7 3.8

400021 AFFX control - HUMISGF3A/M97935_MA 3.8

453403 BE466639 Hs.61779 Homo sapiens cDNA FLJ13591 fis, clone PL 3.8

445941 AI267371 Hs.172636 ESTs 3.8
434378 AA631739 Hs.335440 EST 3.8

429220 AW207205 Hs.136319 ESTs 3.8

439176 AI446444 Hs.190394 ESTs, Weakly similar to B28096 line-1 pr 3.8

401045 C11001883*:gi|6753278|ref|NP_033938.1|c 3.8

430178 AW449612 Hs.152475 ESTs 3.8

423397 NM_001838 Hs.1652 chemokine (C-C motif) receptor 7 3.8

447630 AI660149 Hs.44865 lymphoid enhancer-binding factor 1 3.8 

436391 AJ227892 Hs.146274 ESTs 3.8 

413011 AW068115 Hs.821 biglycan 3.8

422121 AI767949 Hs.179833 ESTs 3.8

452268 NM_003512 Hs.28777 H2A histone family, member L 3.8

427811 M81057 Hs.180884 carboxypeptidase B1 (tissue) 3.8

415679 AA165232 Hs.222069 ESTs 3.8
437330 AL353944 Hs.50115 Homo sapiens mRNA; cDNA DKFZp761J1112 (f 3.8

427122 AW057736 Hs.323910 HER2 receptor tyrosine kinase (c-erb-b2, 3.7

400286 NA C16000922:gi|7499103|pir||T20903 hypothe 3.7

420281 AI623693 Hs.191533 ESTs 3.7

419926 AW900992 Hs.93796 DKFZP586D2223 protein 3.7

417541 AI992191 Hs.180040 hypothetical protein FLJ22439 3.7

426172 AA371307 Hs.125056 ESTs 3.7

429638 AI916662 Hs.211577 kinectin 1 (kinesin receptor) 3.7

457001 J03258 Hs.2062 vitamin D (1,25- dihydroxyvitamin D3) re 3.7
424109 AW406878 gb:UI-HF-BL0-adg-g-06-0-UI.r1 NIH_MGC_37 3.7

417022 NM_014737 Hs.80905 Ras association (RalGDS/AF-6) domain fam 3.7
436222 AI208737 Hs.122810 Homo sapiens cDNA FLJ11489 fis, clone HE 3.7

430448 AI633553 Hs.13303 Homo sapiens cDNA: FLJ21784 fis, clone H 3.7

432729 AK000292 Hs.278732 hypothetical protein FLJ20285 3.7

413916 N49813 Hs.75615 apolipoprotein C-II 3.7

421662 NM_014141 Hs.106552 cell recognition molecule Caspr2 3.7

441633 AW958544 Hs.112242 normal mucosa of esophagus specific 1 3.7

408761 AA057264 Hs.238936 ESTs, Weakly similar to (defline not ava 3.7

406153 Target Exon 3.7

445563 AW873606 Hs.149006 ESTs 3.7

453464 AI884911 Hs.32989 receptor (calcitonin) activity modifying 3.7

448918 AB011152 Hs.22572 KIAA0580 protein 3.7

413936 AF113676 Hs.297681 serine (or cysteine) proteinase inhibito 3.6

448069 U76248 Hs.20191 seven in absentia (Drosophila) homolog 2 3.6

453313 BE005771 Hs.153746 hypothetical protein FLJ22490 3.6

425234 AW152225 Hs.165909 ESTs, Weakly similar to I38022 hypotheti 3.6

419941 X98654 Hs.93837 phosphatidylinositol transfer protein, m 3.6

402397 AF188625 Hs.189507 phospholipase A2, group IID 3.6

430378 Z29572 Hs.2556 tumor necrosis factor receptor superfami 3.6

448106 AI800470 Hs.171941 ESTs 3.6

426431 NM_000458 Hs.169853 transcription factor 2, hepatic; LF-B3; 3.6

431843 AA516420 Hs.183526 ESTs, Weakly similar to I38022 hypotheti 3.6
426878 BE069341 gb:QV3-BT0381-270100-073-c08 BT0381 Homo 3.6

434061 AW024973 Hs.283675 NPD009 protein 3.6
445292 AV653264 Hs.13982 Homo sapiens cDNA FLJ14666 fis, clone NT 3.6
452101 T60296 Hs.10844 Homo sapiens cDNA FLJ14476 fis, clone MA 3.6

427581 NM_014788 Hs.179703 KIAA0129 gene product 3.6

409047 AW961434 Hs.31539 ESTs 3.6

416820 NM_000402 Hs.80206 glucose-6-phosphate dehydrogenase 3.6
410386 W26187 Hs.3327 Homo sapiens cDNA: FLJ22219 fis, clone H 3.6

440516 S42303 Hs.161 cadherin 2, type 1, N-cadherin (neuronal 3.6

434360 AW015415 Hs.127780 ESTs 3.6

428970 BE276891 Hs.194691 retinoic acid induced 3 3.6

415079 R43179 Hs.22895 hypothetical protein FLJ23548 3.6
409619 AK001015 Hs.55220 BCL2-associated athanogene 2 3.6

430044 AA464510 Hs.152812 ESTs 3.6

430629 AW451999 Hs.194024 ESTs 3.6

434224 AA380731 Hs.84 interleukin 2 receptor, gamma (severe co 3.6

439247 AF088020 Hs.46767 EST 3.6

431542 H63010 Hs.5740 ESTs is 

430713 AA351647 Hs.2642 eukaryotic translation elongation factor 3.5

434986 AI418055 Hs.161160 ESTs 3.5

437748 AF234882 Hs.5814 suppression of tumorigenicity 7 3.5

418322 AA284166 Hs.84113 cyclin-dependent kinase inhibitor 3 (CDK 3.5

439569 AW602166 Hs.222399 CEGP1 protein 3.5
459583 AI907673 gb:IL-BT152-080399-004 BT152 Homo sapien 3.5

403212 NM_019595:Homo sapiens intersectin 2 (IT 3.5

409099 AK000725 Hs.50579 hypothetical protein FLJ20716 3.5
453968 AA847843 Hs.62711 Homo sapiens, clone IMAGE:3351295, mRNA 3.5

436338 W92147 Hs.118394 ESTs 3.5

422890 Z43784 Hs.75893 ankyrin 3, node of Ranvier (ankyrin G) 3.5
442295 AI827248 Hs.224398 Homo sapiens cDNA FLJ11469 fis, clone HE 3.5

417975 AA641836 Hs.30085 hypothetical protein FLJ23186 3.5

433730 AK002135 Hs.3542 hypothetical protein FLJ11273 3.5

439926 AW014875 Hs.137007 ESTs 3.5 

445873 AA250970 Hs.251946 poly(A)-binding protein, cytoplasmic 1-1 3.4

410153 BE311926 Hs.15830 hypothetical protein FLJ12691 3.4
422128 AW881145 gb:QV0-OT0033-010400-182-a07 OT0033 Homo 3.4

414921 BE390551 Hs.TTm steroidogenic acute regulatory protein r 3.4

441134 W29092 Hs.7678 cellular retinoic acid-binding protein 1 3.4

444564 AI167877 Hs.143716 ESTs 3.4

402470 NA Target Exon 3.4

418120 AA213437 Hs.192249 ESTs 3.4

422414 AW875237 Hs.13701 ESTs 3.4

433345 AJ681545 Hs.152982 hypothetical protein FLJ13117 3.4

409213 U61412 Hs.51133 PTK6 protein tyrosine kinase 6 3.4

422611 AA158177 Hs.118722 fucosyltransferase 8 (alpha (1,6) fucosy 3.4

423554 M90516 Hs.1674 glutamine-fructose-6-phosphate transamin 3.4

442619 AA447492 Hs.20183 ESTs, Weakly similar to AF164793_1 prote 3.4
402359 NA Cl9001991^i|126^6111bb|AAK00751.1|AF23.4 

439398 AA284267 Hs.221504 ESTs 3.4

415208 F01020 Hs.172004 titin 3.4

452853 AA812633 Hs.10845 ESTs 3.4

429345 R11141 Hs.199695 hypothetical protein 3.4

449027 AJ271216 Hs.22880 dipeptidylpeptidase III 3.4

412115 AK001763 Hs.73239 hypothetical protein FLJ10901 3.4

432180 Y18418 Hs.272822 RuvB (E coli homolog)-like 1 3.4

428977 AK001404 Hs.194698 cyclin B2 3.4

431611 U58766 Hs.264428 tissue specific transplantation antigen 3.4
418286 AA622528 Hs.319825 Homo sapiens, clone IMAGE:3616574, mRNA, 3.4

436895 AF037335 Hs.5338 carbonic anhydrase XII (tumor antigen H 3.4

443378 AW392550 Hs.9280 proteasome (prosome, macropain) subunit, 3.4

428450 NM_014791 Hs.184339 KIAA0175 gene product 3.3

449571 AW016812 Hs.200266 ESTs 3.3

412777 AI335773 Hs.270123 ESTs 3.3 

420542 NM_000505 Hs.1321 coagulation factor XII (Hageman factor) 3.3

412754 AW160375 Hs.74565 amyloid beta (A4) precursor-like protein 3.3

418327 U70370 Hs.84136 paired-like homeodomain transcription fa 3.3

449065 AI627393 Hs.258998 ESTs, Weakly similar to high mobility gr 3.3

425999 AW513051 Hs.332981 ESTs, Weakly similar to I38022 hypotheti 3.3

430280 AA361258 Hs.237868 interleukin 7 receptor 3.3

407777 AA161071 Hs.71465 squalene epoxidase 3.3

426516 BE262660 Hs.170197 glutamic-oxaloacetic transaminase 2, mit 3.3

414361 AI086138 Hs.204044 ESTs 3.3

427080 AW068287 Hs.173466 ras-related C3 botulinum toxin substrate 3.3

426429 X73114 Hs.169849 myosin-binding protein C, slow-type 3.3

446163 AA026880 Hs.25252 prolactin receptor 3.3

428566 U41763 Hs.184916 clathrin, heavy polypeptide-like 1 3.3

418641 BE243136 Hs.86947 a disintegrin and metalloproteinase doma 3.3

436293 AI601188 Hs.120910 ESTs 3.3

411257 AA628987 Hs.115274 ESTs, Highly similar to IHH_HUMAN INDIAN 3.3
430253 AK001514 Hs.236844 hypothetical protein FLJ10652 3.3
430066 AI929659 Hs.237825 signal recognition particle 72kD 3.3 
436469 AK001455 Hs.5198 Down syndrome critical region gene 2 3.3 
437786 BE142681 Hs.155573 polymerase (DNA directed), eta 3.3 
444079 H09048 Hs.23606 ESTs 3.3
457183 H91882 Hs.118569 Dvl-binding protein IDAX (inhibition of 3.3
431215 AA496078 Hs.121554 Human DNA sequence from clone RP11-218C1 3.3
424563 AA446932 Hs.151428 ret finger protein 2 3.3
450828 AW270655 Hs.193804 ESTs 3.3

408652 R43409 Hs.6829 Homo sapiens mRNA for KIAA1544 protein, 3.3
445142 AW978484 Hs.93842 Homo sapiens cDNA: FLJ22554 fis, clone H 3.3
426761 AI015709 Hs.172089 Homo sapiens mRNA; cDNA DKFZp586I2022 (f 3.3
439237 AW408158 Hs.318893 ESTs, Weakly similar to A47582 B-cell gr 3.3
422616 BE300330 Hs.118725 selenophosphate synthetase 2 3.3

443247 BE614387 Hs.333893 c-Myc target JP01 3.3
406663 U24683 Hs.302063 Immunoglobulin heavy constant mu 3.3
434137 AA907734 Hs.124895 ESTs 3.3
408877 AA479033 Hs.130315 ESTs, Weakly similar to A47582 B-cell gr 3.3
439101 C01765 Hs.38750 hypothetical protein FLJ11526 3.3

408221 AA912183 Hs.47447 ESTs 3.3
447519 U46258 Hs.339665 ESTs 3.3 
404755 NA Target Exon 3.3 

451871 AI821005 Hs.118599 ESTs 3.2 
420319 AW406289 Hs.96593 hypothetical protein 3.2 

430580 AA806105 Hs.300697 Immunoglobulin heavy constant gamma 3 (G 3.2
400202 NA NM_002795*:Homo sapiens proteasome (pros 3.2

400222 NA NM_002082*:Homo sapiens G protein-couple 3.2

425988 BE045897 Hs.274454 ESTs, Weakly similar to I38022 hypotheti 3.2
458098 BE550224 Hs.74170 metallothionein 1E (functional) 3.2

430589 AJ002744 Hs.246315 UDP-N-acetyl-alpha-D-galactosamine:polyp 3.2
431563 AI027643 Hs.120912 ESTs 3.2
442353 BE379594 Hs.49136 ESTs, Moderately similar to ALU7_HUMAN A 3.2
422309 U79745 Hs.114924 solute carrier family 16 (monocarboxylic 3.2
419703 AI793257 Hs.128151 ESTs 3.2

420380 AA640891 Hs.102406 ESTs 3.2
410853 H04588 Hs.30469 ESTs 3.2
454417 AI244459 Hs.110826 trinucleotide repeat containing 9 3.2
432745 AI821926 gb:nt78f05.x5 NCI_CGAP_Pr3 Homo sapiens 3.2

422032 AA476966 Hs.110857 polymerase (RNA) III (DNA directed) poly 3.2

415339 NM_015156 Hs.78398 KIAA0071 protein 3.2
426384 AI472078 Hs.303662 ESTs 3.2
448030 N30714 Hs.325960 membrane-spanning 4-domains, subfamily A 3.2
418739 AA310964 Hs.88012 SHP2 interacting transmembrane adaptor 3.2
442053 R35343 Hs.24968 Human DNA sequence from clone RP1-233G16 3.2

434747 AA837085 Hs.220585 ESTs 3.2
427297 AW292593 Hs.334907 Homo sapiens, clone MGC:17333, mRNA, com 3.2
412228 AW503785 Hs.73792 complement component (3d/Epstein Barr vi 3.2
452304 AA025386 Hs.61311 ESTs, Weakly similar to S10590 cysteine 3.2
453953 AW408337 Hs.36972 CD7 antigen (p41) 3.2

407758 D50915 Hs.38365 KIAA0125 gene product 3.2
451149 AL047586 Hs.10283 RNA binding motif protein 8B 3.2
430015 AW768399 Hs.112157 ESTs 3.2
433313 W20128 Hs.296039 ESTs 3.2
418334 AA319233 Hs.5521 ESTs 3.2

450223 AA418204 Hs.241493 natural killer-tumor recognition sequenc 3.2
454365 AW966728 Hs.54642 methionine adenosyltransferase II, beta 3.2
451128 AL118668 gb:DKFZp761I0310_r1 761 (synonym: hamy2) 3.2

417793 AW405434 Hs.82575 small nuclear ribonucleoprotein polypept 3.2
428027 U22029 Hs.334345 cytochrome P450, subfamily IIA (phenobar 3.2

441197 BE244638 Hs.166 sterol regulatory element binding transc 3.2
424634 NM_003613 Hs.151407 cartilage intermediate layer protein, nu 3.2
419986 AI345455 Hs.78915 GA-binding protein transcription factor, 3.2
416714 AF283770 Hs.79630 CD79A antigen (immunoglobulin-associated 3.2
449465 NM_004380 Hs.23598 CREB binding protein (Rubinstein-Taybi s 3.2

422166 W72424 Hs.112405 S100 calcium-binding protein A9 (calgran 3.2
409079 W87707 Hs.82065 interleukin 6 signal transducer (gp130, 3.2
423551 AA327598 Hs.233785 ESTs 3.2 
453553 AA036849 Hs.61829 Homo sapiens cDNA FLJ12763 fis, clone NT 3.2
442580 AI733682 Hs.130239 ESTs 3.2
456079 AI796870 Hs.54277 DNA segment on chromosome X (unique) 992 3.2
425700 AF076292 Hs.15925! forkhead box H1 3.2 
417124 BE122762 Hs.25338 ESTs 3.2
407104 S57296 Hs.323910 v-erb-b-2 avian erythroblastic leukemia v 3.2 
442215 AI703172 Hs.129005 ESTs, Weakly similar to 2109260A B cell 3.1 
430271 T06199 Hs.237506 DnaJ (Hsp40) homolog, subfamily B, membe 3.1 
425317 AW205118 Hs.210546 interleukin 21 receptor 3.1
426095 AI278023 Hs.89986 ESTs 3.1
442313 BE388898 Hs.8215 hypothetical protein FLJ11307 3.1
424709 AL137589 Hs.152149 hypothetical protein DKFZp434K0410 3.1
429671 BE379335 Hs.211594 proteasome (prosome, macropain) 26S subu 3.1
432715 AA247152 Hs.200483 ESTs, Weakly similar to KIAA1074 protein 3.1
431574 AW572659 Hs.261373 hypothetical protein dJ434014.3 3.1
436876 AI124756 Hs.5337 isocitrate dehydrogenase 2 (NADP), mitoc 3.1
405017 NA Target Exon 3.1

433805 AA706910 Hs.112742 ESTs 3.1
437352 AL353957 Hs.284181 hypothetical protein DKFZp434P0531 3.1
430105 X70297 Hs.2540 cholinergic receptor, nicotinic alpha p 3.1
422083 NM_001141 Hs.111256 arachidonate 15-lipoxygenase, second typ 3.1
413507 BE145360 Hs.190064 ESTs, Weakly similar to I38022 hypotheti 3.1
415989 AI267700 Hs.317584 ESTs 3.1
422907 AI879263 Hs.6986 Human glucose transporter pseudogene 3.1
425548 AA890023 Hs.1906 prolactin receptor 3.1
422599 BE387202 Hs.118638 non-metastatic cells 1, protein (NM23A) 3.1
439963 AW247529 Hs.6793 platelet-activating factor acetylhydrola 3.1
453883 AI638516 Hs.22630 cofactor required for Sp1 transcriptiona 3.1
458021 AI885190 Hs.156089 ESTs, Weakly similar to repressor protei 3.1
418478 U38945 Hs.1174 cyclin-dependent kinase inhibitor 2A (me 3.1
400814 NA Target Exon 3.1

402327 Target Exon 3.1 

416935 AA190712 gb:zp87f09.r1 Stratagene HeLa cell s3 93 3.1

439838 AL355722 Hs.106875 Homo sapiens EST from clone 35214, full 3.1
437036 AI571514 Hs.133022 ESTs 3.1
449523 NM_000579 Hs.54443 chemokine (C-C motif) receptor 5 3.1
406642 AJ245210 gb:Homo sapiens mRNA for immunoglobulin 3.1

406624 AF052762 gb:Homo sapiens clone csneg8-1 immunoglo 3.1

421924 BE514514 Hs.109606 coronin, actin-binding protein, 1A 3.1
414523 AU076633 Hs.76353 serine (or cysteine) proteinase inhibito 3.1
416379 N38857 Hs.203933 ESTs 3.1
422823 D89974 Hs.121102 vanin 2 3.1
433904 AI399956 Hs.208956 ESTs 3.1
421904 BE143533 Hs.109309 hypothetical protein FLJ20035 3.1
428834 AW899713 Hs.339315 ESTs 3.1
436043 AW963838 Hs.168830 Homo sapiens cDNA FLJ12136 fis, clone MA 3.1
452823 AB012124 Hs.30696 transcription factor-like 5 (basic helix 3.1
405381 NA Target Exon 3.1

428746 AW503820 Hs.192861 Spi-B transcription factor (Spi-1/PU.1 r 3.1
435147 AL133731 Hs.4774 Homo sapiens mRNA; cDNA DKFZp761C1712 (f 3.1
425782 U66468 Hs.159525 cell growth regulatory with EF-hand doma 3.1
423306 W88562 Hs.108198 ESTs 3.1
419123 AA234276 Hs.88253 ESTs 3.1
438581 AW977766 Hs.292133 ESTs, Moderately similar to I78885 serin 3.1
417105 X60992 Hs.81226 CD6 antigen 3.0
428361 NM_015905 Hs.183858 transcriptional intermediary factor 1 3.0
417880 BE241595 Hs.82848 selectin L (lymphocyte adhesion molecule 3.0
402606 NM_024626:Homo sapiens hypothetical prot 3.0

401451 NM_004496*:Homo sapiens hepatocyte nucle 3.0

421878 AA299652 Hs.111496 Homo sapiens cDNA FLJ11643 fis, clone HE 3.0
409518 BE384836 Hs.3454 KIAA1821 protein 3.0
416933 BE561850 Hs.80506 small nuclear ribonucleoprotein polypept 3.0
414324 Y14768 Hs.890 lymphotoxin beta (TNF superfamily, membe 3.0
425081 X74794 Hs.154443 minichromosome maintenance deficient (S. 3.0
401519 C15000476*:gi|12737279|ref|XP_012163.1| 3.0

411704 AI499220 Hs.71573 hypothetical protein FLJ10074 3.0
428819 AL135623 Hs.193914 KIAA0575 gene product 3.0
428423 AU076517 Hs.184276 solute carrier family 9 (sodium/hydrogen 3.0

413835 AI272727 Hs.249163 fatty acid hydroxylase 3.0 

412600 L28824 Hs.74101 spleen tyrosine kinase 3.0 

410491 AA465131 Hs.64001 Homo sapiens clone 25218 mRNA sequence 3.0

433656 L03678 Hs.156110 immunoglobulin kappa constant 3.0

427666 AI791495 Hs.180142 calmodulin-like skin protein 3.0

452514 AI904898 gb:RC-BT068-130399-085 BT068 Homo sapien 3.0

429500 X78565 Hs.289114 hexabrachion (tenascin C, cytotactin) 3.0

432485 N90866 Hs.276770 CDW52 antigen (CAMPATH-1 antigen) 3.0

437400 AB011542 Hs.5599 EGF-like-domain, multiple 5 3.0

452234 AW084176 Hs.223296 ESTs, Weakly similar to I38022 hypotheti 3.0



413269 BE167526 gb:CM4-HT0509-080300-107-g07 HT0509 Homo 3.0

453216 AL137566 Hs.32405 Homo sapiens mRNA; cDNA DKFZp586G0321 (f 3.0
400929 ENSP00000252232*:Sterol regulatory eleme 3.0

445145 AI961702 Hs.147434 ESTs 3.0

432615 AA557191 Hs.55028 ESTs, Weakly similar to I54374 gene NF2 3.0

423279 AW959861 Hs.290943 ESTs 3.0

429392 AL109712 Hs.296506 Homo sapiens mRNA full length insert cDN 3.0

408548 AA055449 Hs.63187 ESTs, Weakly similar to ALUC_HUMAN !!!! 3.0

451346 NM_006338 Hs.26312 glioma amplified on chromosome 1 protein 3.0

413109 AW389845 Hs.110855 ESTs 3.0

401714 NA ENSP00000241802*:CDNA FLJ11007 FIS, CLON 3.0

421462 AF016495 Hs.104624 aquaporin 9 3.0

421750 AK000768 Hs.107872 hypothetical protein FLJ20761 3.0

453293 AA382267 Hs.10653 ESTs 3.0

457085 AA412446 Hs.98138 ESTs 3.0

438930 AW843633 Hs.306163 hypothetical protein AL110115 3.0
TABLE 22A

Table 22A shows the accession numbers for those pkeys lacking UniGene IDs for Table 22.
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey CAT number Accessions

411743 1256098_1 AW862214 AW859811 AW862215 miaftcrtfl
412138 1279172_1 AW895387 AW895547 AW895564 AW895323 AW895405 AW895539 AW895538
413269 1356961_1 BE167526 BE167651 BE076401 R24654
416935 163179_1 AA190712 AA190665 AA252564
422128 211994_1 AW881145 AA490718 M85637 AA304575 T06067 AA331991
423945 233566_1 AA410943 AW948953 AA334202 AA332882 aoocco^
424109 235506_1 AW406878 AW966560 AW966151 AW966496 AA336174 AA335376 AA335537
424128 235728_1 AW966163 AA335983 AA336011 AA335668 AA335973
425331 250199_1 AW962128 AA355353 AA427363
426878 273265_1 BE069341 AW748403 AL044891 AI908240 AA393080
432745 353673_1 AI821926 AA658826 AA564492 AA635129 AI791191
441153 51084_2 BE562826 BE378727
448212 755099_1 AI475858 AW969013
451128 859865_1 AL118668 D78823 AI762176
452514 920172_1 AI904898 AI904849 AI904899
456207 165078.-1 AA193450
TABLE 22B

Table 22B shows the genomic positioning for those pkeys lacking UniGene IDs and
accession numbers in Table 22. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also
listed.

Pkey: Unique number corresponding to an Eos probeset.

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication
entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.

Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.
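
The Nt_position field described above can be read mechanically. The following is a minimal sketch, not part of the original disclosure, of how such a field might be split into exon coordinate pairs. The function names are hypothetical, and it assumes that ranges are comma-separated, that start and end are joined by a hyphen, and that both coordinates are inclusive. Entries whose hyphen was lost (for example "4331043462") are rejected rather than guessed at.

```python
def parse_nt_positions(field):
    """Parse an Nt_position field such as '72840-72924,74761-74849'
    into a list of (start, end) integer pairs (hypothetical helper)."""
    exons = []
    for part in field.split(","):
        # A part without exactly one hyphen raises ValueError on unpacking.
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"start after end in {part!r}")
        exons.append((start, end))
    return exons


def total_exon_length(exons):
    # Assumption: coordinates are inclusive, so a range a-b spans b - a + 1 bases.
    return sum(end - start + 1 for start, end in exons)


exons = parse_nt_positions("72840-72924,74761-74849")
print(exons)
print(total_exon_length(exons))
```

Under the inclusive-coordinate assumption, the two predicted exons listed for Pkey 400814 span 85 and 89 bases respectively.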



Pkey Ref Strand Nt_position

400814 8569925 Minus 72840-72924,74761-74849
400929 7651921 Minus 122033-122241,123483-124028
401045 8117619 Plus 90044-90184,91111-91345
401451 6634068 Minus 119926-121272
401519 6649315 Plus 157315-157950
401645 7657839 Minus 34986-35133
401714 6715702 Plus 96484-96681
401666 8018106 Plus 73126-73623
402327 7656695 Minus 108675-108770,109801-109910
402359 9211204 Minus 40403-41961
402408 9796239 Minus 110326-110491
402470 9797107 Plus 195129-195776
402542 9801558 Minus 67076-67594
402578 9884928 Plus 66350-66496
402606 9909429 Minus 81747-82094
403011 6693597 Minus 3468-3623
403212 7630897 Minus 156037-156210
403329 8516120 Plus 96450-96598
403366 8783692 Minus 49323-49652
403485 9966528 Plus 2888-3001,3198-3532,36554117
404347 9838195 Plus 74493-74829
404580 6539738 Minus 240588-241589
404755 7706327 Minus 53729-53846
405017 6532084 Plus 35551-35690
405348 2914717 Minus 4331043462
405381 6006920 Minus 7636-8054
405801 2924321 Plus 63469-63694
405850 6164995 Plus 1387144110
406153 9929734 Minus 1290243069
406348 9255985 Minus 71754-71944
TABLE 23: 320 GENES DOWN-REGULATED IN BREAST CANCER
COMPARED TO NORMAL ADULT BREAST

Table 23 shows 320 genes down-regulated in breast cancer compared to normal adult breast.
These were selected as for Table 22, except that the numerator was set to the median value
for 12 non-malignant breast specimens, the denominator was set to the median value amongst
the 73 breast cancers, the 90th percentile value amongst the 12 non-malignant breast
specimens was greater than or equal to 80 units, and the ratio was greater than or equal to 4.0
(i.e. 4-fold down-regulated in tumor vs. normal breast).
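
The selection rule stated above can be illustrated with a short sketch. This is not the applicants' software; it is a minimal example under stated assumptions: the function names are hypothetical, the 90th percentile uses the nearest-rank convention (the text does not say which convention was used), and only the criteria spelled out for Table 23 are applied (any further criteria inherited from Table 22 are not modeled).

```python
import math
from statistics import median


def percentile(values, pct):
    """Nearest-rank percentile; the convention is an assumption, not from the text."""
    ordered = sorted(values)
    rank = max(1, math.ceil(pct / 100 * len(ordered)))
    return ordered[rank - 1]


def down_regulation_ratio(normal, tumor):
    # Numerator: median of non-malignant specimens; denominator: median of tumors.
    return median(normal) / median(tumor)


def passes_table23_filter(normal, tumor, min_units=80.0, min_ratio=4.0):
    """Apply only the two thresholds stated for Table 23 (hypothetical helper)."""
    if median(tumor) <= 0:
        # Assumption: a non-positive denominator is treated as not selectable.
        return False
    return (percentile(normal, 90) >= min_units
            and down_regulation_ratio(normal, tumor) >= min_ratio)


# Illustrative values only, not data from the tables.
normal = [100.0] * 12   # 12 non-malignant breast specimens
tumor = [20.0] * 73     # 73 breast cancers
print(down_regulation_ratio(normal, tumor))
print(passes_table23_filter(normal, tumor))
```

With these made-up values the ratio is 5.0 and both thresholds are met, so the probeset would be retained.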



Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of 50th percentile normal body tissue to 75th percentile tumor



Pkey ExAccn UnigeneID UnigeneTitle Ratio

428722 U76456 Hs.190787 tissue inhibitor of metalloproteinase 4 21.4
428848 NM_000230 Hs.194236 leptin (murine obesity homolog) 17.4
445263 H57646 Hs.42586 KIAA1560 protein 15.4
418935 T28499 Hs.89485 carbonic anhydrase IV 15.0
407228 M25079 Hs.155376 hemoglobin, beta 14.6
417511 AL049176 Hs.82223 chordin-like 14.6
439285 AL133916 Hs.172572 hypothetical protein FLJ20093 14.3
412442 AI983730 Hs.26530 serum deprivation response (phosphatidyl 13.6
410544 AI446543 Hs.95511 ESTs 12.6
412047 AA934589 Hs.49696 ESTs 12.2
422667 H25642 Hs.133471 ESTs 12.0
405664 L34041 Hs.9739 glycerophosphate dehydrogenase 1 (so 12.0
423201 NM_000163 Hs.125180 growth hormone receptor 11.7
422163 AF027208 Hs.112360 prominin (mouse)-like 1 10.8
428769 AW207175 Hs.106771 ESTs 10.6
407049 X72632 NM_021724*:Homo sapiens nuclear receptor 10.1
412295 AW088826 Hs.117176 poly(A)-binding protein, nuclear 1 9.8
425126 N32759 Hs.172944 chorionic gonadotropin, beta polypeptide 9.8
406791 AI220684 Hs.272572 hemoglobin, alpha 2 9.5
447471 AF039843 Hs.18676 sprouty (Drosophila) homolog 2 9.5
451533 NM_004657 Hs.26530 serum deprivation response (phosphatidyl 9.4
419407 AW410377 Hs.41502 hypothetical protein FLJ21276 9.0
411939 AI365585 Hs.146246 ESTs 9.0
410532 T53088 Hs.155376 hemoglobin, beta 8.9
425707 AF115402 Hs.11713 E74-like factor 5 (ets domain transcript 8.8
416585 X54162 Hs.79386 leiomodin 1 (smooth muscle) 8.7
443060 D78874 Hs.8944 procollagen C-endopeptidase enhancer 2 8.6
435265 AA779958 Hs.185932 ESTs 8.5
422511 AU076442 Hs.117938 collagen, type XVII, alpha 1 8.4
433138 AB029496 Hs.59729 semaphorin sem2 8.3
402195 NM_004497*:Homo sapiens hepatocyte nucle 8.1
429350 AI754634 Hs.131987 ESTs 8.1
445107 AI208121 Hs.147313 ESTs, Weakly similar to I38022 hypotheti 8.1
406643 N77976 Hs.272572 hemoglobin, alpha 2 8.0
410199 AW377424 Hs.205126 Homo sapiens cDNA: FLJ22667 fis, clone H 8.0
417225 AA815048 Hs.24078 hypothetical protein FLJ12649 7.8
437569 AA760849 Hs.294052 ESTs 7.5
436062 AK000027 Hs.98633 ESTs 7.5
425078 NM_002599 Hs.154437 phosphodiesterase 2A, cGMP-stimulated 7.5
430327 AW973636 Hs.55931 ESTs 7.4
447577 AI393693
446039 AI150491
422060 R20893
424455 AA452006
424343 AW956360
442792 AI352340
406714 AI219304
407571 AI446183
429580 AA346839
453500 AI478427
422233 AB002058
420205 AA256395
404368 NA
447261 NM_006691
417090 AA193282
410677 NM_003278
427283 AL119796
415011 AW963085 
412068 S72043 
416253 BE250659 
435885 AA701483 
402779 NA 
418138 AA213626 
439335 AA742697 
427019 AA001732 
411478 BE143068 
452654 BE004783
447359 NM_012093
414323 NM_014759
441266 H15968 
417011 F08212 
400089 NA 
433614 W07475 
440439 N92818 
454404 BE067414 
436704 AA062610 
406563 NA 
433490 AW451023 
419313 AA843387 
409196 NM_001874
410882 AW809163 
453469 AB014533 
441899 AI372588 
426210 AA372052 
413065 BE063555 
454192 AW876813 
425187 AW014486 
429757 AW452355 
420202 AL036557 
416284 AI695473
428553 AA181641
404689 NA
438887 R68857 
406082 S47833 
449748 H23963 
431048 R50253 
452205 C15819 
434040 AW444613 
407744 AB020629 
450606 AI668605
414629 AA345824 
401665 

436107 T99079 
444432 AI161428 
434715 BE005346 
447205 BE617015 
408122 AI432652



Hs.183297 

Hs.90756 

Hs.325823

Hs.333199 

Hs.4748 

Hs.131194 

Hs.283108 

Hs.9572 

Hs.209100 

Hs.43125

Hs.113275

Hs.88156 

Hs.17917 
Hs.85863 
Hs.65424 
Hs.174185 

Hs.73133 
Hs.15463 
Hs.36341 

Hs.136204 

Hs.62492 

Hs.173233 



Hs.18268 
Hs.334688 
Hs.293845
Hs.234898 

Hs.277101 
Hs.64754 

Hs.148050 

Hs.65848 
Hs.87279 
Hs.334873 

Hs.33010 

Hs.8022 

Hs.334559 

Hs.3343 

Hs.22509 

Hs.256037

Hs.95910 

Hs.298006 

Hs.184907

Hs.265499 
Hs.82927 
Hs.32043 
Hs.249129 

Hs.288809 
Hs.38095 
Hs.60380 
Hs.76688 

Hs.191194 
Hs.75916 
Hs.116410
Hs.11006
Hs.42824 



DKFZP566F2124 protein 
ESTs 

ESTs, Moderately similar to ALU5_HUMAN A
ESTs 



ESTs 

hemoglobin, gamma G 

ESTs, Highly similar to CYA5_HUMAN ADENY 

DKFZP434C171 protein 

esophageal cancer related gene 4 protein 

purinergic receptor P2X-like 1, orphan r

ESTs

ENSP00000241075*:TRRAP PROTEIN,
extracellular link domain-containing 1
ESTs, Weakly similar to B34612 zinc fing
tetranectin (plasminogen-binding protein
ectonucleotide pyrophosphatase/phosphodi



7.4 
7.2 
7.2 
7.1 
7.1 
7.0 
6.9 
6.8 
6.7 
6.7 
6.7 
6.6 
6.6 
6.5 
6.5 
6.5 
6.5 



gb:EST375158 MAGE resequences, MAGH Homo 6.4
metallothionein 3 (growth inhibitory fac 6.4

Homo sapiens, clone IMAGE:3959994, mRNA 6.4

ESTs 6.3

Target Exon 6.3

EST 6.3

ESTs, Weakly similar to B39066 prolines 6.3

hypothetical protein FLJ10970 6.2
gb:MR0-HT0158-030200-003-b09 HT0158 Homo 6.2
gb:MR2-BN0114-270400-004-e11 BN0114 Homo 6.1

adenylate kinase 5 6.1

KIAA0273 gene product 6.1

Homo sapiens, clone IMAGE:3502329, mRNA, 6.1

ESTs, Weakly similar to 2109260A B cell 6.0

Eos Control 6.0

cytochrome c oxidase subunit IV isoform 5.9

ESTs, Weakly similar to potential COS (H 5.9
gb:MR4-BT0355-200100-201-e05 BT0355 Homo 5.9

EST 5.9

Target Exon 5.9

hypothetical protein DKFZp7610132 5.9

ESTs 5.9

carboxypeptidase M 5.8
gb:MR4-ST0118-261099-012-a03 ST0118 Homo 5.8

KIAA0633 protein 5.8

TU3A protein 5.8

Homo sapiens cDNA FLJ14458 fis, clone HE 5.8
gb:CM1-BT0283-081199-033-d09 BT0283 Homo 5.8

phosphoglycerate dehydrogenase 5.7 

ESTs 5.7 

ESTs 5.7 

putative lymphocyte G0/G1 switch gene 5.7 

ESTs 5.7 

G protein-coupled receptor 1 5.6 

Target Exon 5.6 

ESTs 5.6 

adenosine monophosphate deaminase 2 (iso 5.6 

ESTs 5.6 

cell death-inducing DFFA-like effector a 5.5

gb:C15819 Clontech human aorta polyA mRN 5.5

hypothetical protein FLJ20159 5.5

ATP-binding cassette, sub-family A (ABC1 5.5

ESTs, Moderately similar to ALU6_HUMAN A 5.5

carboxylesterase 1 (monocyte/macrophage 5.5

C11000703:gi|10048448|ref|NP_065258.1| g 5.5

ESTs 5.5

splicing factor 3b, subunit 2, 145kD 5.5

ESTs 5.5

ESTs, Moderately similar to T17372 plasm 5.5

hypothetical protein FLJ10718 5.5
454016
414913 
459033 
441003 
450637 
442398 
403612 
407102 
410057 
428232 
432769 
431344 
427032 
406305 
437411 
442800 
402054 
432085 
415313 
459159 
427164 
441391 
458959 



401810 
438879 
414657 
427809 
456063 
451186 
451882 
402583 
431130 
458218 
416083 
455282 
426488 
426156 
407891 
408610 
445967 
434813 
437526 
454775 
409451 



405062 
446490 
417622 
421978 
440338 
415421 
417574 
409882 
447998 
445613 
443074 
451324 
432433 
449654 
414519 
457531 
433200 
430782 
427555 
433545 
420334 



AW016806 
R25621 
AA017590 
BE172240 
N49826 
AA994520 
NA 

AA007629 
R66634 
BE272452 
AA620814 
R99530 
AF012023 
BE261320 
AW613948 
AI809481 
NA 

AF212829 
R59638 
AI904646
AB037721 
BE467930 
AI285901 
NA 
NA 

AA827674 

AA424074 

M26380 

NM_006744

AW023469 

AI821324 

NA 

NM_006103

AI435179 

R53467 

BE143867 

X03350 

BE244537 

AA486620 

AW026692 

D59597 

AI524307 

AI076012 

BE160229 

AF012626 

AW502327 



Hs.233108 

Hs.129907 
Hs.126379 
Hs.18602 



ESTs 5.5 
gb:yh45R)6.r1 Soares placenta Nb2HP Homo 5.4 
ESTs 5.4 
ESTs, Weakly similar to I38022 hypotheti 5.4 
ESTs 5.4 
gb:ou42g09.s1 Soares_NFL_T_GBC_S1 Homo s 5.4



AK000706 

AW298163 

AJ243662 

R62431 

R35009 

R00348 

AJ243191 

AI768289 

BE550889 

AW341470 

AI783600

AW014734 

AI989812 

N94587 

AW973716 

AA682722 

AF026263 

AW137094 

AA868510 

AI349351 



Hs.9739 

Hs.268107 

Hs.183109 

Hs.144959 

Hs.272572 

Hs.173274 

Hs.158196 

Hs.194915 

Hs.131227 

Hs.272406 
Hs.6181 

Hs.173871 
Hs.170381 
Hs.181297 



Hs.189073 

Hs.76780 

Hs.180878 

Hs.76461 

Hs.65256 

Hs.100445

Hs.2719 

Hs.126820 

Hs.269122 

Hs.4 

Hs.167382 
Hs.41135 
Hs.224829 
Hs.118821 
Hs.162870 
Hs.121388 

Hs.54472 



5.3 
5.3 
5.3 
5.3 
5.3 
5.3
5.3
5.3
5.3
5.3
5.3
5.3
5.2



Target Exon 

glycerophosphate dehydrogenase 1 (so 
multimerin 

monoamine oxidase A 
ESTs 

hemoglobin, alpha 2 

integrin cytoplasmic domain-associated p 
transcriptional adaptor 3 (ADA3, yeast h
ESTs 
ESTs 

Target Exon 

potassium channel, subfamily K, member 9 
ESTs

gb:QV-BT065-020399-103 BT065 Homo sapien 5.2
KIAA1300 protein 5.2
ESTs 5.2
ESTs 5.2
ENSP00000251335*:DJ1003J2.1 (sodium and 5.2
Target Exon 5.2
ESTs 5.2
protein phosphatase 1, regulatory (inhib 5.2
lipoprotein lipase 5.1
retinol-binding protein 4, interstitial 5.1
ESTs, Weakly similar to leucine-rich gli 5.1
ESTs 5.1
NM_021620:Homo sapiens PR domain contain 5.1
HE4; WFDC2; putative ovarian carcinoma m 5.1
ESTs 5.1
ESTs, Weakly similar to ALU1_HUMAN ALU S 5.1
gb:lW04TO16447010Q-013-h02 HT0164 Homo5.1 

5.1 
5.1 
5.0 
5.0 
5.0 
5.0 



alcohol dehydrogenase 1B (class I), beta 
natriuretic peptide receptor A/guanylate 
endomucin-2 
ESTs 

CGI-62 protein 
ESTs 

ESTs, Weakly similar to MDHC_HUMAN MALAT 5.0 
gb:QV1-HT0413-090200-062-a12 HT0413 Homo 5.0 

fragile X mental retardation 2 5.0 
gb.tJI-HF-BR0p-aka^)7-0-Ul.r1 NIH_MGC_5 5.0 

Target Exon 5.0 

Hs.15125 hypothetical protein FLJ20699 5.0 

Hs.82318 WAS protein family, member 3 5.0 

Hs.110196 NICE-1 protein 5.0 

Hs.12758 ESTs 5.0 

Hs.24903 ESTs 5.0 

gb.7e69e06.rl Soares fetal liver spleen 5.0 

Hs.56874 heat shock 27kD protein family, member 7 5.0 

Hs.304389 ESTs 4.9 

Hs.158491 ESTs 4.9 

Hs.144907 ESTs 4.9 

Hs.208052 ESTs 4.9 

Hs.157969 ESTs 4.9 

Hs.199850 ESTs 4.9 

Hs.55063 ESTs 4.9 

Hs.13913 KIAA1577 protein 4.9 

Hs.192725 ESTs 4.9 

Hs.247920 cholinergic receptor, muscarinic 5 4.8 

Hs.97990 ESTs 4.8 

Hs.112496 ESTs 4.8 

Hs.118944 hypothetical protein FLJ22477 4.8 



421795 X63094 
427138 N77624 
413072 BE063965 
443721 AW450451 
408053 AW139474 
427067 AA843716 
442969 AI025499 
426220 AI383475 
414593 BE386764 
426893 AA398716 
434046 AW292618 
401590 NA 
457971 AW134679 
427722 AK000123 
443793 AA045290 
407737 R49187 
441955 AA972327 
441499 AW298235 
447517 AI382726 
403017 

450580 N40087 
404611 H58589 
414831 M31158 
459290 NM_001546 
444341 AI142027 
408614 AL137698 
449638 AW204277 
434418 AF134707 
447360 AI375984 
419583 F00312 
440698 AI348455 
451199 AI290653 
438338 NMJM861 
433756 AW015933 
423301 S67580 
417237 H86385 
439745 AL389981 
424137 AA335769 
449338 H73444 
434744 N94835 
407402 AF035303 
443510 NM_012190 
415754 AA169114 
415986 Z43619 
457416 BE142052 
418064 BE387287 
437120 AI356125 
453950 AA156998 
401093 

436935 AW206494 
457974 AW842353 
428222 AL133112 
442705 AI264634 
437409 AL359599 
458494 AI380906 
410490 H03589 
416069 R37101 
438463 AA807958 
444043 AI499723 
438327 H87407 
457711 AF147401 
400870 

441425 AA933590 
416267 H45384 
407262 M12873 
444567 AV654020 
403263 NA 



Hs.283822 
Hs.173717 

Hs.266355 
Hs.246862 
Hs.177927 
Hs.132238 
Hs.171697 

Hs.97418 
Hs.113011 

Hs.242849 

Hs.180479 

Hs.25930 

Hs.6659 

Hs.142903 

Hs.101689 

Hs.182434 

Hs.15248 

Hs.35156 

Hs.77439 

Hs.34853 

Hs.146650 

Hs.46531 

Hs.250723 

Hs.278679 

Hs.167216 

Hs.147492 

Hs.124758 

Hs.6168 

Hs.112654 

Hs.1645 

Hs.81737 

Hs.149219 

Hs.16262 

Hs.394 

Hs.283828 

Hs.9520 
Hs.12247 

Hs.62654 
Hs.83384 
Hs.157767 
Hs.211568 

Hs.253560 
Hs.321717 
Hs.183085 
Hs.131127 
Hs.283850 
Hs.158436 

Hs.20982 
Hs.314232 
Hs.135089 
Hs.172944 
Hs.23917 

Hs.28937 



Rhesus blood group, D antigen 4.8 

phosphatidic acid phosphatase type 2B 4.8 
gb:QV3-BT0296-140200-085-h01 BT0296 Homo 4.8 

ESTs 4.8 

ESTs 4.8 

ESTs 4.7 

ESTs 4.7 

ESTs, Weakly similar to T13924 sdk prote 4.7 

gb£01273249F1 NIH_MGC_20 Homo sapiens c 4.7 

ESTs 4.7 

ESTs 4.7 

Target Exon 4.7 

ESTs 4.7 

hypothetical protein FLJ20116 4.6 

ESTs, Weakly similar to 2109260A B cell 4.6 

ESTs 4.6 

ESTs 4.6 

ESTs 4.6 

ESTs 4.6 

Target Exon 4.6 

ESTs 4.6 

Homo sapiens cDNA FLJ11027 fis, clone PL 4.6 

protein kinase, cAMP-dependent, regulato 4.6 

inhibitor of DNA binding 4, dominant neg 4.6 

ESTs 4.6 
Homo sapiens mRNA; cDNA DKFZp434C1915 (f 4.6 

hypothetical protein MGC2747 4.6 

a disintegrin and metalloproteinase doma 4.6 

ESTs 4.6 
gb:HSBB0D101 STRATAGENE Human skeletal m 4.6 

Homo sapiens cDNA FLJ11777 fis, clone HE 4.6 

ESTs 4.6 

KIAA0703 gene product 4.6 

Homo sapiens, clone MGC:9764, mRNA, comp 4.5 

cytochrome P450, subfamily IVA, polypept 4.5 

palmitoyl-protein thioesterase 2 4.5 

Homo sapiens mRNA full length insert cDN 4.5 

ESTs 4.5 

adrenomedullin 4.5 

Homo sapiens genomic DNA, chromosome 21q 4.5 

gb:Homo sapiens clone 23943 mRNA sequenc 4.5 

formyltetrahydrofolate dehydrogenase 4.5 

hypothetical protein FLJ11413 4.5 

gb:HSC1GE121 normalized infant brain cDN 4.5 

kringle-containing transmembrane protein 4.5 

S100 calcium-binding protein, beta (neur 4.4 
ESTs, Weakly similar to HXA2_HUMAN HOMEO 4.4 

eukaryotic translation initiation factor 4.4 

C12000586:gi|6330167|dbj|BAA86477.1| (A 4.4 

ESTs 4.4 

ESTs, Weakly similar to S22765 heterogen 4.4 
Homo sapiens mRNA; cDNA DKFZp434K098 (fr 4.4 

ESTs 4.4 
Homo sapiens mRNA; cDNA DKFZp547C126 (fr 4.4 



ESTs 



Hs.184261 



gb^42d08.r1 Soares placenta Nb2HP Homo 
ESTs 
ESTs 
ESTs 

chorionic gonadotropin, beta polypeptide 
ESTs 

C11000905:gi|11692565|gb|AAG39879.1|AF28 4.3 

homeobox protein from AL590526 4.3 

gb:yn99c10.r1 Soares adult brain N2b5HB5 4.3 

gb:Human Ig rearranged H-chain mRNA VDJ4 4.3 

ESTs, Weakly similar to T26686 hypotheti 4.3 

Target Exon 4.3 



4.4 
4.4 
4.4 
4.4 
4.4 
4.4 
4.3 
410034 BE067414 gb:MR4-BT0355-200100-201-e05 BT0355 Homo 4.3 
456804 AI421645 Hs.139851 caveolin 2 4.3 
448427 BE395260 Hs.309438 EST 4.3 
416931 D45371 Hs.80485 adipose most abundant gene transcript 1 4.3 
421296 NM_002666 Hs.103253 perilipin 4.3 
400973 NA ENSP00000236667*:Mucin 5B (Fragment). 4.3 
452602 AW366194 Hs.55962 ESTs 4.3 
412330 NM_005100 Hs.788 A kinase (PRKA) anchor protein (gravin) 4.3 
405016 CY00017r:gi|9280405|gb|AAF86402.1|AF245 4.3 
435104 AI475671 Hs.88607 ESTs, Highly similar to F-box protein FB 4.3 
406118 ENSP00000246632:CDNA FLJ20261 fis, clone 4.3 
418556 T02850 gb:FB12A9 Fetal brain, Stratagene Homo s 4.3 
429745 AA480818 Hs.221736 ESTs 4.3 
433088 AW451206 Hs.115899 ESTs 4.3 
444445 AA342329 Hs.115920 Homo sapiens cDNA: FLJ22816 fis, clone K 4.3 
453880 AI803166 Hs.28462 ESTs, Weakly similar to I38022 hypotheti 4.3 
447384 AI377221 Hs.40528 ESTs 4.2 
414541 BE293116 Hs.76392 aldehyde dehydrogenase 1 family, member 4.2 
444975 AV652165 Hs.182482 ESTs, Weakly similar to T00362 hypotheti 4.2 
403921 NA C5000212*:gi|10047237|dbj|BAB13407.1| (A 4.2 
451477 AI798425 Hs.42710 ESTs 4.2 
406344 C5001660:gi|11611537|dbj|BAB18935.1| (AB 4.2 
416970 AA191201 Hs.35861 DKFZP586E1621 protein 4.2 
413662 BE155866 Hs.25522 KIAA1808 protein 4.2 
458504 AW070634 Hs.144794 ESTs 4.2 
404682 NA C9001188*:gi|12738842|ref|NP_073725.1| p 4.2 
418089 N69913 Hs.6858 ESTs, Weakly similar to I78885 serine/th 4.2 
403433 NM_001622:Homo sapiens alpha-2-HS-glycop 4.2 
446532 AW975460 Hs.143563 ESTs 4.2 
414217 AI309298 Hs.279898 Homo sapiens cDNA: FLJ23165 fis, clone L 4.2 
418425 AI871247 Hs.6262 hypothetical protein MGC8407 4.2 
419589 AW973708 Hs.201925 Homo sapiens cDNA FLJ13446 fis, clone PL 4.2 
457029 AA397789 Hs.161803 ESTs 4.2 
447860 AF193807 Hs.131835 Rhesus blood group, B glycoprotein 4.2 
448988 Y09763 Hs.22785 gamma-aminobutyric acid (GABA) A recepto 4.2 
440610 AI733098 Hs.130800 ESTs 4.2 
439590 AF086410 gb:Homo sapiens full length insert cDNA 4.2 
427240 AA399975 Hs.274151 ligatin 4.2 
408932 AW594172 Hs.278513 TP53TG3 protein 4.2 
436112 T77545 Hs.187559 ESTs 4.2 
444382 AI144152 Hs.58246 ESTs 4.2 
456716 AA318060 Hs.135121 hypothetical protein FLJ22415 4.2 
419846 NM_015977 Hs.285681 Williams-Beuren syndrome chromosome regi 4.2 
410036 R57171 Hs.57975 calsequestrin 2 (cardiac muscle) 4.1 
400545 NA Target Exon 4.1 
403051 NA Target Exon 4.1 
420139 NM_005357 Hs.95351 lipase, hormone-sensitive 4.1 
450244 AA007534 Hs.125062 ESTs 4.1 
453281 AA034116 Hs.118494 ESTs 4.1 
440246 W52010 Hs.191379 ESTs 4.1 
414516 AI307802 Hs.135560 ESTs, Weakly similar to T43458 hypotheti 4.1 
438232 AI150595 Hs.122226 ESTs 4.1 
410233 AA082947 gb:zn10g07.s1 Stratagene hNT neuron (937 4.1 
412179 BE270758 Hs.69428 hypothetical protein MGC3020 4.1 
441871 AI306150 Hs.153450 ESTs, Weakly similar to 1909123ANagfuc 4.1 
426411 AK000708 Hs.169764 hypothetical protein FLJ20701 4.1 
453692 AL110416 gb:DKFZp434K0431_r1 434 (synonym: htes3) 4.1 
448640 AW817177 Hs.102558 Homo sapiens, clone MGC5352, mRNA, comp 4.1 
417481 AA203281 Hs.21798 ESTs 4.1 
412912 AW118878 Hs.110835 ESTs 4.1 
454183 AW807116 gb:MR4-ST0062-O4010(W)24-b12 ST0062 Homo 4.1 
426328 AW631296 gb:hh83o09.y1 NCI_CGAP_GU1 Homo sapiens 4.1 
435942 R06285 Hs.191215 ESTs 4.1 
417629 T76945 gb:yc92c07j1 Soares infant brain 1NIB H 4.1 
403593 NA Target Exon 4.0 
402690 Target Exon 4.0 
418190 R49591 Hs.270425 ESTs 4.0 
408641 AW245207 Hs.5555 hypothetical protein MGC5347 4.0 
427699 AA829286 Hs.332053 serum amyloid A1 4.0 
445975 AI811536 Hs.145734 ESTs 4.0 
438831 BE263273 Hs.6439 synapsin II 4.0 
455578 BE006350 Hs.14355 Homo sapiens cDNA FLJ13207 fis, clone NT 4.0 
401840 NA Target Exon 4.0 
413753 U17760 Hs.75517 laminin, beta 3 (nicein (125kD), kalinin 4.0 
445030 AI205925 Hs.147238 ESTs, Highly similar to AAC3_HUMAN ALPHA 4.0 
433873 AW156913 Hs.150478 ESTs, Weakly similar to A Chain A, Cryst 4.0 
456736 AW248217 Hs.1619 achaete-scute complex (Drosophila) homol 4.0 
450112 BE047734 Hs.5473 ESTs, Moderately similar to ALU5_HUMAN A 4.0 
448906 AI589567 Hs.309719 ESTs 4.0 

TABLE 23A 

Table 23A shows the accession numbers for those pkeys lacking UnigeneID's for Table 23. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT number Accession 

409853 1156226_1 AW502327 AW502488 AW501829 AW502625 AW502687 
410034 1170594_1 BE067414 BE067958 BE067419 BE067953 AW577127 AW601412 
410233 118656_1 AA082947 AA083036 
410490 1205347_1 H03589 AW750687 AW750688 
410882 1225686_1 AW809163 AW809247 AW809177 AW809190 AW809225 
411478 1247073_1 BE143068 AW849143 AW848705 AW848569 AW848071 AW848475 AW848092 AW848005 
413065 1347960_1 BE063555 BE151321 BE151319 BE151657 BE151655 BE063556 BE151322 
413072 1348163_1 BE063965 BE063968 BE064034 BE064028 BE063874 BE063966 BE063869 BE064043 BE064033 BE063884 
414593 1464909_1 BE386764 BE387560 
414913 1506721_1 R25621 C03959 C04010 
415011 151328_1 AW963085 M159005 AW963073 
415986 1564410_1 Z43619 R61274 H12206 R12883 
416267 1583547_1 H45384 H49125 H41699 
417574 1687770_1 R00348 R09593 
417629 1690392_1 T76945 R20210 R05755 
418556 1767866_-1 T02850 
419583 186198_1 F00312 AA247490 F31427 AA383663 F22045 
426328 264901_1 AW631296 AA375464 
439590 47413_1 AF086410 W94386 W74609 
442398 541271_1 AA994520 AW393574 
452205 90415_1 C15819 AA024741 AA024742 
452654 925931_1 BE004783 BE004947 AI911790 
453692 977825_1 AL110416 AW876759 
454183 1049636_1 AW807116 AW807569 AW807415 AW807338 AW807288 AW807263 AW807316 AW177402 AW807413 AW807068 BE141561 
BE141569 AW807401 AW807310 BE141565 AW807318 AW807119 AW807299 AW807241 AW807225 AW807204 AW807345 
AW807103 BE141615 AW807431 AW807393 AW807337 AW807406 AW807259 AW807375 AW845890 AW807220 AW807399 
AW807064 AW807376 AW807024 BE141595 AW807236 AW807027 AW807112 AW807378 AW807202 BE141593 AW807216 
AW807138 AW807244 AW807221 AW807297 AW807050 AW807248 AW807404 AW807075 AW807237 AW807212 AW807308 
AW807110 AW807104 BE140912 AW807301 AW807382 AW807294 AW807026 AW807020 AW807108 AW807025 AW807433 
AW807124 AW807419 AW807031 AW807264 AW807032 AW807029 AW807052 AW807391 AW807207 AW807215 AW807019 
AW807238 AW807201 BE141590 AW807302 AW807323 AW807380 AW807109 BE141588 AW845877 AW807418 AW807407 
AW807309 BE141614 AW845861 AW807396 AW807300 AW807348 AW807311 AW807214 AW807132 AW807402 AW807350 
AW807028 AW807298 AW807291 AW807305 AW807217 AW807312 AW807261 AW807268 AW807260 AW807266 AW807198 
AW807346 AW807315 BE140940 AW807351 AW807122 AW807235 AW807076 AW807314 AW807213 AW807143 AW807131 
AW800079 AW807066 AW807077 AW845923 AW807421 BE141619 BE140943 AW807420 AW807381 BE141621 AW845921 
BE141629 BE141625 BE141624 BE141636 BE141630 AW807405 AW807290 AW807353 
454404 1170594_1 BE067414 BE067958 BE067419 BE067963 AW577127 AW601412 
454775 1234106_1 BE160229 AW819879 AW820179 AW819882 AW819876 AW820169 BE153201 AW993736 BE152911 
455282 1273020_1 BE143867 AW935060 AW886684 
459159 919998_1 AI904646 BE179494 BE179421 

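As an illustration of how the cluster table above might be consumed, the following is a minimal Python sketch, assuming each row is whitespace-delimited with the Pkey first, the CAT number second, and one or more Genbank accessions after that. The names `ClusterRow`, `parse_cluster_row`, and `index_by_accession` are hypothetical and are not part of the original disclosure.

```python
from typing import NamedTuple


class ClusterRow(NamedTuple):
    """One row of a Pkey / CAT number / Accession table (hypothetical helper type)."""
    pkey: str
    cat_number: str
    accessions: list


def parse_cluster_row(line: str) -> ClusterRow:
    """Split a whitespace-delimited row into Pkey, CAT number, and accessions."""
    fields = line.split()
    if len(fields) < 3:
        raise ValueError(f"expected Pkey, CAT number, and at least one accession: {line!r}")
    return ClusterRow(fields[0], fields[1], fields[2:])


def index_by_accession(rows):
    """Map each accession to the list of Pkeys whose cluster contains it."""
    index = {}
    for row in rows:
        for accession in row.accessions:
            index.setdefault(accession, []).append(row.pkey)
    return index


# Two rows written in the assumed "Pkey CAT_number accessions..." layout.
rows = [
    parse_cluster_row("414593 1464909_1 BE386764 BE387560"),
    parse_cluster_row("417574 1687770_1 R00348 R09593"),
]
lookup = index_by_accession(rows)
print(lookup["R09593"])
```

Indexing by accession is only one possible design; it makes it easy to ask which probeset a given EST was clustered into.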
TABLE 23B 

Table 23B shows the genomic positioning for those pkeys lacking UnigeneID's and 
accession numbers in Table 23. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 
Pkey: Unique number corresponding to an Eos probeset 
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication 
entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 
Nt_position: Indicates nucleotide positions of predicted exons. 

Pkey Ref Strand Nt_position 
400545 9800107 Minus 124618-124881 
400870 9838306 Minus 3408145027 
400973 7960452 Minus 98119-98253 
401093 8516137 Minus 22335-23166 
401590 9966320 Minus 33547-33649 
401665 7145001 Plus 121591-122537 
401810 7342191 Plus 129063-129476 
401840 7684597 Plus 56283-56439 
402054 8083691 Minus 82884806 
402195 7689778 Minus 147901-148884 
402583 7684486 Plus 94883-95003 
402690 8348056 Plus 13368-13998 
402698 8570304 Minus 108641-108903 
402779 9586555 Minus 38173-39210 
403017 6693623 Plus 78630-79367 
403051 4827080 Minus 52694411 
403263 7770677 Plus 52431-52737 
403433 9719611 Minus 72225-72437 
403593 6862650 Minus 62554-62712,6944949602 
403612 8469060 Minus 94723-94859 
403921 7711590 Minus 32974536 
404368 7630956 Minus 102053-102199 
404682 9797231 Minus 40977-41150 
404689 7534100 Plus 119461-119717 
405016 6524300 Plus 5199743308 
405062 7657730 Plus 101283-101432 
406118 9143818 Plus 53997-54629 
406344 9255974 Plus 20254-20374,20526-20659,20835-21097 
406563 7711604 Plus 3440144538 

TABLE 24: 

Table 24 depicts Seq ID No., UnigeneID, UnigeneTitle, Pkey, Pred.Cell.Loc, and ExAccn 
for all of the sequences in Table 25. The information in Table 24 is linked by Seq ID No. to 
Table 25. 



Pkey: Unique Eos probeset identifier number 
ExAccn: Exemplar Accession number, Genbank accession number 
UnigeneID: Unigene number 
UnigeneTitle: Unigene gene title 
Pred.Cell.Loc.: Predicted Cellular Localization 
Seq.ID.No.: Sequence Identification Number found in Table 25 

Pkey ExAccn UnigeneID UnigeneTitle Pred.Cell.Loc. Seq.ID.No. 



449746 
407276 
415539 
400297 
450375 
102457 
429170 
424399 
422505 
449765 
425692 
426215 
439840 
410102 
429220 
416276 
409079 
442818 
442082 
444381 
446163 
416636 
442117 
433043 
429353 
452190 
446733 
452747 
423242 
417433 
432201 
423961 
439569 
114480 
404561 
325372 
112287 
335824 
424735 



427585 
429925 
429441 
421155 
420931 
420813 
452744 



AI668594 Hs.176588 
AI951118 Hs.326736 
AI733S81 Hs.72472 
AI127076 Hs.334473 
AA009647 Hs.8850 
NM_001394 Hs.2359 
NM_001394 Hs.2359 
AI905687 Hs.2533 
AL120862 Hs.124165 
N92293 Hs.206832 
D90041 Hs.155956 
AW963419 Hs.155223 
AW449211 Hs.105445 
AW248508 Hs.279727 
AW207206 Hs.136319 
U41060 Hs.79136 
W87707 Hs.82065 
AK001741 Hs.8739 
R41823 Hs.7413 
BE387335 Hs.283713 
AA026880 Hs.25252 
N32536 Hs.42645 
AW664964 Hs.128899 
W57554 Hs.125019 
AL117406 Hs.200102 
H26735 Hs.91668 
AA863360 Hs.26040 
BE153855 Hs.61460 
AL039402 Hs.125783 
BE270266 Hs.82128 
AI538613 Hs.298241 
D13666 Hs.136348 
AW602166 Hs.222399 
BE066778 Hs.151678 

NA 

AB033064 Hs.334806 
NA 

U31875 Hs.272499 

X07820 Hs.2258 

D31152 Hs.179729 
NM_000786 Hs.226213 

AJ224172 Hs.204096 

H87879 Hs.102267 

AF044197 Hs.100431 

X51501 Hs.99949 

AI267652 Hs.30504 



ESTs, Weakly similar to CP4Y_HUMAN CYTOC 
Homo sapiens breast cancer antigen NY-BR 
BMP-R1B 

hypothetical protein DKFZp56401278 

a disintegrin and metalloproteinase doma 

dual specificity phosphatase 4 nuclear 

dual specificity phosphatase 4 nuclear 

aldehyde dehydrogenase 9 family, member cytoplasm 

ESTs 

ESTs, Moderately similar to ALU8_HUMAN A 

N-acetyltransferase 1 (arylamine N-acety 

stanniocalcin 2 

GDNF family receptor alpha 1 

Homo sapiens cDNA FLJ14035 fis, clone HE 

ESTs 

LIV-1 protein, estrogen regulated 
interleukin 6 signal transducer (gp130, 
hypothetical protein FLJ10879 
ESTs 

ESTs, Weakly similar to S64054 hypotheti 
Homo sapiens cDNA FLJ13603 fis, clone PL 
solute carrier family 16 (monocarboxylic 
ESTs 

lymphoid nuclear protein (LAF-4) mRNA 
ATP-binding cassette transporter MRP8 
Homo sapiens clone PP1498 unknown mRNA 
ESTs, Weakly similar to fatty acid omega 
Ig superfamily receptor LNIR 
DEME-6 protein 

5T4 oncofetal trophoblast glycoprotein 
Transmembrane protease, serine 3 
osteoblast specific factor 2 (fasciclin 
CEGP1 protein 

UDP-N-acetyl-alpha-D-galactosamine:polyp 
NM_014112*:Homo sapiens trichorhinophala mitochondria 
Phase 2 & 3 Exons nuclear 
KIAA1238 protein 

ENSP00000249072 t DJ222E13.1 (N-TERMINAL 
short-chain alcohol dehydrogenase family 
matrix metalloproteinase 10 (stromelysin 
collagen, type X alpha 1 (Schmid metaph 
cytochrome P450, 51 (lanosterol 14-alpha 
lipophilin B (uteroglobin family member) 
lysyl oxidase 

small inducible cytokine B subfamily (Cy 
prolactin-induced protein nuclear 
Homo sapiens mRNA; cDNA DKFZp434E082 (fr 
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ER 

extracellular 



Seq ID 1 & 2 
Seq ID 3 & 4 
Seq ID 5 & 6 
Seq ID 7 & 8 
Seq ID 9 & 10 
Seq ID 11 & 12 
Seq ID 11 & 12 
Seq ID 13 & 14 
Seq ID 15 & 16 
Seq ID 17 & 18 
Seq ID 19 & 20 
Seq ID 21 & 22 
Seq ID 23 & 24 
Seq ID 25 & 26 
Seq ID 27 & 28 
Seq ID 29 & 30 
Seq ID 31 & 32 
Seq ID 33 & 34 
Seq ID 35 & 36 
Seq ID 37 & 38 
Seq ID 39 & 40 
Seq ID 41 & 42 
Seq ID 43 & 44 
Seq ID 45 & 46 
Seq ID 47 & 48 
Seq ID 49 & 50 
Seq ID 51 & 52 
Seq ID 53 & 54 
Seq ID 55 & 56 
Seq ID 57 & 58 
Seq ID 59 & 60 
Seq ID 61 & 62 
Seq ID 63 & 64 
Seq ID 65 & 66 
Seq ID 67 & 68 
Seq ID 69 & 70 
Seq ID 71 & 72 
Seq ID 73 & 74 
Seq ID 75 & 76 
Seq ID 77 & 78 
Seq ID 79 & 80 
Seq ID 81 & 82 
Seq ID 83 & 84 
Seq ID 85 & 86 
Seq ID 87 & 88 
Seq ID 89 & 90 
Seq ID 91 & 92 
420757 X78592 Hs.99915 
424905 NM_002497 Hs.153704 
429859 NM_007050 Hs.225952 
446921 AB012113 Hs.16530 
445537 AJ245671 Hs.12844 
428227 AA321649 Hs.2248 
424001 W67883 Hs.137476 
421727 Y13153 Hs.107318 
452838 U65011 Hs.30743 
419667 AU077005 Hs.92208 
414812 X72755 Hs.77367 
426320 W47595 Hs.169300 
422867 L32137 Hs.1584 
411558 AA102670 Hs.70725 
417866 AW067903 Hs.82772 
428398 AI249368 Hs.98558 
431958 X63629 Hs.2877 
428722 U76456 Hs.190787 
412970 AB026436 Hs.177534 
421379 Y15221 Hs.103982 
415752 BE314524 Hs.78776 
444051 N48373 Hs.10247 
451110 AI955040 Hs.265398 



NIMA (never in mitosis gene a)-related k 

protein tyrosine phosphatase, receptor t 

small inducible cytokine subfamily A (Cy 

EGF-like-domain, multiple 6 

small inducible cytokine subfamily B (Cy 

paternally expressed 10 

kynurenine 3-monooxygenase (kynurenine 3 

preferentially expressed antigen in mela 

a disintegrin and metalloproteinase doma 

monokine induced by gamma interferon 

transforming growth factor, beta 2 

cartilage oligomeric matrix protein (pse 

gamma-aminobutyric acid (GABA) A recepto 

collagen, type XI, alpha 1 

ESTs 

cadherin 3, type 1, P-cadherin (placenta 
tissue inhibitor of metalloproteinase 4 
dual specificity phosphatase 10 
small inducible cytokine subfamily B (Cy 
putative transmembrane protein 
activated leucocyte cell adhesion molecu 
ESTs, Weakly similar to transformatiorw 



cytoplasm 
nuclear 


Seq ID 93 & 94 


Seq ID 95 & 96 




Seq ID 97 & 98 


extracellular 


Seq ID 99 & 100 




Seq ID 101 & 102 


extracellular 


Seq ID 103 & 104 




Seq ID 105 & 106 




Seq ID 107 & 108 


nuclear 


Seq ID 109 & 110 




Seq ID 111 & 112 


extracellular 


Seq ID 113 & 114 


extracellular 


Seq ID 115 & 116 


extracellular 


Seq ID 117 & 118 




Seq ID 119 & 120 




Seq ID 121 & 122 




Seq ID 123 & 124 


plasma membrane Seq ID 125 & 126 




Seq ID 127 & 128 




Seq ID 129 & 130 


extracellular 


Seq ID 131 & 132 




Seq ID 133 & 134 




Seq ID 135 & 136 




Seq ID 137 & 138 
TABLE 24A 

Table 24A shows the accession numbers for those pkeys lacking UnigeneID's for Table 24. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT number Accession 

335824 CH22J197FGJ19J1JJNK..E 
325372 c12Jis 
TABLE 24B 

Table 24B shows the genomic positioning for those pkeys lacking UnigeneID's and 
accession numbers in Table 24. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication 
entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 
Nt_position: Indicates nucleotide positions of predicted exons. 

Pkey Ref Strand Nt_position 
404561 9795980 Minus 69039-70100 
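
The nucleotide position field can be read mechanically. The following Python sketch assumes the field is a comma-separated list of `start-end` pairs and that coordinates are 1-based and inclusive; neither convention is stated in the table legend, and the function names `parse_exons` and `total_exon_length` are hypothetical.

```python
def parse_exons(nt_position: str):
    """Parse a position field such as '69039-70100' into (start, end) tuples.

    Assumes comma-separated 'start-end' pairs; raises ValueError on
    malformed entries so damaged fields are not silently accepted.
    """
    exons = []
    for part in nt_position.split(","):
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"start after end in {part!r}")
        exons.append((start, end))
    return exons


def total_exon_length(nt_position: str) -> int:
    """Sum exon lengths, treating both coordinates as inclusive (an assumption)."""
    return sum(end - start + 1 for start, end in parse_exons(nt_position))


# The single predicted exon listed for Pkey 404561.
print(parse_exons("69039-70100"))
print(total_exon_length("69039-70100"))
```

Raising on malformed input is deliberate here: position fields that lost their hyphen during extraction fail loudly instead of being misread as a single number.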



346 



WO 02/059377 



PCT/US02/02242 



Table 25 

The 69 gene sequences identified to be overexpressed in breast cancer may be used to identify coding regions from the public DNA databases (nr and htgs in 
Genbank). The sequences may be used to either identify genes that encode known proteins, or they may be used to predict the coding regions from genomic 
DNA using exon prediction algorithms, such as FGENESH (Salamov and Solovyev, 2000, Genome Res. 10:516-522). 

Seq ID NO: 1 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted ORF 

Coding sequence: 1-1518 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

ATGG AGCCCT CCTGGCTTCA GG AACTCATG GCTCACCCCT TCTTGCTGCT GATCCTCCTC 60 
TGCATGTCTC TGCTGCTGTT TCAGGTAATC AGGTTGTACC AGAGGAGGAG ATGGATGATC 120 
AG AGCCCTGC ACCTGTTTCC TGCACCCCCT GCCCACTGGT TCTATGGCCA CAAGGAGTTT 1 80 
TACCCAGTAA AGGAGTTTGA GGTGTATCAT AAGCTGATGG AAAAATACCC ATGTGCTGTT 240 
CCCTTGTGGG TTGGACCCTT TACGATGTTC TTCAGTGTCC ATGACCCAGA CTATGCCAAG 300 
ATTCTCCTGA AAAGACAAG A TCCCAAAAGT GCTGTTAGCC ACAAAATCCT TGAATCCTGG 360 
GTTGGTCGAG G ACTTGTGAC CCTGGATGGT TCTAAATGGA AAAAGCACCG CCAGATTGTG 420 
AAACCTGGCT TCAACATCAG CATTCTGAAA ATATTCATCA CCATGATGTC TG AGAGTGTT 480 
CGGATG ATGC TGAACAAATG GG AGG AACGC ATTGCOCAAA ACTCACGTCT GGAGCTCTTT 540 
CAACATGTCT CCCTGATGAC CCTGG ACAGC ATCATG AAGT GTGCCTTCAG CCACCAGGGC 600 
AGCATCCAGT TGGACAGTAC CCTGG ACTCA TACCTGAAAG CAGT GTTCAA CCTTAGCAAA 660 
ATCTCCAACC AGCGCATGAA CAATTTTCTA CATCACAACG ACCTGGTTTT CAAATTCAGC 720 
TCTCAAGGCC AAATCTTTTC TAAATTTAAC CAAGAACTTC ATCAGTTCAC AG AG A AAGT A 780 
ATCCAGGACC GGAAGGAGTC TCTTAAGGAT AAGCTAAAAC AAGATACTAC TCAGAAAAGG 840 
CGCTGGGATT TTCTGG ACAT ACTTTTGAGT GCCAAAAGCG AAAACACCAA AGATTTCTCT 900 
GAAGCAGATC TCCAGGCTGA AGTGAAAACG TTCATGTTTG CAGGACATGA CACCACATCC 960 
AGTGCTATCT CCTGGATCCT TTACTGCTTG GCAAAGTACC CTGAGCATCA GCAGAG ATGC 1020 
CGAGATGAAA TCAGGGAACT CCTAGGGGAT GGGTCTTCTA TTACCTGGGA ACACCTGAGC 1080 
CAGATGCCTT ACACCACGAT GTGCATCAAG GAATGCCTCC GCCTCTACGC ACCGGTAGTA 1 140 
AACATATCCC GGTTACTCGA CAAACCCATC ACCTTTCCAG ATGGACGCTC CTTACCTGCA 1200 
GGAATAACTG TGTTTATCAA TATTTGGGCT CTTCACCACA ACCCCTATTT CTGGG AAGAC 1260 
CCTCAGGTCT TTAACCCCTT GAGATTCTCC AGGGAAAATT CTGAAAAAAT ACATCCCTAT 1320 
GCCTTCATAC CATTCTCAGC TGGATTAAGG AACTGCATTG GGCAGCATTT TGCCATAATT 1380 
GAGTGTAAAG TGGCAGTGGC ATTAACTCTG CTCCGCTTCA AGCTGGCTCC AGACCACTCA 1440 
AGGCCTCCCC AGCCTGTTCG TCAAGTTGTC CTCAAGTCCA AGAATGGAAT CCATGTGTTT 1500 
GCAAAAAAAG TTTGCTAATT TTAAGTCCTT TCGTATAAGA ATTAATGAGA CAATTTTCCT 1560 
ACCAAAGGAA GAACXAAAGG AT AAATATAA T ACAAAAT AT ATGTATATGG TTGTTTG AC A 1 620 
AATTATATAA CTTAGGATAC TTCTGACTGG TTTTGACATC CATTAACAGT AATTTTAATT 1680 
TCTTTCCTGT ATCTGGTGAA ACCCACAAAA ACACCTG AAA AAACTCAAGC TGACTTCCAC 1740 
TGCGAAGGGA AATTATTGGT TTGTGTAACT AGTGGTAGAG TGGCTTTCAA GCATAGTTTG 1800 
ATCAAAACTC CACTCAGTAT CTGCATTACT TTTATCTCTG CAAATATCTG CATG ATAGCT 1 860 
TTATTCTC AG TTATCTTTCC CCAATAATAA AA AA 



Seq ID NO: 2 Protein sequence: 

Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 

MEPSWLQELM AHPFLLLILL CMSLLLFQVI RLYQRRRWMI RALHLFPAPP AHWFYGHKEF 60 
YP VKEFEVYH KJLMEKYPCAV PLVWGPFTMF FSVHDPDYAK ILLKRQDPKS AVSHKJLESW 120 
VGRGLVTLDG SKWKKHRQIV KPGFNISELK IFITMMSESV RMMLNKWEER IAQNSRLELF 180 
OHVSLMTLDS IMKCAFSHQG SIQLDSTLDS YLKAVFNLSK ISNQRMNNFL HHNDLVFKFS 240 
SOGQIFSKFN OELHQFTEKV IQDRKESLKD KLKQDTTQKR RWDFLDILLS AKSENTKDFS 300 
EADLOAEVKT FMFAGHDTTS SA1SWILYCL AKYPEHQQRC RDEIRELLGD GSS1TWEHLS 360 
OMPYTTMCIK ECLRLYAPW N1SRLLDKPI TFPDGRSLPA GITVFTNIWA LHHNPYFWED 420 
PQVFNPLRFS RENSEKIHPY AFTPFSAGLR NCIGQHFAII ECKVAVALTL LRFKLAPDHS 480 
RPPQPVRQVV LKSKNGIHVF AKKVC 
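
The relationship between a listed coding sequence range and its protein sequence can be checked with a short script. The sketch below is illustrative only: it assumes the standard genetic code, 1-based inclusive coding coordinates, and a coding range that ends with a stop codon. The function names are hypothetical, and the input is the first 60 nucleotides of Seq ID NO: 1 as printed, with the layout spaces removed.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, cds_start: int, cds_end: int) -> str:
    """Translate dna[cds_start..cds_end] (1-based, inclusive) until a stop codon.

    Whitespace from the printed layout is removed first. Any character
    outside A/C/G/T raises KeyError instead of being skipped, so
    extraction errors cannot silently shift the reading frame.
    """
    sequence = "".join(dna.split()).upper()
    cds = sequence[cds_start - 1:cds_end]
    if len(cds) % 3 != 0:
        raise ValueError("coding range length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def expected_protein_length(cds_start: int, cds_end: int) -> int:
    """Number of residues implied by a coding range that includes the stop codon."""
    return (cds_end - cds_start + 1) // 3 - 1


first_line = "ATGG AGCCCT CCTGGCTTCA GG AACTCATG GCTCACCCCT TCTTGCTGCT GATCCTCCTC"
print(translate(first_line, 1, 60))
print(expected_protein_length(1, 1518))
```

The translated prefix agrees with the first twenty residues of Seq ID NO: 2, and a coding range of 1-1518 implies 505 residues, which matches the length of the protein sequence as listed.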

Seq ID NO: 3 DNA sequence 

Nucleic Acid Accession #: NM_052997 

Coding sequence: 100-4125 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

CTAGTCTATA CCAGCAACGA CTCCTACATC GTCCACTCTG GGGATCTTAG AAAG ATCCAT 60 
AAAGCTGCCT CCCGGGGACA AGTCCGGAAG CTGGAGAAGATG ACAAAGAG GAAGAAGACC 120 
ATCAACCTTA ATATACAAG A CGCCCAGAAG AGGACTGCTC TACACTGGGC CTGTGTCAAT 180 
GGCCATGAGG AAGTAGTAAC ATTTCTGGTA GACAGAAAGT GCCAGCTTGA CGTCCTTGAT 240 
GGCG AACACA GGACACCTCT GATG AAGGCT CTACAATGCC ATCAGGAGGC TTGTGCAAAT 300 
ATTCTG ATAG ATTCTGGTGC CG ATATAAAT CTCGTAGATG TGTATGGCAA CATGGCTCTC 360 
CATTATGCTG TTTATAGTGA GATTTTGTCA GTGGTGGCAA AACTGCTGTC CCATGGTGCA 420 
GTCATCGAAG TGCACAACAA GGCTAGCCTC ACACCACTTT TACTATCCAT AACGAAAAGA 480 
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AGTGAGCAAA TTGTGGAATT TTTGCTG ATA AAAAATGCAA ATGCGAATGC AOTTAATAAG 540 
TATAAATGCA CAGCCCTCAT GCTTGCTGTA TGTCATGG AT CATCAGAGAT AGTTGGCATG 600 
CTTCTTCAGC AAAATGTTGA CGTCTTTGCT GCAGATATAT GTGGAGTAAC TGCAGAACAT 660 
TATGCTGTTA CTTGTGG ATT TCATCACATT CATG AACAAA TTATGGAATA TATAOG AAAA 720 
TTATCTAAAA ATCATCAAAA TACCAATCCA GAAGGAACAT CTGCAGGAAC ACCTG ATGAG 780 
GCTGCACCCT TGGCGG AAAG AACACCTG AC ACAGCTG AAA GCTTGGTGGA AAAAACACCT 840 
GATGAGGCTG CACCCTTGGT GGAAAGAACA CCTGACACGG CTGAAAGCTT GGTGGAAAAA 900 
ACACCTGATG AGGCTGCATC CTTGGTGGAG GGAACATCTG ACAAAATTCA ATGTTTGGAG 960 
AAAGCGACAT CTGGAAAGTT CGAACAGTCA GCAGAAGAAA CACCTAGGGA AATTACGAGT 1020 
CCTGCAAAAG AAACATCTGA GAAATTTACG TGGCCAGCAA AAGGAAGACC TAGGAAGATC 1080 
GCATGGGAGA AAAAAGAAGA CACACCTAGG GAAATTATGA GTCCCGCAAA AGAAACATCT 1 140 
GAGAAATTTA CGTGGGCAGC AAAAGGAAGA CCTAGGAAGA TCGCATGGGA G AAAA AAGM 1200 
ACACCTGTAA AGACTGG ATG CGTGGCAAGA GTAACATCTA ATAAAACTAA AGTTTTGGAA 1260 
AAAGGAAGAT CTAAGATGAT TGCATGTCCT ACAAAAGAAT CATCTACAAA AOCAAGTGCC 1320 
AATGATCAGA GGTTCCCATC AGAATCCAAA CAAGAGGAAG ATGAAGAATA TTCTTGTGAT 1380 
TCTCGGAGTC TCTTTGAGAG TTCTGCAAAG ATTCAAGTOT GTATACCTGA GTCTATATAT 1440 
CAAAAAGTAA TG GAG AT AAA TAGAGAAGTA GAAGAGCCTC CTAAGAAGCC ATCTGCCTTC 1500 
AAGCCTGCCA TTGAAATGCA AAACTCTGTT CCA AAT AAAG CCTTTGAATT GAAG AATGAA 1560 
CAAACATTGA GAGCAGATCC GATGTTCCCA CCAGAATCCA AACAAAAGGA CTATGAAGAA 1620 
AATTCTTGGG ATTCTGAGAG TCTCTGTG AG ACTGTTTCAC AGAAGGATGT GTGTTTACCC 1680 
AAGGCTACAC ATCAAAAAGA AATAGATAAA ATAAATGG AA AATTAGAAGA GTCTCCTAAT 1740 
AAAG ATG GTC TTCTGAAGGC TACCTGCGGA ATG AAAG TTT CTATTCCAAC TAAAGCCTTA 1800 
GAATTGAAGG ACATGCAAAC TTTCAAAGCG GAGCCTCCGG GGAAGCCATC TGCCTTCGAG 1860 
CCTGCCACTG AAATGCAAAA GTCTGTCCCA AATAAAGCCT TGGAATTG AA A A ATG A AC A A 1920 
ACATGGAGAG CAGATGAG AT ACTCCCATCA GAATCCAAAC AAAAGGACTA TGAAGAAAAT 1980 
TCTTGGGATA CTGAGAGTCT CTGTGAGACT GTTTCACAGA AGGATGTGTG TTTACCCAAG 2040 
GCTGCGCATC AAAAAGAAAT AGATAAAATA AATGGAAAAT TAGAAGGGTC TCCTGTTAAA 2100 
GATGGTCTTC TGAAGGCTAA CTGCGGAATG AAAGTTTCTA TTCCAACTAA AGCCTTAGAA 2160 
TTGATGGACA TGCAAACTTT CAAAGCAGAG CCTCCCGAGA AGCCATCTGC CTTCGAGCCT 2220 
GCCATTGAAA TGCAAAAGTC TGTTCCAAAT AAAGCCTTGG AATTG AAGAA TGAACAAACA 2280 
TTG AGAGCAG ATGAGATACT CCCATCAGAA TCCAAACAAA AGGACTATGA AGAAAGTTCT 2340 
TGGGATTCTG AGAGTCTCTG TGAGACTGTT TCACAOAAGG ATGTGTGTTT ACCCAAGGCT 2400 
ACACATCAAA AAGAAATAGA TAAAATAAAT GGAAAATTAG AAGAGTCTCC TGATAATGAT 2460 
GGTTTTCTGA AGGCTCCCTG CAGAATGAAA GTTTCTATTC CAACTAAAGC CTTAGAATTG 2520 
ATGGACATGC AAACTTTCAA AGCAG AGCCT CCCGAGAAGC CATCTGCCTT CG AGCCTGCC 2580 
ATTGAAATGC AAAAGTCTGT TCCAAATAAA GCCTTGG AAT TG AAGAATGA ACAAACATTO 2640 
AGAGCAGATC AGATGTTCCC TTCAG AATCA AAACAAAAGA AGGTTGAAGA AAATTCTTGG 2700 
GATTCTGAG A GTCTCCGTGA GACTGTTTCA CAGAAGGATG TGTGTGTACC CAAGGCTACA 2760 
CATCAAAAAG AAATGGATAA AATAAGTGGA AAATTAGAAG ATTCAACTAG CCTATCAAAA 2820 
ATCTTGGATA CAGTTCATTC TTG TG A A AG A GCAAGGOAAC TTCAAAAAGA TCACTGTGAA 2880 
CAACGTACAG GAAAAATGG A ACAAATGAAA AAGAAGTTTT GTGTACTGAA AAAGAAACTG 2940 
TCAGAAGCAA AAGAA ATAAA ATCACAGTTA GAGAACCAAA AAGTTAAATG GGAACAAGAG 3000 
CTCTCCAGTG TGAGATTGAC TTTAAACCAA GAAGAAG AGA AGAGAAGAAA TGCCGATATA 3060 
TTAAATGAAA AAATTAGGGA AGAATTAGGA AGAATCGAAG AGCAGCATAG GAAAGAGTTA 3120 
GAAGTGAAAC AACAACTTGA ACAGGCTCTC AGAATACAAG ATATAGAATT GAAGAGTGTA 3180 
G AAAGTAATT TGAATCAGGT TTCTCACACT CATGAAAATG AAAATTATCT CTTACATGAA 3240 
AATTGCATGT TO AAAAAGGA AATTGCCATG CTAAAACTGG AAATAGCCAC ACTGAAACAC 3300 
CAATACCAGG AAAAGGAAAA TAAATACTTT G AGGACATTA AGATTTTAAA AG AAAAG AAT 3360 
GCTGAACTTC AG ATGACCCT AAAACTGAAA G AGG AATCAT TAACT AAAAG GGCATCTCAA 3420 
TATAGTGGGC AGCTTAAAGT TCTGATAGCT GAGAACACAA TGCTCACTTC TAAATTGAAG 3480 
GAAAAACAAG ACAAAGAAAT ACTAGAGGCA GAAATTGAAT CACACCATCC TAGACTGGCT 3540 
TCTGCTGTAC AAGACCATGA TCAAATTGTG ACATCAAGAA AAAGTCAAGA ACCTGCTTTC 3600 
CACATTGCAG GAGATGCTTG TTTGCAAAGA AAAATGAATG TTGATGTGAG TAGTAGGATA 3660 
TATAACAATG AGGTGCTCCA TCAACCACTT TCTG AAGCTC AAAGGAAATC CAAAAGCCTA 3720 
AAAATTAATC TCAATTATGC AGG AGATGCT CTAAG AGAAA ATACATTGGT TTCAG AACAT 3780 
GCACAAAGAG ACCAACGTGA AACACAGTGT CAAATGAAGG AAGCTGAACA CATGTATCAA 3840 
AACGAACAAG ATAATGTGAA CAAAC ACACT GAACAGCAGG AGTCTCTAGA TCAGAAATTA 3900 
TTTCAACTAC AAAGCAAAAA TATGTGGCTT CAACAGCAAT TAGTTCATGC ACATAAG AAA 3960 
GCTGAC AAC A AAAGCAAGAT AACAATTGAT ATTCATTTTC TTG AGAGGAA AATGCAACAT 4020 
CATCTCCTAA AAG AG AAAAA TG AGG AGATA TTTAATTACA ATAACCATTT AAAAAACCGT 4080 
ATATATCAAT ATGAAAAAGA GAAAGCAGAA ACAGAAAACT CATGAGAGAC AAGCAGTAAG 4140 
AAACTTCTTT TGGAGAAACA ACAG ACCAGA TCTTTACTCA CAACTCATGC TAGGAGGCCA 4200 
GTCCTAGCAT CACCTTATGT TG AA AATCTT ACCAATAGTC TGTGTCAACA G AATACTTAT 4260 
TTTAG AAGAA AAATTCATG A TTTCTTCCTG AAGCCTACAG ACATAAAATA ACAGTGTG AA 4320 
GAATTACTTG TTCACOAATT GCATAAAGCT GCACAGGATT CCCATCTACC CTGATGATOC 4380 
AGCAGACATC ATTCAATCCA ACCAGAATCT CGCTCTGCAC TCCAGCCTAG GTG ACAG AGT 4440 
G AGACTCCAC CTCGG AAA 



Seq ID NO: 4 Protein sequence: 
Protein Accession #: NP_443723.1 

1 11 21 31 41 51 

MTKRKKTINL MQDAQKRTA LHWACVNGHE EVVTFLVDRK CQLDVLDGEH RTPLMKALQC 60 
HQEACAN1LI DSG ADINLVD VYGNMALHYA VYSEILSWA KLLSHGAVIE VHNKASLTPL 120 
LLSnXRSEQ IVEFLLDGNA NANAVNKYKC TALMLAVCHG SSEIVGMLLQ QNVDVFAADI 180
CG VTAEHYAV TCGFHHIHEQ IMEY1RKLSK NHQNTNPEGT SAGTPDEAAP LAERTPDTAE 240
SLVEKTPDEA APLVERTPDT AESLVEKTPD EAASLVEGTS DKIQCLEKAT SGKFEQSAEE 300 
TPREITSPAK ETSEKFTWPA KGRPRKIAWE KKEDTPREIM SPAKETSEKF TWAAKGRPRK 360 
IAWEKKETPV KTGCVARVTS NKTKVLEKGR SKMIACPTKE SSTKASANDQ RFPSESKQEE 420 
DEEYSCDSRS LFESSAKIQV CIPESJYQKV MEINREVEEP PKKPSAFKPA IEMQNSVPNK 480 
AFEUCNEQTL RADPMFPPES KQKDYEENSW DSESLCETVS QKDVCLPKAT HQKEIDKING 540 
KLEESPNKDG LLKATCGMKV SIPTKALELK. DMQTFKAEPP GKPSAFEPAT EMQKSVPNKA 600 
LELKMEQTWR ADEILPSESK QKDYEENSWD TESLCETVSQ KDVCLPKAAH QKEiDKINGK. 660 
LEGSPVKDGL LKANCGMKVS IPTKALELMD MQTFKAEPPE KPSAFEPAIE MQKSVPNKAL 720 
ELKNEQTLRA DEILPSESKQ KDYEESSWDS ESLCETVSQK DVCLPKATHQ KEIDKINGKL 780 
EESPDNDGFL KAPCRMKVSI PTKALELMDM QTFKAEPPEK PSAFEPAIEM QKSVPNKALE 840 
LKNEQTLRAD QMFPSESKQK KVEENSWDSE SLRETVSQKD VCVPKATHQFC EMDKISGKLE 900 
DSTSLSKILD TVHSCERARE LQKDHCEQRT GKMEQMKKKF CVLKKKLSEA KEUCSQLENQ 960 
KVKWEQELCS VRLTLNQEEE KRRNAD1LNE KIREELGRIE EQHRKELEVK QQLEQALRIQ 1020 
DIELKSVESN LNQVSHTHEN ENYLLHENCM LKKEIAMLKL EIATLKHQYQ EKENKYFEDI 1080 
KILKEKNAEL QMTLKLKEES LTKRASQYSG QLKVLIAENT MLTSKLKEKQ DKEILEAEIE 1140
SHHPRLASA V QDHDQIVTSR KSQEPAFHIA GDACLQRKMN VDVSSTIYNN EVLHQPLSEA 1200 
QRKSKSLKIN LNYAGDALRE NTLVSEHAQR DQRETQCQMK EAEHMYQNEQ DNVNKHTEQQ 1260 
ESLDQKJLFQL QSKNM WLQQQ LVHAHKKADN KSKJT1DIHF LERKMQHHLL KEKNEEIFNY 1320 
NNHLKNRIYQ YEKEKAETEN S 

Seq ID NO: 5 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 273-1785 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51

CGCGGGGCGC GGAGTCGGCG GGGCCTCGCG GGACGCGGGC AGTGCGGAGA CCGCGGCGCT 60 
GAGGACGCGG GAGCCGGGAG CGCACGCGCG GGGTGGAGTT CAGCCTACTC TTTCTTAGAT 120 
GTGAAAGGAA AGGAAGATCA TTTCATGCCT TGTTGATAAA GGTTCAGACT TCTGCTGATT 180
CATAACCATT TGGCTCTGAG CTATGACAAG AGAGGAAACA AAAAGTTAAA CTTACAAGCC 240
TGCCATAAGT GAGAAGCAAA CTTCCTTGAT AACATGCTTT TGCGAAGTGC AGGAAAATTA 300
AATGTGGGCA CCAAGAAAGA GGATGGTGAG AGTACAGCCC CCACCCCCCG TCCAAAGGTC 360
TTGCGTTGTA AATGCCACCA CCATTGTCCA GAAGACTCAG TCAACAATAT TTGCAGCACA 420
GACGGATATT GTTTCACGAT GATAGAAGAG GATGACTCTG GGTTGCCTGT GGTCACTTCT 480
GGTTGCCTAG GACTAGAAGG CTCAGATTTT CAGTGTCGGG ACACTCCCAT TCCTCATCAA 540
AGAAGATCAA TTGAATGCTG CACAGAAAGG AACGAATGTA ATAAAGACCT ACACCCTACA 600
CTGCCTCCAT TGAAAAACAG AGATTTTGTT GATGGACCTA TACACCACAG GGCTTTACTT 660
ATATCTGTGA CTGTCTGTAG TTTGCTCTTG GTCCTTATCA TATTATTTTG TTACTTCCGG 720
TATAAAAGAC AAGAAACCAG ACCTCGATAC AGCATTGGGT TAGAACAGGA TGAAACTTAC 780
ATTCCTCCTG GAGAATCCCT GAGAGACTTA ATTGAGCAGT CTCAGAGCTC AGGAAGTGGA 840
TCAGGCCTCC CTCTGCTGGT CCAAAGGACT ATAGCTAAGC AGATTCAGAT GGTGAAACAG 900 
ATTGGAAAAG GTCGCTATGG GGAAGTTTGG ATGGGAAAGT GGCGTGGCGA AAAGGTAGCT 960 
GTGAAAGTGT TCTTCACCAC AGAGGAAGCC AGCTGGTTCA GAGAGACAGA AATATATCAG 1020 
ACAGTGTTGA TGAGGCATGA AAACATTTTG GGTTTCATTG CTGCAGATAT CAAAGGGACA 1080 
GGGTCCTGGA CCCAGTTGTA CCTAATCACA GACTATCATG AAAATGGTTC CCTTTATGAT 1140
TATCTGAAGT CCACCACCCT AGACGCTAAA TCAATGCTGA AGTTAGCCTA CTCTTCTGTC 1200
AGTGGCTTAT GTCATTTACA CACAGAAATC TTTAGTACTC AAGGCAAACC AGCAATTGCC 1260 
CATCGAGATC TGAAAAGTAA AAACATTCTG GTGAAGAAAA ATGGAACTTG CTGTATTGCT 1320 
GACCTGGGCC TGGCTGTTAA ATTTATTAGT GATACAAATG AAGTTGACAT ACCACCTAAC 1380 
ACTCGAGTTG GCACCAAACG CTATATGCCT CCAGAAGTGT TGGACGAGAG CTTGAACAGA 1440
AATCACTTCC AGTCTTACAT CATGGCTGAC ATGTATAGTT TTGGOCTCAT CCTTTGGGAG 1500
GTTGCTAGGA GATGTGTATC AGGAGGTATA GTGGAAGAAT ACCAGCTTCC TTATCATGAC 1560
CTAGTGCCCA GTGACCCCTC TTATGAGGAC ATGAGGGAGA TTGTGTGCAT CAAGAAGTTA 1620
CGCCCCTCAT TCCCAAACCG GTGGAGCAGT GATGAGTGTC TAAGGCAGAT GGGAAAACTC 1680
ATGACAGAAT GCTGGGCTCA CAATCCTGCA TCAAGGCTGA CAGCCCTGCG GGTTAAGAAA 1740
ACACTTGCCA AAATGTCAGA GTCCCAGGAC ATTAAACTCT GATAGGAGAG GAAAAGTAAG 1800 
CATCTCTGCA GAAAGCCAAC AGGTACTCTT CTGTTTGTGG GCAGAGCAAA AGACATCAAA 1860 
TAAGCATCCA CAGTACAAGC CTTGAACATC GTCCTGCTTC CCAGTGGGTT CAGACCTCAC 1920 
CTTTCAGGGA GCGACCTGGG CAAAGACAGA GAAGCTCCCA GAAGGAGAGA TTGATCCGTG 1980 
TCTGTTTGTA GGCGGAGAAA CCGTTGGGTA ACTTGTTCAA GATATGATGC AT
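
Each DNA entry in this listing gives a "Coding sequence" range and prints the sequence in rows of ten-residue blocks followed by a running position count. The following is a minimal sketch, not part of the original listing, of how such rows could be joined and a coding range extracted for checking. It assumes the ranges are 1-based and inclusive, which the listing does not state explicitly; the function names and the toy rows are hypothetical and are not taken from any entry above.

```python
# Illustrative sketch only: helper names and toy data are hypothetical.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def parse_listing_rows(rows):
    """Join rows laid out as whitespace-separated residue blocks plus a trailing count."""
    parts = []
    for row in rows:
        tokens = row.split()
        # Drop trailing numeric tokens: the running position count at the end of a row
        # (this also removes a count that was split in extraction, e.g. "1 140").
        while tokens and tokens[-1].isdigit():
            tokens.pop()
        parts.append("".join(tokens))
    return "".join(parts).upper()


def coding_region(sequence, start, end):
    """Slice a region given 1-based, inclusive coordinates (an assumed convention)."""
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]


def check_orf(region):
    """Report basic consistency checks for a putative coding region."""
    return {
        "length": len(region),
        "in_frame": len(region) % 3 == 0,
        "starts_with_atg": region.startswith("ATG"),
        "ends_with_stop": region[-3:] in STOP_CODONS,
    }


# Toy rows in the same layout as the listing (made-up sequence, not from any Seq ID).
toy_rows = [
    "CCGGAATGGC TAAAGGTTGA 20",
    "TTTT",
]
toy_sequence = parse_listing_rows(toy_rows)
toy_region = coding_region(toy_sequence, 6, 20)
toy_report = check_orf(toy_region)
```

Because the printed sequences were recovered by text extraction, a check of this kind may flag a range whose endpoints no longer line up with the printed residues; such a mismatch would point to a damaged row rather than to an error in the stated range.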

Seq ID NO: 6 Protein sequence:
Protein Accession #: none found

1 11 21 31 41 51 

MLLRSAGKLN VGTKKEDGES TAPTPRPKVL RCKCHHHCPE DSVNNICSTD GYCFTMIEED 60
DSGLPWTSG CLGLEGSDFQ CRDTPIPHQR RSIECCTERN ECNKDLHPTL PPLKNRDFVD 120 
GPIHHRALLI SVTVCSLLLV LIILFCYFRY KRQETRPRYS IGLEQDETYI PPGESLRDU 180 
EOSQSSGSGS GLPLLVQRTI AKQIQMVKQI GKGRYG EVWM GKWRGEKVAV KVFFTTEEAS 240 
WFRETEIYQT VLMRHENILG FIAADDCGTG SWTQLYLTTD YHENGSLYDY LKSTTLDAKS 300 
MLKLAYSSVS GLCHLHTEIF STQGKPAIAH RDLKSKNILV KKNGTCCIAD LGLAVKFISD 360 
TNEVDIPPNT RVGTKRYMPP EVLDESLNRN HFQSYIMADM YSFGLILWEV ARRCVSGGIV 420
EEYQLPYHDL VPSDPSYEDM REIVCIKKLR PSFPNRWSSD ECLRQMGKLM TECWAHNPAS 480
RLTALRVKKT LAKMSESQDI KL

Seq ID NO: 7 DNA sequence
Nucleic Acid Accession #: none found

Coding sequence: 482-3007 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51

AACTGAGCTA ACAAGAAATA CTAGAAAAGG AGGAAGGAGA ACATTGCTGCAGCTT^ATC 60 
TACAACCTAA GAAAGCAAG A GTGATCAATC TCAGCTCTGT TAAACATCTT GTTTACTTAC 120 
TGCATTCAGC AGCTTGCAA A TGGTTAACTA TATGCAAAAA AGTCAGCATA GCTGTG AAGT 180 
ATGCCGTGAA TTTTAATTGA GGGAAAAAGG GACAATTGCT TCAGGATGCT CTAOTATGCA ^ 240 
CTCTGCTTGA AATATTTTCA ATGAAATGCT CAGTATTCTA TCTTTGACCA GAG(^AA 300

CTTTATGAAG CTATGGGACT TG AC AAA A AG TGATATTTGA GAAG AA^TA CGCAGTGGTT 360 
GGTGTTTTCT TTTTTTTAAT AAAGGAATTG AATTACTTTG AACACCTCTT CCAGCTOTGC 420 
ATTACAGATA ACGTCAGGAA GAGTCTCTGC TTTACAGAAT CGG ATTTCAT CACATGACAA 480 

caTgaagctgVggI^catct™ 

CCAAACTCCA GTGCTCTCAT CCAGAGGCTC TTGTGATTCT CTTTGCAATT GTGAGGAAAA 600
AGATGgScAATGCTAATAA ATTGTG AAGC AAAAGGTATC AAGATGGTAT ' CTGAAATAAG ^ 660 
TGTGCCACCA TCACGACCTT TCCAACTAAG CTTATTA AAT ™ 
CAC A A ATG AC TTTTCTGGGC TTACCAATGC TATTTCAATA CACCTTGG AT TTAACAATAT 780 

tgcaga^tVg^^ ™ 
TCACAATTCT TTAGAAATTC TTAAAGAGGA TACTTTCCAT GGACTGGAAA ACCTGGAATT 900

CCTGCAAGCA G ATAACAATT TTATCACAGT GATTGAACCA AGTGCCTTTA GCAAGCTCAA 960 
CA^ACTCAA^GTGTTW AGTCITCXTCCA^CATOT 1020 

CCGATTTGTT CCTTTAACCC ATCTAGATCT TCGTGGAAAT ^^I^^^A ^^ultaaaTP 1 1 40 
TGTTGGTTTT CTCGAACACA TTGGCCGAAT ATTGGATCTT CAGTTGGAGG ACAACAAATG 140 

30 ggcctcc^tVgtgacitattgcagttaaa AACTTGGTTG GAGAACATGC LCTOAWTC 1200 
TATAATTGGT GATGTTGTCT GCAACAGCCC TCCATTTTTT AAAGGAAGTA TACTCAffTAG 1260 
ACTAAAGAAG GAATCTATTTGCCCTACTCC ACCAGTGTATGAAGAACATG AGGATCCrrC 320 
AGGATCATTA CATCTGGCAG CAACATCTTC AATAAATGAT AGTCGCATGT CAACTAAG AC 1380 
' C ACGTCC ATT CTAAAACTAC CCACCAAAGC ACCAGGTTTG ATACCTTATA^ACAAAGCC 1440 

ATCCACTCAA CTTCCAGGAC CTTACTGCCC TATTCCTTGT AACTGCAAAG TCCTATCCCC 1500
ATCAGGACTT CTAATACATT GTCAGGAGCG CAACATTGAA AGCTTATCAG ATCTGAGACC 1560 
TCCTCCGCAA AATCCTAG AA AGCTCATTCT AGCGGGAAAT ATTATTCACA CTTTAATGAA 1620 
GTCTGATCTA GTGGAATATT TCACTTTGGA AATGCTTCAC ^J^^^f^^J^J!^J^ Q 
AGTTCTTG AA GAAGGATCGT TTATG AACCT AACGAGATTA CAAAAACTCT ATCTA^TCG 1740 

TAACCACCTG ACCAAATTAA GTAAAGGCAT GTTCCTTGGT CTCCATAATC 1800
ATATCTTGAA TACAATGCCA TTAAGGAAAT ACTGCCAGGA ACCrrTAATC^TGO^AA I860 
ACTTAAAGTC CTGTATTTAA ATAACAACCT CCTCCAAGTT TTACC^CATKTmmi920 
AGGGGTTCCT CTAACTA AGG TAA ATCTTAA AACAAACCAG TTTACCCATC TACCTGTAAG 1980 
TAATATTrTG G ATGATCTTG ATTTACTAAC CCAGATTGAC CTTGAGGATA ACCCCTGGGA 2040 

CTGCTCCTGT GACCTGGTTG GACTGCAGCA ATGGATACAA AAGTTAAGCA AGAACACAGT 2100
GACA^^TOAC ATCCTCTGC^ CTTCCCCCGG GCATCTCGAC AAAAAGGAAT TGAAAGCCCT 2160 
AA^TAGTGAAA^^ 

TTACCTTATG GTCACCACTC CTGCAACAAC AACAAATACG G ^ AT ^J™^^,^f 
TCTTACGG AC GCTGTGCCAC TGTCTGTTCT AATATTGGGA CTTCTGATTA TGTTCATCAC 2340 
50 TATTGTTTTC TGTGCTGCAG GGATAGTGGT TCTTGTTCTT ^^^^^^^^^^J^^j^^^ 
AAAGAAACAA GTAGATGAGC AAATGAGAGA ^^^^J^^^^^E^lJ^li^j^S^^S^^ 
GTATGGCCAT AAAACCACTC ATCACACTAC TG AAAGACCC TCTGCCTCAC TCTATG AACA 2520 
GC ACATGGTG AGCCCCATGG TTCATGTCTA TAG AAGTCCA TCCTTTGGTC CAAAGCATCT 2580 
GGAAGAGGAA GAAGAGAQG A ATGAG AAAGA AGGAAGTGAT GCAAAACATCTCCAAAGAAG 2640 

TCTTTTGGAA CAGGAAAATC ATTCACCACT CACAGGGTCA AATATGAAAT ACAAAACCAC 2700

G^lCC^TCA aS^ AATTTT TATCCTTCCA AG ATGCCAGC TCATTGTACA GAAACATTTT 2760 
AG^Sa>^G^^^SgGA>^TTC AGCAACTGGG AATCACAGAATACCTAAGGA AAAACATOC 2820 
TCAGCTCCAG CCTG ATATGG AGGCACATTA TCCTGG AGCC CACGAAGAGC TG AAGTTAAT 2880 
GGAAAC/^nA ATGTACTCAC GTCCAAGGAA GGTATTAGTG G AACAG ACAA AAAATG ACTA 2940 
60 TTTTGAACTTAAJ^CTAAn' TACATGCTG A ACCTGACTAT TTAG AAGTCC TGG AGCAGCA 3000 
AAC^AGATC GAGAGTTTGA GGGCTTTCGC AGAAATGCTG TGATTCTGTT TT AAGTCC AT 3060 

acotgTa^ 3120 
AACTATGGGG AAAAAAAAAG AAGAAG AAAA GAAACTCAGG GATCACTGGG AGAAGCCATG 3180 
g^Itctt ? C^GCA^ AGTCTGTCCC AAATAAAATC AATCCTTGCA TGTAAATC 






Seq ID NO: 8 Protein sequence: 
Protein Accession #: none found 



1 11 21 31 41 51

HNSLEILKED TFHGLENLEF LQADNNFITV IEPSAFSKLN RUCVUUTON AIESLPPNIF 180
7S R^I ^DL RONOLOTm' VQFLEHIORI LDLQLEDNKW ACNCDLLQLK. TWLENMPPQS 240 
^V^SP PF^§^U<^sStP PVYeSeDPS GSLH1AATSS INDSRMSTKT 300 
pS WsTQ^CP IPCNCKVLSP SGLUHCQER NIESI^DLRP 360



PPONPRKUL AGNUHSLMK SDLVEYFTLE MLHLGNNR1E VLEEGSFMNL TRLQKLYLNG 420 
nXk^KGMFW^EYL YLEYNAIKE1 LPGTFNPMPK LKVLYLNNNL LQVLPPHIFS 480 
GWLTKWLK T^Fra^VS NILDDLDLLT QIDLEDNPWD CSCDLVGLQQ WI Q K ^ 1 ^7Y M f 4 ^ 
TDDILCTSPG HLDKKELKAL NSEILCTGLV NNPSMPTQTS YLMVTTPATT TNTADTILRS 600 

T 

Seq ID NO: 9 DNA sequence 

Nucleic Acid Accession #:NM_003474 

Coding sequence: 307-3036 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51

CACTAACGCT CTTCCTAGTC CCCGGGCCAA CTCGGACAGT TTGCTCATTT ATTGCAACGQ 60 
TCAAGGCTGG CTTGTGCCAG AACGGCGCGC GCGCGACGCA CGCACACACA CGGGGGGAAA 120 

CTTTmAAA AATGAAAGGC TAGAAGAGCT CAGCGGCGGC GCGGGCCGTG C000AOG0CT 180
CCCGAGCTGACrCGCCGAGG CAGGAAATCC CTCCGGTCGC GACGCCCGGC CCCGCTCGGC 240 
GCCCGCGTGG GATGGTGCAG CGCTCGCCGC CGGGCCCG AG AGCTGC^ CTG AAGGCCG 300 
GCGACGATGG CAGCGCGCCC GCTGCCCGTG TCCCCCGCCC GCGCCCTCCT GCTCGCCCTO 360 
GC^Sc TCC^raOC CTCCGAGGCC CGAGGGGTGAGCnATGG^CGAACMJ^AGA 420 

GCTGATGAAG TTGTCAGTGC CTCTGTTCGG AGTGGGGACC TCTGGATCCC AGTGAAGAGC 480

^ TTCGACTCCA AGAATCATCX AGAAGTGCTG AATATTOACTA^^GA^GC^GAA 540 
CTGATCATAA ATCTCGAAAG AAATGAAGGT CTCATTGCCA GCAGTTTCAC G G AAACCC AC 600 
TATf-TOCAAG ACGGTACTGA TGTCTCCCTC GCTCGAAATT ACACGGTAAT TCTGGGTCAC 660 
TGTTACTACC ATGGACATGT ACGGGQATAT TCTG ATTCAG CAGTCAGTCT CAGCACGTGT 720 

TCTGGTCTCA GGGGACTTAT TGTGTTTGAA AATGAAAGCT ATGTCTTAGA ACCAATGAAA 780

30 ScSaTcaga^ 

TfirnG atcac atcacaacac accaaacctc gctgcaaaga atgtgtttcc ACT ACCCTCT SOU 

^AWTCGO^^GGC^TAAAAGAGAG ACCCTCAAGO CAACTAAGTATOTCGAGCTG 960 
rrr aTCTTGG cagacaaccg AGAGTTTCAG AGGCAAGCAA AAGATCTGGA AAAAGTTAAG 1020 
CAGCGATTAA TAGAGATTGC TAATCACGTT GACAAGTTTT ACAGACCACT GAACATTCGG 1080
^ SgSgS tSStcSa AGTCTGGAATaACATOGACA "J™^ 

w'iTTCjrrA GCCTCCATGA ATTTCTGGAC TGGAGGAAGA TGAAGCTTCT ACCTCGCAAA 1200 
StGAC^ ATG^c7gCT?gS-GGG GTTTATTTCC AAGGGACCAC CATCGGCATG .260 
SxCC/£tCA TOAGCATGTG CACGGCAGAC CAGTCTGGGG GAATTGTCAT GQ^CATTCA 1320 
GAGAATCCCC TTGGTGCAGC CGTGACCCTG GCACATGACC TGGGCCACAA TTTCGGGATG 1380
a A^TCACA CACTGGACAG GGGCTGTAGC TGTCAAATGG CGGTTOAGAA AGGAGGCTGC 1440 
ATPATG AACG CTTCCACCGG GTACOCATTT CCCATGGTGT TCAGCAGTTG CAGCAGO AAG 1500 
rACTTGGAGA OCAGC^GGA GAAAGGAATG GGGGTGTGCC TGTTTAACCT GCCGGAAGTC 1560 
AGGGAGTCTT TCGGGGGCCA GAAGTGTGGG AACAGATTTG TGGAAGAAGG AGAGGAGTOT 1620 
45 G ACTCTGGGG AGTCAG AGGA ATGTATGAAT CGCTGCTGCA ATOCCACCAC CTQXAC^Cllj 680 
AACttGGACG CTOTGTGCGC ACATOGGCTG TGCTGTGAAG ACTGCCAGCT GAAGCCTOCA 740 
r-r AArAOCGT GCAGGGACTC CAGCAACTCC TGTGACCTCC CAGAGTTCTG CACAGGGGCC 1800 
Arr^S? G^AGCCAA OTGWCCTG CACGATGGGC ACTCATGTCA GGATGTGGAC 1860 
GGCTACTGCT ACAATGGCAT CTGCCAGACT CACGAGCAGC AGTGTGTCAC ACTCTGGGGA 1920 
5 0 C^CTCCTAAACCTOCCCC TGGGATCTGC TTTGAGAGAG TCAATTCTGC AGGTGATCCT 1980 
5 TATGGCAACT GTGGCAAAGT CTCGAAGAGT TCCTTTGCCA ^^^^^J^Pj^f^^jL* 

a a AiirrnGAA AAATCCAGTG TCAAGGAGGT GCCAGOCGGC CAGTCATTGG TACCAATGCC 2100 

gtitSatag^ a^u^a^a^at ccccctgcag caaggaggccgg^ctgtoccgggggacc 2 m 

PAPfiTGTACT TGGGCGATGA CATGCCGOAC CCAGGGCTTG TGCTTGCAGG CACAAAGTGT 2220 
GAGGCCCACT GGGCACCTCC CTTCTGTGAC AAGTTTGGCT TTGGAG^G (^CAGACAGC 2400 

SSSaSSt OCTVWOCAOC OA^A^TCCCAOSAOATTGCTGCAGTG TCAGAATGTT 2700 
^ a r r-A (TAl^X^TrcA^ CGGCCTGAAT GTCCCTCAGC CCCAGTCAAC TCAGCGAGTG 2760 
So?~™« CTGCCAGACC CCTGCCAGCC 2820 

65 aTESE: vmaacMK xcagggoacc tgtaagccaa acccccctca gaagcctctg 2«m 

03 t£S£l?%rr nnmcCAC. AACAACTCGG CTCACTCATG CCTTGGCCAG OACCCCAGGA 2940 
SgSSc^^^^c^SSctg CTCCA^^CACCAA ,000 
rrnCCCAGAT CCACCCACAC CGCCTATATT AAOTGAGAAO CCGACACCTT TTTTCAACAG 3060 
^?S*rAfi A AnTfrcCACT ATCTTTCAGC TCCAGTTGGA GTTTTTTGTA CCAACTTTTA 3120 
70 MASTMSMMOTAaATMOMCmOAGaACTGraTCAGT 3180 
70 GCTGTGCTGTGCTATGGTGC TCTGTCTACT TGCACAGGTA C^AWTT AT^AT^ 3240 
GCAGAATGTT GATTACAGTG CAGTGCGCTG TAGTAGGCAT TnTACCATC ACTGAOTTTT 3300 
S^CAGG AAGGCTTGTT GTCCTTTTAG TAjmAGTC AA^GAAAT ATTCTOCTO 33« 



fVATGGCAGG AAGGCTTGTT GTGCTTTTAU TAl 1 1 iauivj /w-i /mv^»wv-wv. — «~ 

ATG GG AJTCT GG ACAG G ATG TGTTTGCTTT CTGATCAAGG CCTTATTGGA AAGCAGTCCC 3420 
^CTA^^G^TGC TTATGGTACC AGATGCAGCT CAAGAGATCC CAAGTAGAAT 3480 
rr^GTTGAT TTTCTC<^*IT CXXCATCTCA GGCCAGAGCC AAGGGGCTTC AGGTCCAGGC 3540 
TGTGTTTGGC TTTCaSgG AG GC^CTGTGCC CCTTGACAAC TGGCAGGCAG GCTCCCAGGG 3600
ACACCTGGGA GAAATCTGGC TTCTGGCCAG GAAGCTTTGG TGAGAACCTG GGTTGCAGAC 3660
AGGAATCTTA AGGTGTAGCC ACACCAGGAT AGAGACTGGA ACACTAGACA AGCCAGAACT 3720
TGACCCTGAG CTGACCAGCC GTGAGCATGT TTGGAAGGGG TCTGTAGTGT CACTCAAGGC 3780
GGTGCTTGAT AGAAATGCCA AGCACTTCTT TTTCTCGCTG TCCTTTCTAG AGCACTGCCA 3840
CCAGTAGGTT ATTTAGCTTG GGAAAGGTGG TGTTTCTGTA AGAAACCTAC TGCCCAGGCA 3900
CTGCAAACCG CCACCTCCCT ATACTGCTTG GAGCTGAGCA AATCACCACA AACTGTAATA 3960
CAATGATCCT GTATTCAGAC AGATGAGGAC TTTCCATGGG ACCACAACTA TTTTCAGATG 4020
TGAACCATTA ACCAGATCTA GTCAATCAAG TCTGTTTACT GCAAGGTTCA ACTTATTAAC 4080 
AATTAGGCAG ACTCTTTATG CTTGCAAAAA CTACAACCAA TGGAATGTGA TGTTCATGGG 4140 

TATAGTTCAT GTCTGCTATC ATTATTCGTA GATATTGGAC AAAGAACCTT CTCTATGGGG 4200
CATCCTCTTT TTCCAACTTG GCTGCAGGAA TCTTTAAAAG ATGCTTTTAA CAGAGTCTGA 4260
ACCTATTTCT TAAACACTTG CAACCTACCT GTTGAGCATC ACAGAATGTG ATAAGGAAAT 4320
CAACTTGCTT ATCAACTTCC TAAATATTAT GAGATGTGGC TTGGGCAGCA TCCCCTTGAA 4380 
CTCTTCACTC TTCAAATGCC TGACTAGGGA GCCATGTTTC ACAAGGTCTT TAAAGTGACT 4440 

AATGGCATGA GAAATACAAA AATACTCAGA TAAGGTAAAA TGCCATGATG CCTCTGTCTT 4500
CTGGACTGGT TTTCACATTA GAAGACAATT GACAACAGTT ACATAATTCA CTCTGAGTGT 4560
TTTATGAGAA AGCCTTCTTT TGGGGTCAAC AGTTTTCCTA TGCTTTGAAA CAGAAAAATA 4620
TGTACCAAGA ATCTTGGTTT GCCTTCCAGA AAACAAAACT GCATTTCACT TTCCCGGTGT 4680
TCCCCACTGT ATCTAGGCAA CATAGTATTC ATGACTATGG ATAAACTAAA CACGTGACAC 4740 

AAACACACAC AAAAGGGAAC CCAGCTCTAA TACATTCCAA CTCGTATAGC ATGCATCTGT 4800
TTATTCTATA GTTATTAAGT TCTTTAAAAT GTAAAGCCAT GCTGGAAAAT AATACTGCTG 4860
AGATACATAC AGAATTACTG TAACTGATTA CACTTGGTAA TTGTACTAAA GCCAAACATA 4920
TATATACTAT TAAAAAGGTT TACAGAATTT TATGGTGCAT TACGTGGGCA TTGTCTTTTT 4980
AGATGCCCAA ATCCTTAGAT CTGGCATGTT AGCCCTTCCT CCAATTATAA GAGGATATGA 5040

ACCAAAAAAA AAAAAAAAAA AA



Seq ID NO: 10 Protein sequence: 
Protein Accession #: NP_003465.2

1 11 21 31 41 51

MAARPLPVSP ARALLLALAG 1 ALLAPCEARG VSLWNEGRAD EWSASVRSG DLWIPVKSFD 60 
SKNHPEVLNI RLQRESKELI INLERNEGL1 ASSFTETHYL QDOTDVSLAR NYTVILGHCY 120 

YHGHVRGYSD SAVSLSTCSG LRGLIVFENE SYVLEPMKSA TNRYKLFPAK KLKSVRGSCG 180
SHHNTPNLAA KNVFPPPSQT WARRHKRETL KATKYVELVI VADNREFQRQ GKDLEKVKQR 240 
LIE1ANHVDK. FYRPLNIR1V LVGVEVWNDM DKCSVSQDPF TSLHEFLDWR KMKLLPRKSH 300 
DNAQLVSGVY FQGTTIGMAP 1MSMCTADQS GGIVMDHSDN PLGAAVTLAH ELGHNFGMNH 360 
DTLDRGCSCQ MAVEKGGCIM NASTGYPFPM VFSSCSRKDL ETSLEKGMGV CLFNLPEVRE 420 

SFGGQKCGNR FVEEGEECDC GEPEECMNRC CNATTCTLKP DAVCAHGLCC EDCQLKPAGT 480
ACRDSSNSCD LPEFCTGASP HCPANVYLHD GHSCQDVDG Y CYNGICQTHE QQCVTLWGPG 540 
AKPAPGICFE RVNSAGDPYG NCGKVSKSSF AKCEMRDAKC GKIQCQGGAS RPVIGTNAVS 600 
IETNIPLQQG GRILCRGTHV YLGDDMPDPG LVLAGTKCAD GK1CLNRQCQ N1SVFGVHEC 660 
AMQCHGRGVC NNRKNCHCEA HWAPPFCDKF GFGGSTDSGP IRQADNQGLT IGILVTILCL 720 

LAAGFWYLK RKTLIRLLFT NKKTTIEKLR CVRPSRPPRO FQPCQAHLGH LGKGLMRKPP 780
DSYPPKDNPR RLLQCQNVDI SRPLNGLNVP QPQSTQRVLP PLHRAPRAPS VPARPLPAKP 840 
ALRQAQGTCK PNPPQKPLPA DPLARTTRLT HALARTPGQW ETGLRLAPLR PAPQYPHQVP 900 
RSTHTAY1K 

Seq ID NO: 11 DNA sequence

Nucleic Acid Accession #: NM_001394

Coding sequence: 400-1584 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51

GGCACGAGOG CTGAGCGCCG GAGGAGCGTA GGCAGGGCAG CGCTGGCGCC AGCGGCGACA 60
GGAGCCGCGC GACCGGCAAA AATACACGGG AGGCCGTCGC CGAAAAGAGT OCGCGGTCCT 120 
CTCTCGTAAA CACACTCTCC TCCACCGGCG (XTCCCCCTC CGCTCTGCGC GCCGCCCGGC 180 
TGGGCGCCCG AGGCCGCTCC GACTGCTATG TGACCGCGAG GCTGCGGGAG GAAGGGOACA 240 

GGGAAGAAGA GGCTCTCCCG CGGGAGCCCT TGAGGACCAA GTTTGCGGCC ACTTCTGCAG 300
GCGTCCCTTC TTAGCTCTCG CCTGCCCCTT TCTGCAGCCT AGGCGGCCCA GGTTCTCTTC 360
TCTTCCTCGC GCGCCCAGCC GCCTCGGTTC CCGGCOACCA TGGTGACGAT GGAGGAGCTG 420
CGGGAGATGG ACTGCAGTGT GCTCAAAAGG CTGATGAACC GGGACGAGAA TGGCGGCGGC 480
GCGGGCGGCA GCGGCAGCCA CGGCACCCTG GGGCTGCCGA GCGGCGGCAA GTGCCTGCTG 540 

CTGGACTGCA GACCGTTCCT GGCGCACAGC GCGGGCTACA TCCTAGGTTC GGTCAACGTG 600
CGCTGTAACA CCATCGTGCG GCGGCGGGCT AAGGGCTCCG TGAGCCTGGA GCAGATCCTG 660 
CCCGCCGAGG AGGAGGTACG CGCCCGCTTG CGCTCCGGOC TCTACTCGGC GGTCATCGTC 720 
TACGACGAGC GCAGCCCGCG CGCCGAGAGC CTCCGCGAGG ACAGCACCGT GTCGCTGGTG 780 
GTGCAGGCGC TGCGCCGCAA CGCCGAGCGC ACCGACATCT GCCTGCTCAA AGGCGGCTAT 840

GAGAGGTTTT CCTCCGAGTA CCCAGAATTC TGTTCTAAAA CCAAGGCCCT GGCAGCCATC 900
CCACCCCCGG TTCCCCCCAG CGCCACAGAG CCCTTGGACC TGGGCTGCAG CTCCTGTGGG 960 
ACCCCACTAC ACGACCAGGG GGGTCCTGTG GAGATCCTTC CCTTCCTCTA CCTCGGCAGT 1020 
GCCTACCATG CTGCCCGGAG AGACATGCTG GACGCCCTGG GCATCACGGC TCTGTTGAAT 1080 
GTCTCCTCGG ACTGCCCAAA CCACTTTGAA GGACACTATC AGTACAAGTG CATCCCAGTG 1140

GAAGATAACC ACAAGGCCGA CATCAGCTCC TGGTTCATGG AAGCCATAGA GTACATCGAT 1200
GCCGTGAAGG ACTGCCGTGG GCGCGTGCTG GTGCACTGCC AGGCGGGCAT CTCGCGGTCG 1260
GCCACCATCT GCCTGGCCTA CCTGATGATG AAGAAACGGG TGAGGCTGGA GGAGGCCTTC 1320
GAGTTCGTTA AGCAGCGCCG CAGCATCATC TCGCCCAACT TCAGCTTCAT GGGGCAGCTG 1380
CTGCAGTTCG AGTCCCAGGT GCTGGCCACG TCCTGTGCTG CGGAGGCTGC TAGOCCCTGG 1440 
GGACCCCTGC GGGAGCGGGG CAAGACCCCC GCCACCCCCA CCTCGCAGTT CGTCTTCAGC 1500 
mCCGGTCT CCGTGGGCGT GCACTCGGCC CCCAGCAGCC TGCCCTACCT GCACAGCCOC 1560 
ATCACCACCT CTCCCAGCTG TTAGAGCCGC CCTGGGGGCC CCAGAACCAG AGCTGGCTCC 1620 

CGACCGAGCT CCTTCCCCAT CATTTCTCCT TGGCCAACGA CGAGGCCAGC CAGAATGGCA 1740
A?A^GG AC^ CG^Sa^A AT^GCAA ACAGAACACT CCAACTTAGA GCAATAACCG 1800 
G^CGCAGC AGCCAGGGAA GACCTTGGTT TGGTTTATGT GTCAGTTTCA CTTTTCCGAT 1860 
AG^mCTTA^TTTTTITAAGCAG TAAGGCTTGA ^^^^^f^^^^^^ A ^? QiQ 
CTAGCAAATG TGCCCAACCA GCTTTACTAA AGGGGGAGGA AGGGAGGGCA AAGGGATGAO 1980 
AAGACAAGTT TCCCAGAAGT GCCTGGTTCT GGGTACTTGT CCCTTTGTTG TCGTTGTTGT 2040
A^^A^AAGGA ATTTCATTTT TAAAAGAAAT GTTCGAAGGT ^3/^ ^^'aT^t-T' r^-r*r^^r*ArVTA. ^2 l^K) 
ACCAACAG AT GAATAATTAT GCTTAATAAT AAAGTATTTA TT^GACTTT CTOAGAGTA 21 60 
TGAAAGTACA AAAAGTCTAG TTACAGTGG A TTTAGAATAT ATTTATGTTG ATCTCA^CA 2220 
GCTG AGCACC GTAGCATGCA GATGTCAAGG CAGTTAGG AA GTAAATGGTG TCJTGTAGAT 2280 
ATGTGCAAGG TAGCATGATG AGCAACTTGA GTTTGTTGCC ACTGAGAAGC AGGCGGGTTG 2340 
GGTGGGAGGA GGAAGAAAGG GAAGAATTAG GTTTGAATTG CTTTTT AAAA A^AA^^AAA 24M 
AGAAAAAGAC AGCATCTCAC TATGTTGCCA AGGCTCATCT TGAGAAGCAG GCGGGTTGGG 2460
TGGGAGGAGG AAGAAAGGGA AGAATTAGGT TTGAATTGCT TTTTTAAAAA AAAA

Seq ID NO: 12 Protein sequence:
Protein Accession #: NP_001385

1 11 21 31 41 51

MVTMEELREM DCSVLKRLMN RDENGGGAGG SGSHGTLGLP SGGKCLLLDC RPFLAHSAGY 60 
ILGSVNVRCN TIVRRRAKGS VSLEQILPAE EEVRARLRSG LYSAVIVYDE RSPRAESLRE 120
DSTVSLVVQA LRRNAERTDI CLLKGGYERF SSEYPEFCSK TKALAAIPPP VPPSATEPLD 180
LGCSSCGTPL HDQGGPVEIL PFLYLGSAYH AARRDMLDAL GITALLNVSS DCPNHFEGHY 240
QYKCIPVEDK HKADISSWFM EAIEYIDAVK DCRGRVLVHC QAGISRSATI CLAYLMMKKR 300
VRLEEAFEFV KQRRSIISPN FSFMGQLLQF ESQVLATSCA AEAASPSGPL RERGKTPATP 360
TSQFVFSFPV SVGVHSAPSS LPYLHSPITT SPSC 

Seq ID NO: 13 DNA sequence

Nucleic Acid Accession #: none found

Coding sequence: 68-340 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51

AGCX3CCTTGC CTTCTCITAO GCTTTGAAGC ATTTTTGTCT ^J^^^^^J^^^^^L ™ 
CACCACCATG AAGTTCTTAG CAGTCCTGGT ACTCTTGGGA <™^J^™™^ 120 
TGCCCAGAAT CCGACAACAG CTGCTCCAGC TGACACGTAT CCAGCTACTG GTCCTGCTGA 180
TGATGAAGCC CCTGATGCTG AAACCACTGC TGCTGCAACC ACTGCQACCA CTGCTGCTCC 240
TACCACTGCA ACCACCGCTG CTTCTACCAC TGCTCGTAAA GACATTCCAG TTTTACCCAA 300
ATGGGTTGGG GATCTCCCGA ATGGTAGAGT GTGTCCCTGA GATGGAATCA GCTOAGTCT 360
TCTGCAATTG GGTCACAACT ATTCATGCTT CCTGTGATTT CATCCAACTA CTTACCTTGC 420
CTACGATATC CCCTTTATCT CTAATCAGTT TATTTTCTTT CAAATAAAAA ATAACTATGA 480 
GCGAGCTAAC AT 

Seq ID NO: 14 Protein sequence:
Protein Accession #: none found 

1 11 21 31 41 51 

MKFLAVLVLL GVSIFLVSAQ NPTTAAPADT YPATGPADDE APDAETTAAA TTATTAAPTT 60 
ATTAASTTAR KDIPVLPKWV GDLPNGRVCP 

Seq ID NO: 15 DNA sequence 

Nucleic Acid Accession #: NM_016640.2

Coding sequence: 39-1358 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51 

GCTTAAGTTG ACCTCTGGGT CCGGAATCGC GGGCAAAGAT GGCGGCGGCC AGGTGTTGGA 60
GGCCTTTGCT ACGCGGTCCG AGGCTTTCAT TGCACACCGC GGCTAATGCC GCCGCCACGG 120
CTACAGAAAC GACCTCCCAA GACGTCGCGG CGACCCCCGT CGCGCGGTAC CCGCCGATTG 180
TGGCCTCCAT GACAGCCGAC AGCAAAGCTG CACGGCTGCG GCGGATCGAG CGCTGGCAGG 240
CGACGGTGCA CGCTGCGGAG TCGGTAGACG AGAAGCTGCG AATCCTCACC AAGATGCAGT 300
TTATGAAGTA CATGGTTTAC CCGCAGACCT TCGCACTGAA TGCCGACCGC TGGTACCAGT 360
ACTTCACCAA GACCGTGTTC CTGTCGGGTC TGCCGCCGCC CCCAGCGGAG CCCGAGCCCG 420 
AGCCCGAACC CGAACCTGAA CCTGCGCTGG ACCTCGCGGC GCTGCGTGCG GTCGCCTGCG 480 
ACTGCCTGCT GCAGGAGCAC TTCTACCTGC GGCGCAGGCG GCGCGTGCAC CGTTACGAGG 540
AGAGCGAGGT CATATCTTTG CCCTTCCTGG ATCAGCTGGT GTCAACCCTC GTGGGCCTCC 600
TCAGCCCACA CAACCCGGCC CTGGCCGCTG CCGCCCTCGA TTATAGATGC CCAGTTCATT 660
TTTACTGGGT GCGTGGTGAA GAAATTATTC CTCGTGGTCA TCGAAGAGGT CGAATTGATG 720
ACTTGCGATA CCAGATAGAT GATAAACCAA ACAACCAGAT TCGAATATCC AAGCAACTCG 780
CAGAGTTTGT GCCATTGGAT TATTCTGTTC CTATAGAAAT CCOCACTATA AAATGTAAAC 840
CAGACAAACT TCCATTATTC AAACGGCAGT ATGAAAACCA CATATTTGTT GGCTCAAAAA 900
CTGCAGATCC TTGCTGTTAC GGTCACACCC AGTTTCATCT GTTACCTGAC AAATTAAGAA 960
GGGAAAGGCT TTTGAGACAA AACTGTGCTG ATCAGATAGA AGTTGTTTTT AGAGCTAATG 1020
CTATTGCAAG CCTTTTTGCT TGGACTGGAG CACAAGCTAT GTATCAAGGA TTCTGGAGTG 1080

AAGCAGATGT TACTCGACCT TTTGTCTCCC AGGCTGTGAT CACAGATGGA AAATACTTTT 1140
CCTTTTTCTG CTACCAGCTA AATACTTTGG CACTGACTAC ACAAGCTGAT CAAAATAACC 1200
CTCGTAAAAA TATATGTTGG GGTACACAAA GTAAGCCTCT TTATGAAACA ATTGAGGATA 1260
ATGATGTGAA AGGTTTTAAT GATGATGTTC TACTTCAGAT AGTTCACTTT CTACTGAATA 1320
GACCAAAAGA AGAAAAATCA CAGCTGTTGG AAAACTGAAA AAGCATATTT GATTGAGAAC 1380

TGTGGGAATA TTTAAATTTT ACTGAAGGAA CAATAATGAT GAGATTTGTA ACTGTCAACT 1440
ATTAAATACA TTGATTTTTG AGACAAATAT TTCTTATGTC AACCTQTTAT TAGATCTCTT 1500
ACTCTGCTCA AATTCATCAC TGAAAGATTT AATTTTAGTT ACCTTTTGTT GATTTAAAAA 1560
TAATTGCATT TGTATATTGC TAACTGATAA GACAAATTGA GTTATTGAGC TATTAAATGC 1620
ACATTTTAAT ATAAATGCAG AAATCCCAAA TAAAATGCTA ACATACTGAA TTCAGTAATT 1680

AAAAGAACCC ACTGC



Seq ID NO: 16 Protein sequence: 
Protein Accession #: NP_057724.1

1 11 21 31 41 51 

MAAARCWRPL LRGPRLSLHT AANAAATATE TTSQDVAATP VARYPPIVAS MTADSKAARL 60
RRIERWQATV HAAESVDEKL RILTKMQFMK YMVYPQTFAL NADRWYQYFT KTVFLSGLPP 120

PPAEPEPEPE PEPEPALDLA ALRAVACDCL LQEHFYLRRR RRVHRYEESE VISLPFLDQL 180

VSTLVGLLSP HNPALAAAAL DYRCPVHFYW VRGEEIIPRG HRRGRIDDLR YQIDDKPNNQ 240
IRISKQLAEF VPLDYSVPIE IPTIKCKPDK LPLFKRQYEN HIFVGSKTAD PCCYGHTQFH 300
LLPDKLRRER LLRQNCADQI EVVFRANAIA SLFAWTGAQA MYQGFWSEAD VTRPFVSQAV 360
ITDGKYFSFF CYQLNTLALT TQADQNNPRK NICWGTQSKP LYETIEDNDV KGFNDDVLLQ 420

IVHFLLNRPK EEKSQLLEN

Seq ID NO: 17 DNA sequence

Nucleic Acid Accession #: NM_025059.1

Coding sequence: 3-2150 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51

GCATGAGCCT GGACTGCACC AGCCATATCG CGCTGGGTGC CGCTTCGCCA GCGCCCGAGG 60

AAACTTACGA TCATCTTTCG GAAGTCCCGG TCACGCGGGA GCAGTTAAAC CACTATCGGA 120
ATGTGGCTCA AAATGCTCGA AGTGAACTTG CAGCAACTTT GGTCAAATTT GAATGTGCTC 180
AGTCTGAGCT TCAAGACCTC CGATCCAAGA TGCTTTCTAA AGAAGTCTCC TGTCAAGAAC 240
TGAAAGCTGA AATGGAGAGC TACAAGGAAA ACAATGCCAG AAAATCATCT CTCCTTACCT 300
CTTTGAGAGA CAGAGTTCAG GAACTAGAAG AAGAATCAGC AGCACTTTCC ACTTCTAAAA 360

TCAGAACAGA AATCACAGCT CACGCTGCAA TCAAGGAGAA CCAGGAATTA AAGAAGAAAG 420
TTGTAGAGTT AAATGAAAAA TTACAAAAGT GTTCAAAAGA AAATGAGGAG AATAAGAAAC 480
AAGTTTCAAA GAATTGCAGG AAACATGAGG AATTTCTGAC TCAACTGCGT GACTGCTTGG 540
ATCCAGATGA GAGGAATGAC AAGGCATCAG ATGAAGATTT AATTTTAAAG CTTAGAGACC 600
TGCGCAAAGA AAATGAATTC GTGAAAGGAC AAATTGTTAT TCTTGAAGAG ACTATAAATG 660

TCCATGAGAT GGAAGCAAAA GCTAGCAGAG AAACGATCAT GAGGCTGGCT TCAGAAGTCA 720
ACAGAGAGCA GAAAAAAGCT GCCTCCTGTA CTGAAGAGAA AGAGAAGCTG AACCAGGACC 780
TGCTCAGTGC TGTAGAAGCA AAAGAAGCTC TTGAAAGGGA AGTTAAGATC TTCCAAGAAA 840
GGCTGCTTGC TGGCCAGCAG GTCTGGGATG CCTCAAAGCA GGAAGTGAGC CTCCTGAAGA 900
AAAGCTCTTC TGAGTTGGAG AAGAGTTTGA AGGCCAGTCA GGATGCAGTC ACAACCTCAC 960

AAAGCCAGTA CTCCTCATTT AGGGAGAAAA TCGCAGCCCT CCTTAGGGGC AGATTGAGCA 1020
TGACTGGGTC CACTGAGGAC ACCATTTTGG AGAAGATTCG AGAAATGGAC AGCCGGGAAG 1080
AAAGCAGGGA CCGGATGGTC TCCCAGCTTG AAGCCCAAAT ATCTGAGCTT GTTGAACAGT 1140
TGGGAAAGGA GTCTGGGTTT CACCAGAAAG CTCTCCAGAG GGCCCAGAAA GCAGAGAATA 1200
TGTTGGAGAC TCTTCAGGGT CAGCTGACAC ACCTGGAGGC AGAGCTGGTT TCTGGAGGTG 1260

TTTTGCGAGA CAACTTGAAT TTTGAGAAAC AAAAATATCT TAAATTTCTG GATCAGCTTT 1320
CTCAGAAAAT GAAGTTGGAC CAGATGGCTG CCGAACTTGG CTTTGACACG CGGCTGGACG 1380
TGGTTTTAGC TCGAACAGAG CAGCTGGTTC GTCTTGAGAG CAATGCAGTC ATTGAGAACA 1440
AGACCATTGC CCACAATTTG CAGAGAAAGC TAAAGACACA GAAAGAGAGA CTGGAGAGCA 1500
AAGAATTACA CATGAGCCTC CTCCGGCAGA AAATAGCCCA GCTGGAGGAG GAGAAGCAGG 1560

CACGCACGGC CTTGGTGGTT GAGAGGGACA ACGCGCATCT TACCATCAGG AACTTGCAGA 1620
AGAAGGTGGA GAGGCTGCAG AAAGAGCTGA ACACGTGTCG AGACTTGCAC ACCGAGCTCA 1680 
AAGCCAAACT GGCCGACACC AATGAACTGA AGATTAAAAC TTTGGAACAG ACTAAAGCCA 1740 
TTGAAGATCT AAACAAATCC AGAGACCAAC TGGAGAAGAT GAAGGAGAAA GCTGAGAAAA 1800 
AGCTCATGTC TGTCAAGTCA GAACTGGATA CCACAGAACA TGAGGCTAAG GAGAATAAAG 1860 

AAAGGGCCAG AAACATGATA GAAGTGGTAA CCAGTGAAAT GAAGACACTA AAAAAATCTC 1920
TGGAAGAAGC AGAAAAGAGA GAAAAGCAGC TGGCAGACTT CAGGGAGGTG GTGTCGCAGA 1980
TGCTAGGCTT GAACGTGACC AGCCTTGCTC TTCCTGATTA TGAAATCATC AAGTGTCTTG 2040
AAAGATTGGT CCATTCACAT CAGCATCACT TTGTTACCTG TGCCTGCCTC AAAGATGTGA 2100
CTACTGGGCA AGAGAGGCAC CCACAAGGCC ATTTACAGCT TCTTCATTGA ACACTGTATC 2160
TCTTGAGAGA GGTGGCCATA AGACATGGCA CACAATTCCC AATTTCACAA ATTCCTCATG 2220
TCTTTGAGAT TTGATCAGTT TGTGAATATT TTATGCTTTG ATGATATAGT GAGAATGCAT 2280
CACTTGCAAA AACGATCTCA AAAGTGTCAG CCTTAGATAA ACGTCAGCAT TAAAAAACGC 2340
CAAAAAAAAA AAAAAAAAGC ATTTTAGGAT CCAGAAGAAT TCCACCAGAT TGCATGAGTT 2400
AGATTGGGAA ATGGGAGTGG GAGATAATAT TGGGAGGTAT CTATTTTAAG TCAGGGGCTT 2460
TACTAGCCGA TTTAGTTCTC ACAATAACCA TGTGGAGAAG CTGTGACATT TTTAATTTAC 2520
AACCTTTCTG GGGCTCAGAC ATAAAGTTAC CTATCCAAGG TTGCAGTTGG GTAGTGGTGG 2580 

GACCAGGATG GACAACTCAT TGGCCCTGCC TCAAAAGCCA TACCTCTTCT CCTGCTATGC 2640
AGAATCTGTT TCTCCTGAAT CTCTGTGATG CTGGTGGGAA TTGTTTGCAT AGAGGAAGGA 2700
CAATAACCCT GCCATCGTGA GTTAATGTCC GGGCTGGTCA CAGTGGTTCA TGCCTGTAAT 2760
CCCAGCACTT TGGGAGTCCA AGGCAGGCAT ATCATTTGAG GTCAGGAGTT TAAGACCAGC 2820
CTGGCTAACA TAGTGAGACC CTGTTTCTAC TAAAAATACA AAAATAAGCC AGGTGTGGTG 2880

GTGCATGACT GTAATCCCAA CTACTCAGCA GGAGAAGCAC TTGAACCCAG GAGACGGAGG 2940
CGGCAGTGAG CCAAGATTGT GCCACTGCAC TCCAGCCTGG GCGACAGAGT GAAACTACAT 3000
CTCAGAAAAA AAAAAAAAAA AAAAAAAAAA AAGTTAATGT CCAAAAATGA CAGATTTACA 3060 
AGTGTAAGCT ATATGATTTC TTCAAAAAGC AAAAGCAATA TACCTAATTC ATTTGGATCA 3120 
AACTTACATA GGTCTCAGGT CCTGTAAGAA ACTTGCCTGT TCTAACTGTT GCTACCAGAT 3180

TATATCTGGT GGTAATTGTT AATGTTTCAG CAGGGCTGGT CTCAGTCCTT TAAAATGGAA 3240
AGCTTTATTT GGAGCCCAAC CCTATAAGAT GAAGAAATCC TATATAGTCT TATTCACCAA 3300
TATATCCAAT ACACCCACAG CAATGGTACC TTTTTAAGAT CAGGATTTTA TTATGAATTC 3360
CTGTCACTTT CTGTTTTCCA TTTAAATTTC TATTTTACAA ATTTTTCAGG GAATCATATT 3420
CTTAACTTCA CTGAGAAAAA TGTGTTACTC TTTTGGACAA TTTATCTTAT TTCTATCATA 3480

TAAGATGTAT TTTTTTATTG TCCTTAAAAG AAGCTCTAGC ATGAAATTAA AGGAAAGGGA 3540
AAGAAATAGA TCTGGTGCAC CCGAACATTA GGAGAAAATG AAAAATATAC AACCAACCGT 3600
TCGTGAGTCA TCAAAAAGTC AAAGTCAGCC TGGCCAACAT GGCAAAACTC CGTCTCTGCA 3660
AAAAATACAA AAATGAGCCC GGTATGTTGG CATATGCCTG TAATCCCAGC TACTCGAGAG 3720 
GCTGAGACAC GAAAATTGCT TGAACCTGGG AGGCGGAGGT TGCAATGAGC CGAGATCGCG 3780 

CTACTGCACT CCAGCCTGGG CAACAGAGAG AGACCTTGTC TCAAAAAACA ACAACAACAA 3840
AAAGTCAAAG TCATAATAAG CAAATTATTG GCTTCTTTCT TCTAGACYAA AAGAAATTAA 3900
AGAGATGAAA CAATCAATTG CAAGGGTCAA AACTAGATTG GATCTTGGTT TGAATGAAAA 3960
AAAGCATAAA ATATTCTTGC AATAATTGTA AAAATTTGAA TGTGGACTAA GTCCTAGATT 4020
ATATTAAAAT ATTTTTAATT TTTTAAGCTT GACAAATGCA CTGATTGTTA TACTTTAAAT 4080

AACTAAAAAT CTGAGAATCC ACAGTGCTAC AGACAATAAA TGATAAAATG GGAAAAAAAA 4140
AAAAAAAAAA A

Seq ID NO: 18 Protein sequence:
Protein Accession #: NP_079335.1

1 11 21 31 41 51 

MSLDCTSHIA LGAASPAPEE TYDHLSEVPV TREQLNHYRN VAQNARSELA ATLVKFECAQ 60 
SELQDLRSKM LSKEVSCQEL KAEMESYKEN NARKSSLLTS LRDRVQELEE ESAALSTSKI 120
RTEITAHAAI KENQELKKKV VELNEKLQKC SKENEENKKQ VSKNCRKHEE FLTQLRDCLD 180
PDERNDKASD EDLILKLRDL RKENEFVKGQ IVILEETINV HEMEAKASRE TIMRLASEVN 240
REQKKAASCT EEKEKLNQDL LSAVEAKEAL EREVKIFQER LLAGQQVWDA SKQEVSLLKK 300
SSSELEKSLK ASQDAVTTSQ SQYSSFREKI AALLRGRLSM TGSTEDTILE KIREMDSREE 360
SRDRMVSQLE AQISELVEQL GKESGFHQKA LQRAQKAENM LETLQGQLTH LEAELVSGGV 420
LRDNLNFEKQ KYLKFLDQLS QKMKLDQMAA ELGFDTRLDV VLARTEQLVR LESNAVIENK 480
TIAHNLQRKL KTQKERLESK ELHMSLLRQK IAQLEEEKQA RTALVVERDN AHLTIRNLQK 540
KVERLQKELN TCRDLHTELK AKLADTNELK IKTLEQTKAI EDLNKSRDQL EKMKEKAEKK 600
LMSVKSELDT TEHEAKENKE RARNMIEVVT SEMKTLKKSL EEAEKREKQL ADFREVVSQM 660
LGLNVTSLAL PDYEIIKCLE RLVHSHQHHF VTCACLKDVT TGQERHPQGH LQLLH

Seq ID NO: 19 DNA sequence 

Nucleic Acid Accession #: AF071552, NM_000662

Coding sequence: 441-1313 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51

CTTTGTATAA CGCTCAGCTA AAAGGGAAAT TGAGTGGGTC AGGTACCACG GATACTATAC 60
ACTCTATTGC ATGATTCTCC TGCCTACATC AGAAGACGTT TATAAGCCTA TTTTAAAGGA 120
TACCAGTTGG AATCTCTCTT TTATTAATCA CCAAGAGAAC CATGAACAAG CTGTTTATCA 180
TTTGACTCAT CATTTAATCT TGATTTCCAG CTTCTCACAC TTGAAAGAAG ACATAATACA 240
TTTCTCACAG GATTCTGGGA CTATTAACTG AACTTATGTG TGTAAAAGGA ATTCATACAA 300 

TGAAAGCACT AGAAATAATT ATTATACTTA TAACCATTGT ATTTTTACAT GTTTAAAATA 360
TAGCCATAAT TAGCCTACTC AAATCCAAGT GTAAAAGTAA AATGATTTGC TTTCGTTTTG 420
TTTTCCTTGC TTAGGGGATC ATGGACATTG AAGCATATCT TGAAAGAATT GGCTATAAGA 480
AGTCTAGGAA CAAATTGGAC TTGGAAACAT TAACTGACAT TCTTCAACAC CAGATCCGAG 540
CTGTTCCCTT TGAGAACCTT AACATCCATT GTGGGGATGC CATGGACTTA GGCTTAGAGG 600

CCATTTTTGA TCAAGTTGTG AGAAGAAATC GGGGTGGATG GTGTCTCCAG GTCAATCATC 660
TTCTGTACTG GGCTCTGACC ACTATTGGTT TTGAGACCAC GATGTTGGGA GGGTATGTTT 720
ACAGCACTCC AGCCAAAAAA TACAGCACTG GCATGATTCA CCTTCTCCTG CAGGTGACCA 780
TTGATGGCAG GAACTACATT GTCGATGCTG GGTTTGGACG CTCATACCAG ATGTGGCAGC 840
CTCTGGAGTT AATTTCTGGG AAGGATCAGC CTCAGGTGCC TTGTGTCTTC CGTTTGACGG 900
AAGAGAATGG ATTCTGGTAT CTAGACCAAA TCAGAAGGGA ACAGTACATT CCAAATGAAG 960
AATTTCTTCA TTCTGATCTC CTAGAAGACA GCAAATACCG AAAAATCTAC TCCTTTACTC 1020
TTAAGCCTCG AACAATTGAA GATTTTGAGT CTATGAATAC ATACCTGCAG ACATCTCCAT 1080
CATCTGTGTT TACTAGTAAA TCATTTTGTT CCTTGCAGAC CCCAGATGGG GTTCACTGTT 1140
TGGTGGGCTT CACCCTCACC CATAGGAGAT TCAATTATAA GGACAATACA GATCTAATAG 1200 
AGTTCAAGAC TCTGAGTGAG GAAGAAATAG AAAAAGTGCT GAAAAATATA TTTAATATTT 1260 
CXTTGCAGAG AAAGCTTGTG CCCAAACATG GTGATAGATT TTTTACTATT TAGAATAAGG 1320 
AGTAAAACAA TCTTGTCTAT TTGTCATCCA GCTCACCAGT TATCAACTGA CGACCTATCA 1380 
TfiTATCTTrrr GTACCCTTAC CTTATTTTGA AGAAAATCCT AGACATCAAA TCATTTCACC 1440 
TATAAAAATG TCATCATATA TAATTAAACA GCTTTTTAAA GAAACATAAC CACAAACCTT 1500 
TTCAAATAAT AATAATAATA ATAATAATAA ATGTATTTTA AAGATGGCCT GTGGTTATCT 1560 
TGGAAATTGG TGATTTATGC TAGAAAGCTT TTAATGTTGG TTTATTGTTG AATTC 
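
The rows in this listing are printed as six blocks of ten bases followed by a running position count, and each "Coding sequence" header gives 1-based, inclusive coordinates. The following is a minimal Python sketch, not part of the original listing, of how such rows might be joined and the coding region sliced out. The function names and the toy row are hypothetical and chosen only for illustration; the sketch also assumes the rows contain no digits other than position counts and margin numbers.

```python
import re


def parse_listing_rows(rows):
    """Join listing rows into one uppercase sequence string.

    Whitespace and digits are dropped, which removes the block
    spacing, any margin line numbers, and the running position
    count printed at the end of each row.
    """
    return "".join(re.sub(r"[\s\d]", "", row) for row in rows).upper()


def coding_region(sequence, start, end):
    """Return positions start..end (1-based, inclusive) of sequence."""
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]


# Hypothetical toy row (not taken from the listing): the coding
# region runs from position 13 to position 30.
toy_rows = ["AAAAAAAAAA CCATGGACAT TGAAGCATAG TTTTTTTTTT 40"]
toy_sequence = parse_listing_rows(toy_rows)
toy_cds = coding_region(toy_sequence, 13, 30)
```

Under these assumptions, the slice begins with the start codon and ends with the stop codon, which is a quick sanity check on the stated coordinates of any entry.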

Seq ID NO: 20 Protein sequence: 
Protein Accession #: NP_000653.1



1 11 21 31 41 51 

MDIEAYLERI GYKKSRNKLD LETLTDILQH QIRAVPFENL NIHCGDAMDL GLEAIFDQVV 60
RRNRGGWCLQ VNHLLYWALT TIGFETTMLG GYVYSTPAKK YSTGMIHLLL QVTIDGRNYI 120
VDAGFGRSYQ MWQPLELISG KDQPQVPCVF RLTEENGFWY LDQIRREQYI PNEEFLHSDL 180
LEDSKYRKIY SFTLKPRTIE DFESMNTYLQ TSPSSVFTSK SFCSLQTPDG VHCLVGFTLT 240
HRRFNYKDNT DLIEFKTLSE EEIEKVLKNI FNISLQRKLV PKHGDRFFTI
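
Each protein entry can be cross-checked against its DNA entry by translating the stated coding region with the standard genetic code. The following is a minimal Python sketch, not part of the original listing, that translates the first thirty bases of the coding sequence of Seq ID NO: 19 (positions 441-470) for comparison with the first ten residues of Seq ID NO: 20. The helper name `translate` is hypothetical, and codons containing characters outside A, C, G, T (for example extraction damage) are reported as X.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon.

    Codons that are not in the standard table are rendered as 'X'.
    A trailing partial codon is ignored.
    """
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3].upper(), "X")
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# Positions 441-470 of Seq ID NO: 19, as printed in the listing.
first_thirty_bases = "ATGGACATTGAAGCATATCTTGAAAGAATT"
first_ten_residues = translate(first_thirty_bases)
```

A mismatch between the translation and the printed protein row points to damage in one of the two entries rather than settling which one is correct.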

Seq ID NO: 21 DNA sequence 

Nucleic Acid Accession #: NM_003714

Coding sequence: 123-1031 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51

CGGCACGAGC AAAAAGGAAG AGTGGGAGGA GGAGGGGAAG CGGCGAAGGA GGAAGAGGAG 60 
GAGGAGGAAG AGGGGAGCAC AAAGGATCCA GGTCTCCCGA CGGGAGGTTA ATACCAAG AA 120 
CCATGTGTGC CG AGCGGCTG GGCCAGTTCA TG ACCCTGGC TTTGGTGTTG GCCACCTTTG 180 
ACCCGGCGCG GGGGACCGAC GCCACCAACC CACCCGAGGG TCCCCAAGAC AGGAGCTCCC 240 
AGCAGAAAGG CCGCCTGTCC CTGCAGAATA CAGCGGAG AT CCAGCACTGT TTGGTCAACG 300 
CTGGCGATGT GGGGTGTGGC GTGTTTG AAT GTTTCGAGAA CAACTCTTGT GAGATTCGGG 360 
GCTTACATGG GATTTGCATG ACTTTTCTGC ACAACGCTGG AAAATTTGAT GCCCAGGGCA 420 
AGTCATTCAT CAAAG ACGCC TTG AAATGTA AGGCCCACGC TCTGCGGCAC AGGTTCGGCT 480 
GCATAAGCCG GAAGTGCCCG GCCATCAGGG AAATGGTGTC CCAGTTGCAG CGGGAATGCT 540 
ACCTCAAGCA CG ACCTGTGC GCGGCTGCCC AGGAGAACAC CCGGGTGATA GTGG AG ATGA 600 
TCCATTTCAA GG ACTTGCTG CTGCACGAAC CCTACGTGG A CCTCGTGAAC TTGCTGCTGA 660 
CCTGTGGGGA GGAGGTGAAG GAGGCCATCA CCCACAGCGT GCAGGTTCAG TGTGAGCAGA 720 
ACTGGGGA AG CCTGTGCTCC ATCTTGAGCT TCTGCACCTC GGCCATCCAG AAGCCTCOCA 780 
CGGCGCCCCC CG AGCGCCAG CCCCAGGTGG ACAGAACCAA GCTCTCCAGG GCCCACCACG 840 
GGGAAGCAGG ACATCACCTC CCAGAGCCCA GCAGTAGGGA GACTGGCCGA GGTGCCAAGG 900 
GTGAGCGAGG TAGCAAGAGC CACCCAAACG CCCATGCCCG AGGCAGAGTC GGGGGCCTTG 960 
GGGCTCAGGG ACCTTCCGGA AGCAGCGAGT GGGAAGACGA ACAGTCTGAG TA TTCT GATA 1020 
TCCGGAGGTG AAATGAAAGG CCTGGCCACG AAATCTTTCC TCCACGCCGT CCATTTTCTT 1080 
ATCrATGGACATTCCAAAAC ATTTACCATT AGAG AGGGGG GATGTCACAC GCAGG ATTCT 1 140 
GTGGGGACTG TGGACTTCAT CGAGGTGTGT GTTCGCGGAA CGGACAGGTG AGATGGAGAC 1200 
CCCTGGGGCC GTGGGGTCTC AGGGGTGCCT GGTGAATTCT GCACTTACAC GTACTCAAGG 1260 
GAGCGCGCCC GCGTTATCCT CGTACCTTTG TCTTCTTTCC ATCTGTGGAG TCAGTGGGTG 1320 
TCGGCCGCTC TGTTGTGGGG GAGGTGAACC AGGGAGGGGC AGGGCAAGGC AGGGCCCCCA 1380 
GAGCTGGGCC ACACAGTGGG TGCTGGGCCT CGCCCCGAAG CTTCTGGTGC AGCAGCCTCT 1440 
GGTGCTGTCT CCGCGGAAGT CAGGGCGGCT GGATTCCAGG ACAGGAGTGA ATGTAAAAAT 1500 
AAATATCGCT TAGAATGCAG GAGAAGGGTG GAGAGGAGGC AGGGGCCGAG GGGGTGCTTG 1560 
GTGCCAAACT GAAATTCAGT TTCTTGTGTG GGGCCTTGCG GTTCAGAGCT CTTGGCGAGG 1620 
GTGGAGGGAG GAGTGTCATT TCTATGTGTA ATTTCTGAGC CATTG TACTG TCTGGGCTGG 1680 
GGGGGACACT GTCCAAGGOA GTGGCCCCTA TGAGTTTATA TTTTAACCAC TGCTTCAAAT 1740 
CTCG ATTTCA CTTTTTTTAT TTATCCAGTT ATATCTACAT ATCTGTCATC TAAATAAATG 1 800 
GCTTTCAAAC AA AA A A AA AA AAAAAAAAAA AAAAAAA 

Seq ID NO: 22 Protein sequence: 
Protein Accession #: NP_003705



1 11 21 31 41 51 

MCAERLGQFM TLALVLATFD PARGTDATNP PEGPQDRSSQ QKGRLSLQNT AEIQHCLVNA 60 
GDVGCGVFEC FENNSCEIRG LHGICMTFLH NAGKFDAQGK SFIKDALKCK AHALRHRFGC 120
ISRKCPAIRE MVSQLQRECY LKHDLCAAAQ ENTRVIVEMI HFKDLLLHEP YVDLVNLLLT 180
CGEEVKEAIT HSVQVQCEQN WGSLCSILSF CTSAIQKPPT APPERQPQVD RTKLSRAHHG 240 
EAGHHLPEPS SRETGRGAKG ERGSKSHPNA HARGRVGGLG AQGPSGSSEW EDEQSEYSDI 300

Seq ID NO: 23 DNA sequence
Nucleic Acid Accession #: NM_005264.1

Coding sequence: 557-1954 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

GAATTCCGGC CAGAAGAAAT CTGGCCTCGG AACACGCCAT TCTCCGCGCC GCTTCCAATA 60 
ACCACTAACA TCCCTAACGA GCATCCGAGC CGAGGGCTCT GCTCGGAAAT CGTCCTGGCC 120 
CAACTCGGCC CTTCGAGCTC TCGAAGATTA CCGCATCTAT TTTTTTTTTC TTTTTTTTCT 1 80 
TTTCCTAGCG CAGATAAAGT GAGCCCGGAA AGGG AAGGAG GGGGCGGGGA CACCATTGCC 240 

CTGAAAGAAT AAATAAGTAA ATAAACAAAC TGGCTCCTCG CCGCAGCTGG ACGCGGTCGG 300
TTGAGTCCAG GTTGGGTCGG ACCTGAACCC CTAAAAGCGG AACCGCCTCC CGCCCTCGCC 360 
ATCCCGGAGC TGAGT CGCC G GCGGCGGTGG CTGCTGCCAG ACCCGGAGTT TCCTCTTTCA 420 
CTGGATGGAG CTGAACTTTG GGCGGCCAGA GCAGCACAGC TGTCCGGGGA TCGCTGCACG 480 
CTGAGCTCCC TCGGCAAGAC CCAGCGGCGG CTCGGGATTT TTTTGGGGGG GCGGGGACCA 540 

GCCCCGCGCC GGCACCATGT TCCTGGCGAC CCTGTACTTC GCGCTGCCGC TCTTGGACTT 600
GCTCCTGTCG GCCGAAGTGA GCGGCGGAGA CCGCCTGGAT TGCGTGAAAG CCAGTGATCA 660 
GTGCCTGAAG GAGCAGAGCT GCAGCACCAA GTACCGCACG CTAAGGCAGT GCGTGGCGGG 720 
CAAGGAGACC AACTTCAGCC TGGCATCCGG CCTGGAGGCC AAGGATG AGT GCCGCAGCGC 780 
CATGGAGGCC CTGAAGCAGA AGTCGCTCTA CAACTGCCGC TGCAAGCGGG GTATGAAGAA 840 

GGAGAAGAAC TGCCTGCGCA TTTACTGGAG CATGTACCAG AGCCTGCAGG GAAATGATCT 900
GCTGGAGGAT TCCCCATATG AACCAGTTAA CAGCAGATTG TCAGATATAT TCCGGGTGGT 960 
CCCATTCATA TCAGATGTTT TTCAGCAAGT GGAGCACATT CCCAAAGGGA ACAACTGCCT 1020 
GGATGCAGCG AAGGCCTGCA ACCTCGACGA CATTTGCAAG AAGTACAGGT CGGCGTACAT 1080 
CACCCCGTGC ACCACCAGCG TGTCCAACGA TGTCTGCAAC CGCCGCAAGT GCCACAAGGC 1140 

CCTCCGGCAG TTCTTTGACA AGGTCCCGGC CAAGCACAGC TACGGAATGC TCTTCTGCTC 1200
CTGCCGGGAC ATCGCCTGCA CAGAGCGGAG GCGACAGACC ATCGTGCCTG TGTGCTCCTA 1260 
TGAAGAGAGG GAGAAGCCCA ACTGTTTGAA TTTGCAGGAC TCCTGCAAGA CGAATTACAT 1320 
CTGCAG ATCT CGCCTTGCGG All 111 1 1 AC CA ACTGCCAG CCAG AGTCAA GGTCTGTCAG 1 380 
CAGCTGTCTA AAGG AAAACT ACGCTG ACTG CCTCCTCGCC TACTCGGGGC TTATTGGCAC 1440 

AGTCATGACC CCCAACTACA TAGACTCCAG TAGCCTCAGT GTGGCCCCAT GGTGTGACTG 1500
CAGCAACAGT GGGAACGACC TAGAAG AGTG CTTGAAATTT TTG AATTTCT TCAAGG ACAA 1 560 
TACATGTCTT AAAAATGCAA TTCAAGCCTT TGGCAATGGC TCCGATGTGA CCGTGTGGCA 1620 
GCCAGCCTTC CCAGTACAGA CCACCACTGC CACTACCACC ACTGCCCTCC GGGTTAAGAA 1680 
CAAGCCCCTG GGGCCAGCAG GGTCTGAGAA TGAAATTCCC ACTCATGTTT TGCCACCGTG 1740 

TGCAAATTTA CAGGCACAGA AGCTGAAATC CAATGTGTCG GGCAATACAC ACCTCTGTAT 1800
TTCCAATGGT AATTATGAAA AAGAAGGTCT CGGTGCTTCC AGCCACATAA CCACAAAATC 1860 
AATGGCTGCT CCTCCAAGCT GTGGTCTGAG CCCACTGCTG GTCCTGGTGG TAACCGCTCT 1920 
GTCCACCCTA TTATCTTTAA CAGAAACATC ATAGCTGCAT TAAAAAAATA CAATATGGAC 1980 
ATGTAAAAAG ACAAAA ACCA AGTTATCTGT TTCCTGTTCT CTTGTATAGC TGAAATTCCA 2040 

GTTTAGGAGC TCAGTTGAGA AACAGTTCCA TTCAACTGGA ACATTTTTTT TTTTCCTTTT 2100
AAGAAAGCTT CTTGTGATCC TTCGGGGCTT CTGTGAAAAA CCTG ATGCAG TGCTCCATCC 2160 
AAACTCAGAA GGCTTTGGGA TATGCTGTAT TTTAAAGGGA CAGTTTGTAA CTTGGGCTGT 2220 
A AAGCA AACT GGGGCTGTGT TTTCGATGAT GATGATCATC ATGATCATGA TGATTTTAAC 2280 
AGTTTT ACTT CTGGCCTTTC CTAGCTAGAG AAGGAGTTAA TATTTCTA AG GTAACTCCCA 2340 

TATCTCCTTT AATGACATTG ATTTCTAATG ATATAAATTT CAGCCTACAT TGATGCCAAG 2400
CTTTTTTGCC ACAAAGAAG A TTCTTACCAA GAGTGGGCTT TGTGGAAACA GCTGGTACTG 2460 
ATGTTCACCT TTATATATGT ACTAGCATTT TCCACGCTGA TGTTTATGTA CTGTAAACAG 2520 
TTCTGCACTC TTGTACAAAA GAAAAAACCA CCCGGAATTC 

Seq ID NO: 24 Protein sequence:

Protein Accession #: NP_005255

1 11 21 31 41 51

MFLATLYFAL PLLDLLLSAE VSGGDRLDCV KASDQCLKEQ SCSTKYRTLR QCVAGKETNF 60
SLASGLEAKD ECRSAMEALK QKSLYNCRCK RGMKKEKNCL RIYWSMYQSL OGNDLLEDSP 120 
YEPVNSRLSD IFRWPFISD VFQQVEHIPK GNNCLDAAKA CNLDDICKKY RSAYITPCTT 1 80 
SVSNDVCNRR KCHKALRQFF DK VPAKHSYG MLFCSCRDIA CTERRRQnV PVCSYEEREK. 240 
PNCLNLQDSC KTNYICRSRL ADFFTNCQPE SRSVSSCLKE NYADCLLAYS GLIGTVMTPN 300 

YIDSSSLSVA PWCDCSNSGN DLEECLKFLN FFKDNTCLKN AIQAFGNGSD VTVWQPAFPV 360
QTTTATTTTA LRVKNKPLGP AGSENEIPTH VLPPCANLQA QKLKSNVSGN THLCISNGNY 420 
EKEGLGASSH ITTKSMAAPP SCGLSPLLVL WTALSTLLS LTETS 

Seq ID NO: 25 DNA sequence 
Nucleic Acid Accession #: FGENESH predicted

Coding sequence: 1-1576 (entire sequence represents open reading frame)

1 11 21 31 41 51
| | | | | |
CTTTTGTTTC GCCATGCCTA GTCTAGTGGT ATCTGGAATA ATGGAAAGAA ATGGGGGCTT 60

TGGAGAACTA GGATGTTTCG GGGGAAGCGC TAAGGACCGA GGGCTGCTGG AAGACGAGCG 120 
CGCCCTTCAG CTGGCTCTCG ATCAACTCTG CCTCCTGGGT TTGGGGGAGC CCCCCGCCCC 180
CAGGGCGGGC GAGGACGGGG GAGGTGGGGG GGGCGGCGCC CCCGCGCAGC CGACAGCCCC 240
CCCGCAGCCG GCGCCGCCGC CGCCGCCCGC GGCGCCCCCG GCCGCCCCGA CGACGGCCCC 300 
CGCAGCGCAG ACGCCCCAGC CCCCCACCGC CCCCAAAGGG GCGAGCGACG CCAAGCTCTG 360 
CGCTCTCTAC AAAGAGGCCG AGCTGCGCCT GAAGGGCAGC AGCAACACCA CGGAGTGTGT 420 
TCCCGTGCCC ACCTCCGAGC ACGTGGCCGA GATCGTGGGC AGGCAAGGCT GCAAGATTAA 480
GGCCTTG AGG GCCAAGACCA ACACCTACAT CAAG ACACCG GTGAGGGGCG AGGAACCAGT 540 
GTTCATGGTG ACAGGGCGAC GGGAGGACGT GGCCACAGCC CGGCGGGAAA TCATCTCAGC 600 
AGCGGAGCAC TTCTCCATGA TCCGTGCCTC CCGCAACAAG TCAGGCGCCG CCTTTGGTGT 660 
GGCTCCTGCT CTGCCCGGCC AGGTGACCAT CCGTGTGCGG GTGCCCTACC GCGTGGTGGG 720 

GCTGGTGGTG GGCCCCAAAG GGGCAACCAT CAAGCGCATC CAGCAGCAAA CCAACACATA 780
CATTATCACA CCAAGCCGTG ACCGCGACCC CGTGTTCGAG ATCACGGGTG CCCCAGGCAA 840
CGTGGAGCGT GCGCGCGAGG AGATCGAGAC GCACATCGCG GTGCGCACTG GCAAGATCCT 900 
CG AGTACAAC AATGAAAACG ACTTCCTGGC GGGGAGCCCC GACGCAGCAA TCGATAGCCG 960 
CTACTCCGAC GCCTGGCGGG TGCACCAGCC CGGCTGCAAG CCCCTCTCCA CCTTCCGGCA 1020

GAACAGCCTG GGCTGCATCG GCGAGTGCGG AGTGGACTCT GGCTTTGAGG CCCCACGCCT 1080
GGGTGAGCAG GGCGGGGACT TTGGCTACGG CGGGTACCTC TTTCCGGGCT ATGGCGTGGG 1 140 
CAAGCAGGAT OTGTACTACG GCGTGGCCGA GACTAGCCCC CCGCTGTGGG CGGGCCAGGA 1200 
GAACGCCACG CCCACCTCCG TGCTCTTCTC CTCTGCCTCC TCCTCCTCCT CCTCTTCCGC 1260 
CAAGGCCCGC GCTGGGCCCC CGGGCGCACA CCGCTCCCCT GCCACTTCCG CGGGACCCGA 1320 

GCTGGCCGGA CTCCCGAGGC GCCCCCCGGG AGAGCCGCTC CAGGGCTTCT CTAAACTTGG 1380
TGGGGGCGGC CTGCGGAGCC CCGGCGGCGG GCGGGATTGC ATGGTCTGCT TTGAGAGCGA 1440 
AGTGACTGCC GCCCTTGTGC CCTGCGGACA CAACCTGTTC TGCATGGAGT GTGCAGTACG 1500 
CATCTGCGAG AGGACGGACC CAGAGTGTCC CGTCTGCCAC ATCACAGCCA CGCAAGCCAT 1560 
CCGAATATTC TCCTAA 


Seq ID NO: 26 Protein sequence:

Protein Accession #: FGENESH predicted 



1 11 21 31 41 51

FCFAMPSLVV SGIMERNGGF GELGCFGGSA KDRGLLEDER ALQLAIDQLC LLGLGEPPAP 60 
RAGEDGGGGG GG APAQPTAP PQPAPPPPPA APPAAPTTAP AAQTPQPPTA PKG ASDAKLC 120 
ALYKEAELRL KGSSNTTECV PVPTSEHVAE IVGRQGCKIK ALRAKTNTYI KTPVRGEEPV 180 

FMVTGRREDV ATARREIISA AEHFSMIRAS RNKSGAAFGV APALPGQVTI RVRVPYRVVG 240
LVVGPKGATI KRIQQQTNTY IITPSRDRDP VFEITGAPGN VERAREEIET HIAVRTGKIL 300
EYNNENDFLA GSPDAAIDSR YSDAWRVHQP GCKPLSTFRQ NSLGCIGECG VDSGFEAPRL 360 
GEQGGDFGYG GYLFPGYGVG KQDVYYGVAE TSPPLWAGQE NATPTSVLFS SASSSSSSSA 420 
KARAGPPG AH RSPATSAGPE LAGLPRRPPG EPLQGFSKLG GGGLRSPGGG RDCMVCFESE 480 

VTAALVPCGH NLFCMECAVR ICERTDPECP VCHITATQAI RIFS

Seq ID NO: 27 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: 1-2070 (underlined sequences correspond to start and stop codons)


1 11 21 31 41 51 

ATGAGCGGTG CGGGGGTGGC GGCTGGGACG CGGCCCCCCA GCTCGCCGAC CCCGGGCTCT 60 
CGGCGCCGGC GCCAGCGCCC CTCTGTGGGC GTCCAGTCCT TGAGGCCGCA GAGCCCGCAG 120 

CTCAGGCAGA GCGACCCGCA GAAACGGAAC CTGGACCTGG AGAAAAGCCT GCAGTTCCTG 180
CAGCAGCAGC ACTCGGAGAT GCTGGCCAAG CTCCATGAGG AGATCGAGCA TCTGAAGCGG 240 
GAAAACAAGG GTGAGCCGGC GCGGGGCCCT AGGCCGGCOC TGCCTCCCCA GGCACACTCA 300 
ACACTGCCGC TCCCGCAGCA CAGAAACACA GCCATCAACT CCAGCACACG CCTGGGCTCA 360 
GGGGGAACAC AGGACGGGGA GCCCCTCCAG ACTGTCCTTG CCCACCTGGC TGCACTGGCC 420 

CCTGTATGCC AACCCAGTGG GTACAGGTTC TGGGGGACCT GGACAGATGC CGCTACCTCT 480
AGCCGTGGCT GGACGATGTT ATGCAGCCAA GCACAGCACG TGCTGCTCTC GGGA AGCCCA 540 
GGGCCTGAGG TCATTGCAGG GCGGCAGGTG GCCACAGGGT GCTCCCCAGA CCTCCCTCCT 600 
CCAAGTAGAG CTGAAATGGG AAGGAACCCC TGGGACAGCC CCTGCCCTGC TAGATCTTTG 660 
CCTCAGATTG CTGCTGTGGC CAGGCCCAGG ATTTCCAGCC CTATGGCTCT GAGTCCTCAC 720 

ATGCTGGGGG CCCAGGGGAT ATGGACACAC TCCATCCAGG GATCCCTTCC TGCCATCTGG 780
GCAGCAACCA TGGGGACAAA GGGAGGAAGC AGAGTCCTGT TTCCTTGCCA CTTGTCCAAG 840 
GCACTTCCCC ATCCTGACAG CGGCCCCCAC CCAGCCCAGG ATCCTGGGCT GTGGTCTCAA 900 
GCTCACTTCC CATTATCTTT GGGGCTGGGG CTG ACATCAG G AGG ACATCT G ACTGGTGGA 960 
TGGAGCCAGC CTGGGAACAT CGCAGCTGGG GCAGTGCCTA GGGCTCTCCC TTCCCAGGGA 1020 

GACATGGAGA AGGGGGTTGA GGGAGGGCCC TTCCCTAGCC GCTGTGGCAA CTCCAGTGAG 1080
CTOTTCTGGG CAAAGTGTGG CCCAAGTCGG CAGCCCCAGC CCTGCAGTGC TGGGGACGCT 1140 
G ACAGGACAC GGGAAGAGGC CATGCTTTCC CTCGG OACC T GCTGTTCCAT GTGTCCCAAG 1200 
CCCTCCTGCT TTCCAGATGG CCCCTCAGGA AACCACCTTT CCAGGGCCTC TGCTCCCTTO 1260 
GGCGCTCGCT GGGTCTGCAT CAACGGAGTG TGGGTAGAGC COGGAGGACC CAGCCCTGCC 1320 

AGGCTGAAGG AGGGCTCCTC ACGGACACAC AGGCCAGGAG GCAAGCGTGG GCGTCTTGCG 1380
GGCGGTAGCG CCGACACTGT GCGCTCTCCT GCAGACAGCC TCTCCATGTC AAGCTTCCAG 1440 
TCTGTCAAGT CCATCTCTAA TTCAGCCAAC TCTCAAGGCA AGGCCAGGCC CCAGCCCGGC 1500 
TCCTTCAACA AGCAAGATTC AAAAGCTGAC GTCTCCCAGA AGGCGGACCT GOAAGAGGAG 1560 
CCCCTACTTC ACAACAGCAA GCTOG ACAAA GTTCCTGGGG TACAAGGGCA GGCCAGAAAG 1620 

GAGAAAGCAG AGGCCTCTAA TGCAGGAGCT GCCTGTATGG GGAACAGCCA GCACCAGGGC 1680
AGGCAGATGG GGGOGGGGGC ACACCCCCCA ATGATCCTGC CCCTTCCCCT GCG AAAGCCC 1740 
ACCACACTTA GGCAGTGCGA AGTGCTCATC CGCGAGCTGT GGAATACCAA CCTCCTGCAG 1800
ACCCAAGAGC TGCGGCACCT CAAGTCCCTC CTGGAAGGGA GCCAGAGGCC CCAGGCAGCC 1 860 
CCGGAGGAAG CTAGCTTTCC CAGGGACCAA GAAGCCACGC ATTTCCCCAA GGTCTCCACC 1920 
AAGAGCCTCT CCAAGAAATG CCTGAGCCCA CCTGTGGCGG AGCGTGCCAT CCTGCCCGCA 1980 
CTGAAGCAGA CCCCGAAGAA CAACTTTGCC GAGAGGCAGA AGAGGCTGCA GGCAATGCAG 2040 
AAACGGCGCC TGCATCGCTC AGTGCTTTGA 

Seq ID NO: 28 Protein sequence:

Protein Accession #: FGENESH predicted 



1 11 21 31 41 51
| | | | | |

MSGAGVAAGT RPPSSPTPGS RRRRQRPSVG VQSLRPQSPQ LRQSDPQKRN LDLEKSLQFL 60 
QQQHSEMLAK LHEEIEHLKR ENKGEPARGP RPALPPQAHS TLPLPQHRNT AINSSTRLGS 120
GGTQDGEPLQ TVLAHLAALA PVCQPSGYRF WGTWTDAATS SRGWTMLCSQ AQHVLLSGSP 180
GPEVIAGRQV ATGCSPDLPP PSRAEMGRNP WDSPCPARSL PQIAAVARPR ISSPMALSPH 240 
MLGAQGIWTH SIQGSLPAIW AATMGTKGGS RVLFPCHLSK ALPHPOSGPH PAQDPGLWSQ 300 
AHFPLSLGLG LTSGGHLTGG WSQPGNIAAG AVPRALPSQG DMEKGVEGGP FPSRCGNSSE 360 
LFWAKCGPSR QPQPCSAGDA DRTREEAMLS LGTCCSMCPK PSCFPDGPSG NHLSRASAPL 420 
GARWVCINGV WVEPGGPSPA RLKEGSSRTH RPGGKRGRLA GGSADTVRSP ADSLSMSSFQ 480
SVKSISNSAN SQGKARPQPG SFNKQDSKAD VSQKADLEEE PLLHNSKLDK VPG VQGQARK 540 
EKAEASNAG A ACMGNSQHQG RQMGAGAHPP MILPLPLRKP TTLRQCEVLI RELWNTNLLQ 600 
TQELRHLKSL LEGSQRPQAA PEEASFPRDQ EATHFPKVST KSLSKKCLSP PVAERAILPA 660
LKQTPKNNFA ERQKRLQAMQ KRRLHRSVL



Seq ID NO: 29 DNA sequence 

Nucleic Acid Accession #: NM_012319.2

Coding sequence: 138-2405 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51 
| | | | | |

CTCGTGCCGA ATTCGGCACG AGACCGCGTG TTCGCGCCTG GTAGAGATTT CTCGAAG ACA 60 
CCAGTGGGCC CGTGTGGAAC CAAACCTGCG CGCGTGGCCG GGCOGTGGGA CAACGAGGCC 120 

GCGGAGACGA AGGCGCAATG GCGAGGAAGT TATCTGTAAT CTTGATCCTG ACCTTTGCCC 180
TCTCTGTCAC AAATCCCCTT CATGAACTAA AAGCAGCTGC TTTCCCCCAG ACCACTG AG A 240 
AAATTAGTCC GAAT TGGGA A TCTGGCATTA ATGTTGACTT GGCAATTTCC ACACGGCAAT 300 
ATCATCTACA ACAGCTTTTC TACCGCTATG GAGAAAATAA TTCTTTGTCA GTTGAAGGGT 360 
TCAGAAAATT ACTTCAAAAT ATAGGCATAG ATAAGATTAA AAGAATCCAT ATACACCATG 420 

ACCACGACCA TCACTCAGAC CACGAGCATC ACTCAGACCA TGAGCGTCAC TCAGACCATG 480
AGCATCACTC AGACCACGAG CATCACTCTG ACCATGATCA TCACTCTCAC CATAATCATG 540 
CTGCTTCTGG TAAAAATAAG CGAAAAGCTC TTTGCCCAGA CCATGACTCA GATAGTTCAG 600 
GTAAAGATCC TAG AAACAGC CAGGGG AAAG G AGCTCACCG ACCAG AACAT GCCAGTGGTA 660 
GAAGGAATGT CAAGGACAGT GTTAGTGCTA GTGAAGTGAC CTCAACTGTG TACAACACTG 720 

TCTCTGAAGG AACTCACTTT CTAGAGACAA TAGAGACTCC AAGACCTGGA AAACTCTTCC 780
CCAAAGATGT AAGCAGCTCC ACTCCACCCA GTGTCACATC AAAGAGCCGG GTGAGCCGGC 840 
TGGCTGGTAG GAAAACAAAT GAATCTGTGA GTGAGCCCCG AAAAGGCTTT ATGTATTCCA 900 
GAAACACAAA TGAAAATCCT CAGGAGTGTT TCAATGCATC AAAGCTACTG ACATCTCATG 960 
GCATGGGCAT CCAGGTTCCG CTG AATGCAA CAGAGTTCAA CTATCTCTGT CCAGCCATCA 1020 

TCAACCAAAT TGATGCTAGA TCTTGTCTGA TTCATACAAG TGAAAAGAAG GCTGAAATCC 1080
CTCCAAAGAC CTATTCATTA CAAATAGCCT GGGTTGGTGG TTTTATAGCC ATTTCCATCA 1140 
TCAGTTTCCT GTCTCTGCTG GGGGTTATCT TAGTGCCTCT CATGAATCGG GTGTTTTTCA 1200 
AATTTCTCCT GAGTTTCCTT GTGGCACTGG CCGTTGGGAC TTTGAGTGGT GATGCTTTTT 1260 
TACACCTTCT TCCACATTCT CATGCAAGTC ACCACCATAG TCATAGCCAT G AAG AACCAG 1320 

CAATGGAAAT GAAAAGAGGA CCACTTTTCA GTCATCTGTC TTCTCAAAAC ATAGAAGAAA 1380
GTGCCTATTT TGATTCCACG TGGAAGGGTC TAACAGCTCT AGGAGGCCTG TATTTCATGT 1440 
TTCTTGTTGA ACATGTCCTC ACATTGATCA AACAATTTAA AG ATAAG AAG AAAAAG AATC 1 500 
AG AAGAAACC TG AAAATG AT G ATG ATGTGG AGATTAAGAA GCAGTTGTCC AAGTATGAAT 1 560 
CTCAACTTTC AACAAATG AG G AG AAAGTAG ATACAGATG A TCGAACTGAA GGCTATTTAC 1 620 

GAGCAGACTC ACAAGAGCCC TCCCACTTTG ATTCTCAGCA GCCTGCAGTC TTGGAAGAAG 1680
AAGAGGTCAT GATAGCTCAT GCTCATCCAC AGGAAGTCTA CAATGAATAT GTACCCAGAG 1740 
GGTGCAAGAA TAAATGCCAT TCACATTTOC ACGATACACT CGGCCAGTCA G ACG ATCTCA 1800 
TTCACCACCA TCATGACTAC CATC ATATTC TCCATC ATC A CCACCACCA A AACCACCATC 1860 
CTCACAGTCA CAGCCAGCGC TACTCTCGGG AG GAG CTG AA AGATGCCGGC GTCGCCACTT 1920 

TGGCCTGGAT GGTGATAATG GGTGATGGCC TGCACAATTT CAGCGATGGC CTAGCAATTG 1980
GTGCTGCTTT TACTGAAGGC TTATCAAGTG GTTTAAGTAC TTCTGTTGCT GTGTTCTGTC 2040 
ATGAGTTGCC TCATGAATTA GGTG ACTTTG CTGTTCTACT AAAGGCTGGC ATGACCGTTA 2100 
AGCAGGCTGT CCTTTATAAT GCATTGTCAG CCATGCTGGC GTATCTTGG A ATGGCAACAG 2 160 
GAATTTTCAT TGGTCATTAT G CTG AAAATG TTTCTATGTG GATATTTGCA CTTACTGCTG 2220 

GCTTATTCAT GTATGTTGCT CTGGTTGATA TGGTACCTGA AATGCTGCAC AATGATGCTA 2280
GTGACCATGG ATGTAGCCGC TGGGGGTATT TCTTTTTACA GAATGCTGGG ATGCTTTTGG 2340 
GTTTTGGAAT TATGTTACTT ATTTCCATAT TTGAACATAA AATCGTGTTT CGTATAAATT 2400 
TCTAGTTAAG GTTTAAATGC TAGAGTAGCT TAAAAAGTTG TCATAGTTTC AGTAGGTCAT 2460 
AG GG AG ATG A GTTTGTATGC TGTACTATGC AGCGTTTAAA GTTAGTGGGT TTTGTGATTT 2520 

TTGTATTGAA TATTGCTGTC TGTTACAAAG TCAGTTAAAG GTACGTTTTA ATATTTAAGT 2580
TATTCTATCT TGG AGATAAA ATCTGTATGT GCAATTCACC GGTATTACCA GTTTATTATG 2640 
TAAACAAGAG ATTTGGCATG ACATGTTCTG TATGTTTCAG GGAAAAATGT CTTTAATGCT 2700
TTTTCAAGAA CTAACACAGT TATTCCTATA CTGGATTTTA GGTCTCTOAA GAACTGCTGG 2760
TGTTTAGGAA TAAGAATGTG CATGAAGCCT AAAATACCAA GAAAGCTTAT ACTG AATTTA 2820 
AGCAAAGAAA TAAAGGAG AA AAGAGAAGAA TCTGAGAATT GGGG AGGCAT AGATTCTTAT 2880 
AAAAATCACA AAATTTGTTG TAAATTAGAG GGGAGAAATT TAGAATTAAG TATAAAAAGG 2940 
CAGAATTAGT ATAGAGTACA TTCATTAAAC ATTTTTGTCA GGATTATTTC CCGTAAAAAC 3000
GTAGTG AGCA CTCTCATATA CTAATTAGTG TACATTTAAC TTTGTATAAT ACAG AAATCT 3060 
AAATATATTT AATGAATTCA AGCAATATAC ACTTGACCAA GAAATTGGAA TTTCAAAATG 3120 
TTCGTGCGGG TTATATACCA G ATGAGTACA GTGAGTAGTT TATGTATCAC CAGACTGGGT 31 80 
TATTGCCAAG TTATATATCA CCAAAAGCTG TATGACTGGA TGTTCTGGTT ACCTGGTTTA 3240 
CAAAATTATC AGAGTAGTAA AACTTTGATA TATATGAGGA TATTAAAACT ACACTAAGTA 3300
TCATTTGATT CGATTCAGAA AGTACTTTGA TATCTCTCAG TGCTTCAGTG CTATCATTGT 3360 
GAGCAATTGT CTTTATATAC GGTACTGTAG CCATACTAGG CCTGTCTGTG GCATTCTCTA 3420 
GATGTTTCTT TTTTACACAA TAAATTCCTT ATATCAGCTT G 

Seq ID NO: 30 Protein sequence:

Protein Accession #: NP_036451.2

1 11 21 31 41 51

MARKLSVILI LTFALSVTNP LHELKAAAFP QTTEKISPNW ESGINVDLAI STRQYHLQQL 60
FYRYGENNSL SVEGFRKLLQ NIGIDKIKRI HIHHDHDHHS DHEHHSDHER HSDHEHHSDH 120
EHHSDHDHHS HHNHAASGKN KRKALCPDHD SDSSGKDPRN SQGKGAHRPE HASGRRNVKD 180
SVSASEVTST VYNTVSEGTH FLETIETPRP GKLFPKDVSS STPPSVTSKS RVSRLAGRKT 240
NESVSEPRKG FMYSRNTNEN PQECFNASKL LTSHGMGIQV PLNATEFNYL CPAIINQIDA 300
RSCLIHTSEK KAEIPPKTYS LQIAWVGGFI AISIISFLSL LGVILVPLMN RVFFKFLLSF 360
LVALAVGTLS GDAFLHLLPH SHASHHHSHS HEEPAMEMKR GPLFSHLSSQ NIEESAYFDS 420
TWKGLTALGG LYFMFLVEHV LTLIKQFKDK KKKNQKKPEN DDDVEIKKQL SKYESQLSTN 480
EEKVDTDDRT EGYLRADSQE PSHFDSQQPA VLEEEEVMIA HAHPQEVYNE YVPRGCKNKC 540
HSHFHDTLGQ SDDLIHHHHD YHHILHHHHH QNHHPHSHSQ RYSREELKDA GVATLAWMVI 600
MGDGLHNFSD GLAIGAAFTE GLSSGLSTSV AVFCHELPHE LGDFAVLLKA GMTVKQAVLY 660
NALSAMLAYL GMATGIFIGH YAENVSMWIF ALTAGLFMYV ALVDMVPEML HNDASDHGCS 720
RWGYFFLQNA GMLLGFGIML LISIFEHKIV FRINF

Seq ID NO: 31 DNA sequence
Nucleic Acid Accession #: NM_002184.1

Coding sequence: 256-3012 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51

GAGCAGCCAA AAGGCCCGCG GAGTCGCGCT GGGCCGCCCC GGCGCAGCTG AACCGGGGGC 60
CGCGCCTGCC AGGCCGACGG GTCTGGCCCA GCCTGGCGCC AAGGGGTTCG TGCGCTGTGG 120 
AGACGCGGAG GGTCGAGGCG GCGCGGCCTG AGTGAAACCC AATGGAAAAA GCATGACATT 180 
TAGAAGTAGA AGACTTAGCT TCAAATCCCT ACTCCTTCAC TTACTAATTT TGTG ATTTGG 240 
AAATATCCGC GCAAG ATGTT GACGTTGCAG ACTTGGGTAG TGCAAGCCTT GTTTATTTTC 300 

CTCACCACTG AATCTACAGG TGAACTTCTA GATCCATGTG GTTATATCAG TCCTGAATCT 360
CCAGTTGTAC AACTTCATTC TAATTTCACT GCAGTTTGTG TGCTAAAGGA AAAATGTATG 420 
G ATTATTTTC ATGTAAATGC TAATTACATT GTCTGG AAAA CAAACCATTT TACTATTCCT 480 
AAGGAGCAAT ATACTATCAT AAACAGAACA GCATCCAGTG TCACCTTTAC AG ATATAGCT 540 
TCATTAAATA TTCAGCTCAC TTGCAACATT CTTACATTCG GACAGCTTG A ACAGAATGTT 600 

TATGGAATCA CAATAATTTC AGGCTTGCCT CCAGAAAAAC CTAAAAATTT GAGTTGCATT 660

GTGAACGAGG GGAAG AAAAT GAGGTGTG AG TGGGATGGTG G AAGGG AAAC ACACTTGGAG 720 
ACAAACTTCA CTTTAAAATC TGAATGGGCA ACACACAAGT TTGCTGATTG CAAAGCAAAA 780 
CGTG ACACCC CCACCTCATG CACTGTTGAT TATTCTACTG TGTATTTTGT CAACATTGAA 840 
GTCTGGGTAG AAGCAG AG AA TGCCCTTGGG AAGGTTACAT CAGATCATAT CAATTTTG AT 900 

CCTGTATATA AAGTGAAGCC CAATCCGCCA CATAATTTAT CAGTGATCAA CTCAGAGGAA 960
CTGTCTAGTA TCTTAAAATT GACATGGACC AACCCAAGTA TTAAGAGTGT TATAATACTA 1020 
AAATATAACA TTCAATATAG GACCAAAG AT GCCTCA ACTT GG AGCCAG AT TCCTCCTGAA 1080 
GACACAGCAT CCACCCGATC TTCATTCACT GTCCAAGACC TTAAACCTTT TACAGAATAT 1 140 
GTGTTTAGGA TTCGCTGTAT GAAGGAAGAT GGT AAGGG AT ACTGGAGTGA CTGGAGTG AA 1200 

GAAGCAAGTG GGATCACCTA TGAAGATAGA CCATCTAAAG CACCAAGTTT CTGGTATAAA 1260
ATAGATCCAT CCCATACTCA AGGCTACAGA ACTGTACAAC TCGTGTGGAA OACATTGCCT 1320 
CCTTTTGAAG CCAATGG AAA AATCTTOGAT TATOAAGTGA CTCTCACAAG ATGGAAATCA 1380 
CATTTACAAA ATTACACAGT TAATGCCACA AAACTG ACAG TAAATCTCAC AAATOATCGC 1440 
TATCTAGCAA CCCTAACAGT AAG AAATCTT GTTGGCAAAT CAG ATGCAGC TGTTTTAACT 1500 

ATCCCTCCCT GTGACTTTCA AGCTACTCAC CCTGTAATGG ATCTTAAAGC ATTCCCCAAA 1560
GATAACATGC TTTGGGTGGA ATGGACTACT CCAAGGGAAT CTGTAAAGAA ATATATACTT 1620 
GAGTGOTGTG TGTTATCAG A TAAAGCACCC TGTATCACAG ACTGGCAACA AGAAGATGGT 1680 
ACCGTGCATC GCACCTATTT AAGAGGGAAC TTAGCAGAGA GCAAATGCTA TTTGATAACA 1740 
GTTACTCCAG TATATGCTGA TGGACCAGGA AGCCCTGAAT CCATAAAGGC ATACCTTAAA 1800 

CAAGCTCCAC CTTCCAAAGG ACCTACTGTT CGGACAAAAA AAGTAGGGAA AAACGAAGCT 1860
GTCTTAGAGT GGGACCAACT TCCTGTTGAT GTTCAGAATG GATTTATCAG AAATTATACT 1920 
ATATTTTATA GAACCATCAT TGGAAATGAA ACTGCTGTGA ATGTGGATTC TTCCCACACA 1980 
GAATATACAT TGTCCTCTTT G ACTAGTG AC ACATTGTACA TGGTACGAAT GGCAGCATAC 2040 
A.CAGATGAAG GTGGGAAGGA TGGTCCAOAA TTCACTTTTA CTACCCCAAA GTTTGCTCAA 2100 

GGAGAAATTG AAGCCATAGT CGTGCCTGTT TGCTTAGCAT TCCTATTGAC AACTCTT^ 2160
GGAGTGCTGT TCTGCTTTAA TAAGCG AG AC CTAATTAA AA AACACATCTG GCCTAAJGTT 2220 
CCAGATCCTT CAAAGAGTCA TATTGCCCAG TGGTCACCTC ACACTCCTCC AAGGCACAAT 2280
TTTAATTCAA AAGATCAAAT GTATTCAGAT GGCAATTTCA CTGATGTAAG TGTTGTGGAA 2340
ATAGAAGCAA ATGACAAAAA GCCTTTTCCA GAAGATCTGA AATCATTGGA CCTGTTCAAA 2400 
AAGGAAAAAA TTAATACTGA AGGACACAGC AGTGGTATTG GGGGGTCTTC ATGCATGTCA 2460 
TCTTCTAGGC CAAGCATTTC TAGCAGTGAT GAAAATGAAT CTTCACAAAA CACTTCGAGC 2520 
ACTGTCCAGT ATTCTACCGT GGTACACAGT GGCTACAGAC ACCAAGTTCC GTCAGTCCAA 2580 
GTCTTCTCAA GATCCGAGTC TACCCAGCCC TTGTTAGATT CAGAGG AGCG GCCAGAAGAT 2640 
CTACAATTAG TAGATCATGT AGATGGCGGT GATGGTATTT TGCCCAGGCA ACAGTACTTC 2700 
AAACAGAACT GCAGTCAGCA TGAATCCAGT CCAGATATTT CACATTTTGA AAGGTCAAAG 2760 
CAAGTTTCAT CAGTCAATGA GGAAG ATTTT GTTAGACTTA AACAGCAGAT TTCAGATCAT 2820 
ATTTCACAAT CCTGTGGATC TGGGCAAATG AAAATGTTTC AGGAAGTTTC TGCAGCAGAT 2880 
GCTTTTGGTC CAGGTACTGA GGGACAAGTA GAAAGATTTG AAACAGTTGG CATGGAGGCT 2940 
GCGACTGATG AAGGCATGCC TAAAAGTTAC TTACCACAGA CTGTACGGCA AGGCGGCTAC 3000 
ATGCCTCAGT GAAGGACTAG TAGTTCCTGC TACAACTTCA GCAGTACCTA TAAAGTAAAG 3060 
CTAAAATGAT TTTATCTGTG AATTC 

Seq ID NO: 32 Protein sequence:
Protein Accession #: NP_002175.1

1 11 21 31 41 51
| | | | | |

MLTLQTWVVQ ALF1FLTTES TGELLDPCGY ISPESPWQL HSNFTAVCVL KEKCMDYFHV 60 
NANYIVWKTN HFTIPKEQYT HNRTASSVT FTDIASLNIQ LTCNILTFGQ LEQNVYGITI 120 
ISGLPPEKPK NLSCIVNEGK KMRCEWDGGR ETHLETNFTL KSEWATHKFA DCKAKRDTPT 180 
SCTVDYSTVY FVNIEVWVEA ENALGKVTSD HINFDPVYKV KPNPPHNLSV INSEELSSIL 240 
KLTWTNPSIK SVUUCYNIQ YRTKDASTWS QIPPEDTAST RSSFTVQDLK PFTEYVFR1R 300 
CMKEDGKGYW SDWSEEASG I TYEDRPSKAP SFWYKIDPSH TQGYRTVQLV WKTLPPFEAN 360 
GKILDYEVTL TRWKSHLQNY TVNATKLTVN LTNDRYLATL TVRNLVGKSD AAVLTIPACD 420 
FQATHPVMDL KAFPKDNMLW VEWTTPRESV KKYILEWCVL SDKAPCITDW QQEDGTVHRT 480 
YLRGNLAESK CYUTVTP VY ADGPGSPESI KAYLKQAPPS KGPTVRTKKV GKNEAVLEWD 540 
QLPVDVQNGF 1RNYTIFYRT UGNETAVNV DSSHTEYTLS SLTSDTLYMV RMAAYTDEGG 600 
KDGPEFTFTT PKFAQGEIEA IWPVCLAFL LTTLLGVLFC FNKRDUKKH IWPNVPDPSK 660 
SH1AQWSPHT PPRHNFNSKD QMYSDGNFTD VSWETEAND KKPFPEDUCS LDLFKKEKIN 720 
TEGHSSGIGG SSCMSSSRPS ISSSDENESS QNTSSTVQYS TWHSGYRHQ VPSVQVFSRS 780 
ESTQPLLDSE ERPEDLQLVD HVDGGDGILP RQQYFKQNCS QHESSPDISH FERSKQVSSV 840 
NEEDFVRLKQ QISDHISQSC GSGQMKMFQE VSAADAFGPG TEGQVERFET VGMEAATDEG 900 
MPKSYLPQTV RQGGYMPQ 

Seq ID NO: 33 DNA sequence 

Nucleic Acid Accession #: NM_018255.1

Coding sequence: 11-2491 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51 
| | | | | |

AGTTGGCGAC ATGGTGGCAC CCGTGCTGGA GACTTCTCAC GTGTTTTGCT GCCCAAACCG 60 
GGTGCGGGG A GTCCTGAACT GGAGCTCTGG GCCCAGAGG A CTTCTGGCCT TTGGCACGTC 120 
CTGCTCCGTG GTGCTCTATG ACCCCCTGAA AAGGGTTGTT GTTACCAACT TGAATGGTCA 180 
CACCGCCCGA GTCAATTGCA TACAGTGGAT TTGTAAACAG GATGGCTCCC CTTCTACTGA 240 
ATTAGTTTCT GG AGG ATCTG ATAATCAAGT GATTCACTGG G AAATAGAGG ATAATCAGCT 300 
TTTAAAAGCA GTGCATCTTC AAGGCCATGA AGGACCTGTT TATGCGGTGC ATGCTGTTTA 360 
CCAGAGGAGG ACATCAGATC CTGCATTATG TACACTGATC GTTTCTGCAG CTGC AGATTC 420 
TGCTGTTCGA CTCTGGTCTA AAAAGGGTCC AGAAGTAATG TGCCTTCAGA CTTTAAACTT 480 
TGGAAATGGA TTTGCTTTGG CTCTCTGCTT ATCTTTTTTG CCAAATACTG ATGTACCAAT 540 
ATTAGCATGT GGCAATGATG ATTGCAGAAT TCACATATTT GCTCAACAAA ATGATCAGTT 600 
TCAGAAAGTG CTTTCTCTCT GTGGACATGA GGATTGG ATT AGAGGAGTGG AATGGGCAGC 660 
CTFTGGTAGA GATCTTTTCC TAGCAAGCTG TTC ACAAG AT TGCCTG ATAA GAATATGGAA 720 
GCTGTATATA AAGTCAACAT CTTTAG AAAC TCAGGATGAC G ATAACATAA G ACTGAAAGA 780 
AAATACI I I T ACCATAG AAA ATGAAAGTGT TAAAATAGCA TTTGCTGTTA C TCTGGA GAC 840 
AGTGCTAGCC GGTCATGAAA ACTGGGTAAA TGCAGTTCAC TGGCAACCTG TGTTTTACAA 900 
AGATGGTGTC CTACAGCAGC CAGTG AGATT ATTATCTGCT TCCATGGATA AAACCATGAT 960 
TCTCTGGGCT CCAG ATGAAG AGTCAGGAGT TTGGCTAGAA CAGGTTCGAG TAGGTGAAGT 1020 
AGGTGGGAAT ACTTTGGGAT TTTATGATTG CCAGTTCAAT GAAGATGGCT CCATGATCAT 1080 
TGCTCATGCT TTCCACGGAG CGTTGCACCT TTGGAAACAG AATACAGTTA ACCCAAGAGA 1 140 
GTGGACTCCA GAGATTGTCA TTTCAGGACA CTTTGATGGT GTCCAAGACC TAGTCTGGGA 1200 
TCCAGAAGGA GAATTTATTA TCACTGTTGG TACTGATCAG ACAACTAGAC TTTTTGCTCC 1260 
ATGGAAGAGA AAAGACCAAT CACAGGTGAC TTGGCATGAA ATTGCAAGGC CTCAGATACA 1320 
TGGGTATGAC CTGAAATGTT TGGCAATGAT TAATCGGTTT CAGTTTGTAT CTGGAGCAGA 1380 
TGAAAAAGTT CTTCGGGTTT TTTCTGCACC TCGGAATTTT GTGGAAAATT TTTGTGCCAT 1440 
TACAGGACAA TCACTGAATC ATGTGCTCTG TAATCAAGAT AGTGATCTTC CAGAAGGAGC 1500 
CACTGTCCCT GCATTGGG AT TATCAAATAA AGCTGTCTTT CAGGGAGATA TAGCTTCTCA 1560 
GCCTTCTG AT G A AG AGGAGC TGTTAACTAG TACTGGTTTT G AGTATCAGC AGGTGGCCTT 1 620 
TCAGCCCTCC ATACTTACTG AGCCTCCCAC TGAGGATCAT CTTCTGCAGA ATACTTTGTO 1680 
GCCTGAAGTT CAAAAACTAT ATGGGCACGG TTATG AAATA TTTTGTGTTA CTTGTAACAG 1 740 
TTCAAAGACT CTGCTTGCCT CAGCTTGTAA GGCAGCTAAG AAAGAGCATG CAGCTATCAT 1800 
TCTTTGGAAC ACTACATCTT GGAAACAGGT GCAG AATTTA GTTTTCCACA GTTTGACAGT 1 860 
CACGCAGATG GCCTTCTCAC CTAATGAGAA GTTCl I ACTA GCTGTTTCCA GAGATCGAAC 1920 
CTGGTCATTG TGGAAAAAGC AGGATACAAT CTCACCTGAG TTCGAGCCAG TTTTTA GTCT 1980 
TTTTGCCTTC ACCAACAAAA TTACTTCTGT GCACAGTAGA ATTATTTGGT CTTGTGATTG 2040
GAGTCCTGAC AGCAAGTATT TCTTCACTGG GAGTCGAGAC AAAAAGGTGG TTGTCTGGGG 2100
TG AGTGCGAC TCCACTGATG ACTGTATTGA GCACAACATT GGCCCCTGCT <^^TCCT 21 60 
GGACGTGGGT GGGGCTGTG A CAGCTGTCAG CGTCTGCCCA GTGCTCCACC CTTCTCAACG 2220 
AT ACGTGGTT GCAGTAGGAT TGG AGTGTGG AAAGATTTGC TTATATACCT GG AAAA AG AC 2280 
TGATCAAGTT CCAGAAATAA ATGACTGG AC CCACTGTGTA GAAACAAGTC AAAGCCAAAG 2340 
TCATACACTG GCTATCAGAA AATTATGCTG GAAGAATTGC AGTGGAAAAA CTGAACAGAA 2400 
GG AAGCAGAA GGTGCTGAGT GGTTACACTT TGCAAGCTGT GGTG AAGATC ACACTGTGAA 2460 
G ATACACAG A GTCAATAAAT GTGCACTGI4ATGG 

Seq ID NO: 34 Protein sequence: 
Protein Accession #: NP_060725.1

1 11 21 31 41 51 

MVAPVLETSH VFCCPNRVRG VLNWSSGPRG LLAFGTSCSV VLYDPLKRVV VTNLNGHTAR 60 
VNCIQWIOCQ DGSPSTELVS GGSDNQVIHW ElEDNQLLKA VHLQGHEGPV YAVHAVYQRR 120 
TSDPALCTLI VSAAADSAVR LWSKKGPEVM CLQTLNFGNG FALALCLSFL PNTD VPILAC 1 80 
GNDDCR1HIF AQQNDQFQKV LSLCGHEDWI RGVEWAAFGR DLFLASCSQD CLIRIWKLY1 240 
KSTSLETQDD DNIRLKENTF T1ENESVKIA FAVTLETVLA GHENWVNAVH WQPVFYKDG V 300 
LQQPVRLLSA SMDKTMILWA PDEESGVWLE QVRVQEVGGN TLGFYDCQFN EDGSNfflAHA 360 
FHGALHLWKQ NTVNPREWTP EIVISGHFDG VQDLVWDPEG EFUTVGTDQ TTRLF APWKR 420 
KDQSQVTWHE 1ARPQIHGYD LKCLAMINRF QFVSGADEKV LRVFSAPRNF VENFCA1TGQ 480 
SLNHVLCNQD SDLPEGATVP ALGLSNKAVF QGDIASQPSD EEELLTSTGF EYQQVAFQPS 540 
ILTEPPTEDH LLQNTLWPEV QKXYGHGYEI FCVTCNSSKT LLASACKAAK KEHAAHLWN 600 
TTSWKQVQNL VFHSLTVTQM AFSPNEKFLL AVSRDRTWSL WKKQDTISPE FEPVFSLFAF 660 
TNKJTSVHSR UWSCDWSPD SKYFFTGSRD KKVWWGECD STDDdEHNI GPCSSVLDVG 720 
G AVTAVSVCP VLHPSQRYV V AVGLECGKIC LYTWKKTDQV PETNDWTHCV ETSQSQSHTL 780 
AIRKLCWKNC SGKTEQKEAE GAEWLHFASC GEDHTVKIHR VNKCAL 

Seq ID NO: 35 DNA sequence 

Nucleic Acid Accession #: NM_022131

Coding sequence: 11-2878 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51 

TGCTGCGAGG ATGCTGCCTG GGCGGCTGTG CTGGGTGCCG CTCCTGCTGG CGCTGGGCGT 60 
GGGGAGCGGC AGCGCCGGTG GCGGGGACAG CCGGCAGCGC CGCCTCCTCG CGGCTAAAGT 120 
CAATAAGCAC AAGCCATGGA TCG AG ACTTC ATATCATGGA GTCATAACTG AGAACAATGA 1 80 
CACAGTCATT TTGGACCCAC CACTGGTAGC CCTGGATAAA GATGCACCGG TTCCTTTTGC 240 
AGGGGAAATC TGTGCGTTCA AGATCCATGG CCAGGAGCTG CCCTTTGAGG CTGTGGTGCT 300 
CAACAAGACA TCAGG AG AGO GCCGGCTCCG TGCCAAGAGC CCCATTGACT GTGAGTTGCA 360 
G AAGGAGTAC ACATTCATCA TCCAGGCCTA TGACTGTGGT GCTGGGCCOC ACGAGACAGC 420 
CTGGAAAAAG TCACACAAGG CCGTGGTCCA TATACAGGTG AAGGATGTCA ACGAGTTTGC 480 
TCCCACCTTC AAAG AGCCAG CCTACAAGGC TGTTGTG ACG GAGGGCAAG A TCTATGACAG 540 
CATTCTGCAG GTGGAGGCCA TTGACGAGGA CTGCTCCCCA CAGTACAGCC AGATCTGCAA 600 
CTATG AAATC GTCACCACAG ATGTGCCTTT TGCCATCGAC AGAAATGGCA ACATCAGGAA 660 
CACTGAGAAG CTGAGCTATG ACAAACAACA CCAGTATGAG ATCCTGGTGA CCGCCTACGA 720 
CTGTGGACAG AAGCCCGCTG CTCAGGACAC CCTGGTGCAG GTGGATGTGA AGCCAGTTTG 780 
CAAGCCTGGC TGGCAAG ACT GGACCAAGAG G ATTGAGTAC CAGCCTGGCT CCGGGAGCAT 840 
GCCCCTGTTC CCCAGCATCC ACCTGGAGAC GTGCGATGGA GCCGTGTCTT CCCTCCAGAT 900 
CGTCACAGAG CTGCAGACTA ATTACATTGG GAAGGGTTGT GACCGGGAGA CCTACTCTGA 960 
G AAATCCCTT CAGAAGTTAT GTGG AGCCTC CTCTGGCATC ATTGACCTCT TGCCATCCCC 1020 
TAGCGCTGCC ACCAACTGGA CTGCAGGACT GCTGGTGGAC AGCAGTGAGA TG ATCTTCAA 1080 
GTTTGACGGC AGGCAGGGTG CCAAAATCCC CG ATGGGATT GTGCCCAAG A ACCTGACCGA 1 140 
TCAGTTCACC ATCACCATGT GGATGAAACA CGGCCCCAGC CCTGGTGTG A GAGCCGAGAA 1200 
GGAAACCATC CTCTGCAACT CAGACAAAAC CGAAATGAAC CGGCATCACT ATGCCCTGTA. 1260 
TGTGCACAAC TGCCGCCTCG TCTTTCTCTT GCGGAAGGAC TTCGACCAGG CTGAC^CTT 1320 
TCGCCCCGCG GAGTTCCACT GGAAGCTGGA TCAGATTTGT GACAAAGAGT GGCACTACTA 1380 
TGTCATCAAT GTGGAGTTTC CTGTGGTAAC CTTATACATG GATGGAGCAA CATATGAACC 1440 
ATACCTGGTG ACCAACGACT GGCCCATTCA TCCATCTCAC ATAGCCATGC AACTCACAGT 1500 
CGGCGCTTGT TGGCAAGGAG GAGAAGTCAC CAAACCACAG TTTGCTC AGT TCTTTCATGG 1560 
AAGCCTGGCC AGTCTCACCA TCCGCCCTGG CAAAATGGAA AGCCAG AAGG TGATCTCCTG 1620 
CCTGCAGGCC TGCAAGGAAG GGCTGGACAT TAATTCCTTG GAAAGCCTTG GCCAAGGAAT 1680 
AAAGTATCAC TTCAACCCCT CGCAGTCCAT CCTGGTG ATG GAAGGTGACG ACATTGGGAA 1740 
CATTAACCGT GCTCTCCAGA AAGTCTCCTA CATCAACTCC AGGCAGTTCC CAACGGCGGG 1800 
TGTGCGGCGC CTCAA AGTAT CCTCCAAAGT CCAGTGCTTT GGGGAAG ACG TATGCATCAG 1 860 
TATCCCTGAG GTAGATGCCT ATGTGATGGT CCTCCAGGCC ATCG AGCCCC GGATCACCCT 1920 
CCGGGGCACA GACCACTTCT GGAGACCTGC TGCCCAGTTT GAAAGTGCCA GGGGAGTGAC 1980 
CCTCTTCCCT G ATATCAAG A TTGTG AGC AC CTTCGCCAAA ACCGAAGCCC CCGGGGACGT 2040 
G AAAACCACA GACCCCAAAT CAG AAGTCTT AG AGG AAATG CTTCATAACT TAGATTTCTG 2100 
TGACATTTTG GTGATCGGAG GGGACTTGGA CCCAAGGCAG GAGTGCTTGG AGCTCAACCA 2160 
CAGTGAGCTC CACCAACGAC ACCTGG ATGC CACTAATTCT ACTGCAGGCT ACTCCATCTA 2220 
CGGTGTGGGC TCCATGAGCC G CTATG AGC A GGTGCTACAT CACATCCGCT ACCGCAACTG 2280 
GCGTCCGGCT TCCCTTG AGO CCCGGCGTTT CCGGATTAAG TGCTCAGAAC TCAATGGGCG 2340 
CTACACTAGC AATGAGTTCA ACTTGGAGGT CAGCATCCTT CATGAAGACC AAGTCTCAGA 2400 
TAAGG AGCAT GTCAATCATC TGATTGTGCA GCCTCCCTTC CTCCAGTCTG TCCATCATCC 2460 
TG AGTCCCGG AGTAGCATCC AGCACAGTTC AGTGGTCCCA AGCATTGCCA CAGTGGTCAT 2520 
CATCATCTCC GTGTGCATGC TTGTGTTTGT CGTGGCCATG GGTGTGTACC GGGTCCGGAT 2580
CGCCCACCAG CACTTCATCC AGG AG ACTG A GGCTGCCAAG G AATCTG AG A TGG ACTGGGA 2640 
CGATTCTGCG CTGACTATCA CAGTCAACCC CATGG AGAAA CATGAAGGAC CAGGGCATGG 2700 
GGAAGATGAG ACTGAGGGAG AAGAGG AGGA AGAAGCCG AG GAAG AAATGA GCTCCAGCAG 2760 
TGGCTCTGAC GACAGCGAAG AGGAGGAGG A GGAGGAAGGG ATGGGCAGAG GCAGACATGG 2820 
GCAGAATGG A GCCAGGCAAG CCCAGCTGG A GTGGGATGAC TCCACCCTCC CCTACTAGTG 2880 
CCCAGGGGTC TGCTGCCTGG CCCACATGTC CCTTTTGTAA ACCCTGACCC AGTGTATGCC 2940 
CATGTCTATC ATACCTCACC TCTGATGTCT GTGACATGTC TGG GAAG GCC TTCTCCAGCT 3000 
TCCTGGAGCC CACCCTTTAA GCCTTGGGCA CTCCCTGTGT TTCATCCATG GGG AAGTTCC 3060 
AAGAAGCCCA GCATGGCCAT CAGTGAGGAC TTCAGGGTAG ACTTTGTCCT GTA GCCTCC A 3120 
CTTCTGCCCT AAGTTCCCCA GCATCCTGAC TACCTGTCTG CAG AGTTTGC CTTTGTTTTT 3180 
TCCTGCAGGG AAGAAGGCCC ACCTTTGTGT CACTCACCTC CCCAGGCTCA GAGTCCCCAA 3240 
GGCCCTGGGG TTCCAACTCA CTGTGCGTCT CCTCCACACA GACCAGTAGG TTCTCCTATG 3300 
CTGACTCCAG GTTGCTTCAT ACAAGGAGGG TGGTTGAACT TCACACACGT AAGGTCTTAG 3360 
TGCTTAACAG TTTAAAGGAA AGTCCTTGTT GAGGCAGAAC TAAGTTTACA GGGAAAGGTA 3420 
CACACATTCT CTCTCTCTCT CTCTCTCTGT CTATCTAGTT CCCCAGCTTG GAGAGCCTTT 3480 
CCCCTTGCTT CTTTCTGAGG CCATATAAGC TTATAAG AAA AGTCCCAAAC CAAGAATAGG 3540 
TCCTTGGCCA CAAGCAGGGT CTG ATCCCCC ATCAG AGCTA TCTG AGCCTG CCTGTCTGGG 3600 
CACCTGCTGC AACCATGCAG CTACCCTGCC AGGGGCACTC AGCAAACAGA ACCACAGGGC 3660 
CCAGG AGGCA TTCCACACAG GCACTGCCCC AGG ACAACAC AACAAGGACA GTCACAACAA 3720 
GGACAACAAG GACACAACAC AACACACAAC AAGGACAGTC ACAACAAGCC TAGAGCCAGA 3780 
AAGCAG ATGG AAATGCTAAT GAGGTCAAAC GTAGGCTTCA TGGTGGGTGG AGTGGGGGTG 3840 
GCTGGGCTCC CCCAGGACAG AGGGGACCCT GAGGTTGGCA AGGCTCTCAC CACTCAGCCT 3900 
TATGGTCCCT TATCTCCTAT CTTCCCTCTT GAG AAAATAC ACGCTTTCTG CATGTATTAG 3960 
AAACGCACG A GCTCCACCAA GTCT ACAATG AAAGTTTGAA ATTTAACTGC AAGGAATTAG 4020 
AAGCATATTT GCAATCATTG CAGCTTCTTC TTTCTTCTGC T CATAAAA GG AGGAACACTT 4080 
TAGATAGAGG GCAAATATAT CTGAAAACCT AATTTCTTTC TTTTTTTGAT AAGGAAATCT 4140 
TTTCC ATCTC CATCCTAACA TGCACAACCT GTGAAGAGAA TTGTTTCTAT AGTAACTGGT 4200 
CTGTG ATCTT TTGTGGCCAA GAGAATAGCA GGCAAGAATT AGGGCCTTGA CAGAATTTCC 4260 
ACGAAGCTCT G AG AACATGT TTGTTTCGAA TGTCTGATTC CTCTTTGTCA TCAATGTGTA 4320 
TGCTCTGTCC CCATCCTTCA CTCCTCCTCA AGCTCACACC AATTGGTTTG GCACAGGCAC 4380 
AG AGCTGGTC CCTAGTTAAG TGGCATTTAT GTTAAAAAAA AATAGTTCAG AATCTCAGCC 4440 
TTTTCTTTGT GTCATCAAAA CAGCTTAAGA AGGGGACTAC TGCCAATGTC CTCTAGTCTG 4500 
ACCTCCACCC AGGG AGG ACC CATGGCAGGT CTTTTCAACT TTCTG ATTCA TG AG AACAAC 4560 
CTTGTGAAGC TTTTCCCACC TCCTAAAGTG TTTTCTGCAT CTGTTCCTTC CTTTGG ACCI '4620 
CACAACAAAT CCTGTGAAGT AACTGAGACA TCTGTTGTTA GATACATTTT TGTG ATGAGT 4680 
AAACTGAGGC TTCG 

Seq ID NO: 36 Protein sequence:
Protein Accession #: NP_071414.1

1 11 21 31 41 51

MLPGRLCWVP LLLALGVGSG SGGGGDSRQR RLLAAKVNKH KPW1ETSYHG VlTENNDTVl 60 
LDPPLVALDK DAPVPFAGEI CAFKIHGQEL PFEAWLNKT SGEGRLRAKS PIDCELQKEY 120 
TFIIOAYDCG AGPHETAWKK SHKAVVHIQV KDVNEFAPTF KEPAYKAVVT EGKIYDSILQ 180 
VEAIDEDCSP QYSQICNYEI VTTDVPFAID RNGNIRNTEK LSYDKQHQYE ILVTAYDCGQ 240 
KPAAQDTLVQ VDVKPVCKPG WQDWTKRIEY QPGSGSMPLF PSIHLETCDG AVSSLQIVTE 300 
LQTKYIGKGC DRETYSEKSL QKLCGASSGI IDLLPSPSAA TNWTAGLLVD SSEMIFKFDG 360
RQGAKIPDGI VPKNLTDQFT ITMWMKHGPS PGVRAEKETI LCNSDKTEMN RHHYALYVHN 420
CRLVFLLRKD FDQADTFRPA EFHWKLDQIC DKEWHYYVIN VEFPVVTLYM DGATYEPYLV 480
TNDWPIHPSH IAMQLTVGAC WQGGEVTKPQ FAQFFHGSLA SLTIRPGKME SQKVISCLQA 540
CKEGLDINSL ESLGQGIKYH FNPSQSILVM EGDDIGNINR ALQKVSYINS RQFPTAGVRR 600
LKVSSKVQCF GEDVCISIPE VDAYVMVLQA IEPRITLRGT DHFWRPAAQF ESARGVTLFP 660
DIKIVSTFAK TEAPGDVKTT DPKSEVLEEM LHNLDFCDIL VIGGDLDPRQ ECLELNHSEL 720
HQRHLDATNS TAGYSIYGVG SMSRYEQVLH HIRYRNWRPA SLEARRFRIK CSELNGRYTS 780
NEFNLEVSIL HEDQVSDKEH VNHLIVQPPF LQSVHHPESR SSIQHSSVVP SIATVVIIIS 840
VCMLVFVVAM GVYRVRIAHQ HFIQETEAAK ESEMDWDDSA LTITVNPMEK HEGPGHGEDE 900
TEGEEEEEAE EEMSSSSGSD DSEEEEEEEG MGRGRHGQNG ARQAQLEWDD STLPY 

Seq ID NO: 37 DNA sequence 

Nucleic Acid Accession #: none found

Coding sequence: 143-874 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

GGGAGGGAGA GAGGCGCGCG GGTGAAAGGC GCATTGATGC AGCCTGCGGC GGCCTCGGAG 60
CGCGGCGGAG CCAGACGCTG ACCACGTTCC TCTCCTCGGT CTCCTCCGCCT^AGCTCOG 120 

cgctgccx:gg cagccgggag ccatscgacc ccagggccccgccgcct^otcagcggct 

CCGCGGCCTC CTGCTGCTCC TGCTGCTGCA GCTGCCCGCG CCGTCGAGCG CCTCTGAGAT 240 
CCCCAAGGGG AAGCAAAAGG CGCAGCTCCG GC AG AGGG AG ™ 
AATGTGCTTA CAAGGGCCAG CAGGAGTGCC TGGTCGAGAC GGGAGCCCTG GGGCCAATGG 360 
CATTCCGGGT ACACCTGGGA TCCCAGGTCG GGATGGATTC AAAGGAGAAA AGGGGGAATG 420 
TCTGAGGGAA MKrrTTCAGG AGTCCTGGAC ACCCAACTAC AAGCAGTGTT CATGG AGTTC 480 
ATTGAATTATGgSSaGATCTTGGGAAAAT TGCGGAGTGT ACATTTACAA AGATGCGTTC 540 
A^TAGTCCTCTAAGAGm TGTTCAGTGG CTCACTTCGG CTAAAATGCA GAAATGCATG 600 
C^C^G^TCCTA^ AGCTGAATGT TCAGGACCTC TTCCCATTGA 660 
AGCTATAATT TATTTGGACC AAGGAAGCCC TGAAATGAAT TCAACAATTA ATATTCATCG 720
CACTTCTTCT GTGGAAGGAC TTTGTGAAGG AATTGGTGCT GGATTAGTGG ATGTTGCTAT 780
CTGGGTTGGC ACTTGTTCAG ATTACCCAAA AGGAGATGCT TCTACTGGAT GGAATTCAGT 840
TTCTCGCATC ATTATTGAAG AACTACCAAA ATAAATGCTT TAATTTTCAT TTGCTACCTC 900
TTTTTTTATT ATGCCTTGGA ATGGTTCACT TAAATGACAT TTTAAATAAG TTTATGTATA 960 
CATCTGAATG AAAAGCAAAG CTAAATATGT TTACAGACCA AAGTGTGATT TCACACTGTT 1020 
TTTAAATCTA GCATTATTCA TTTTGCTTCA ATCAAAAGTG GTTTCAATAT TTTTTTTAGT 1080
TGGTTAGAAT ACTTTCTTCA TAGTCACATT CTCTCAACCT ATAATTTGGA ATATTuTTGT h,0 
GGTCTTTTGT TTTTTCTCTT AGTATAGCAT TTTTAAAAAA ATATAAAAGC TACCAATCTT 1200 
TGTACAATTT GTAAATGTTA AG AATTTTTT TTATATCTGT TAAATAAAAA TTATTTCCAA 1260 
CAACCTTAAA aaaaaaaaaa AAAA 

Seq ID NO: 38 Protein sequence: 
Protein Accession #: none found

1 11 21 31 41 51 

MRPQGPAASP QRLRGLLLLL LLQLPAPSSA SEIPKGKQKA QLRQREVVDL YNGMCLQGPA 60
GVPGRDGSPG ANGIPGTPGI PGRDGFKGEK GECLRESFEE SWTPNYKQCS WSSLNYGIDL 120
GKIAECTFTK MRSNSALRVL FSGSLRLKCR NACCQRWYFT FNGAECSGPL PIEAIIYLDQ 180
GSPEMNSTIN IHRTSSVEGL CEGIGAGLVD VAIWVGTCSD YPKGDASTGW NSVSRIIIEE 240
LPK 

Seq ID NO: 39 DNA sequence 

Nucleic Acid Accession #: NM_000949 

Coding sequence: 285-2153 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51 

GGAGGCTGAA ATCCCCAGAC GCCGGTTTTC TGGGCTGGGC TTTCTGCTTA CTCACTCOT 60 
CTCCCTCTTT CTGGATTTTA CCGACCGTTC GCGAAACAGC TTTCCACACA ATGGAGCTTC 120 
ATGTOCTCGT GCAGGAAGTA CTCATCGACT GATGTGGCAG ACTTTGCTCC CTGACAAAAC 180 
TAAAG AACTC TCCTATTCAT GGAGGCGAAC ACTGAGGATG CTTTCCACAT G AACXXTC AA 240 
GTGAACTTCT G ATACATTTC CTGCAGCAAG AGAAGGCAGC CAACATGAAG GAAAATGTGG 300 
CATCTGCAAC CGTTTTCACT CTGCTACTTT TTCTCAACAC CTGCCTTCTG AATGGACAGT 360 
TACCTCCTGG AAAACCTGAG ATCTTTAAAT GTCGTTCTCC CAATAAGGAA ACATTCACCT 420 
GCTGGTGGAG GCCTGGG ACA G ATGG AGG AC TTCCTACCAA TTATTCACTG ACTTACCACA 480 
GGGAAGGAGA GACACTCATG CATGAATGTC CAGACTACAT AACCGGTGGC COCAACTOCT 540 
GCCACTTTGG CAAGCAGTAC ACCTCCATGT GGAGGACATA CATCATGATG GTCAATGCCA 600 
CTAACCAGAT GGGAAGCAGT TTCTCGGATG AACTTTATGT GGACGTGACT TACATAOTTC 660 
AGCCAGACCC TCCTTTGG AG CTGGCTGTGG AAGTAAAACA GCCAGAAGAC AGAAAACCCT 720 
ACCTGTGGAT TAAATGGTCT CCACCTACCC TGATTGACTT AAAAACTGGT TGGTTCACGC 780 
TCCTGTATGA AATTCGATTA AAACCCGAGA AAGCAGCTGA GTGGGAGATC CATTTOCTG 840 
GGCAGCAAAC AGAGTTTAAG ATTCTCAGCC TACATCCAGG ACAGAAATAC CTTGTCCAGG 900 
TTCGCTGCAA ACCAGACCAT GGATACTGGA GTGCATGGAG TCCAGCGACC TTCATTCAGA 960 
TACCTAGTG A CTTCACCATG AATGATACAA CCGTGTGGAT CTCTGTGGCT GTCCTTTCTG 1020 
CTGTCATCTG TTTGATTATT GTCTGGGCAG TGGCTTTGAA GGGCTATAGC ATGGTGACCT 1080 
GCATCTTTCC GCCAGTTCCT GGGCCAAAAA TAAAAGGATT TGATGCTCAT CTGTTjGGAGA 1140 
AGGGCAAGTC TGAAGAACTA CTGAGTGCCT TGGGATGCCA AG ACTTTCCT CCC ACTTCTG 1200 
ACTATG AGGA CTTGCTGGTG GAGTATTTAG AAGTAGATGA TAGTGAGGAC ^G^TCTAA 1260 
TGTCAGTCCA TTCAAAAGAA CACCCAAGTC AAGGTATGAA ACCCACATAC CT^ATCCTG 1320 
ACACTGACTC AGGCCGGGGG AGCTGTGACA GCCCTTCCCT TTTGTCTGAA AAGTGTGAGG 1380 
AACCCCAGGC CAATCCCTCC ACATTCTATG ATCCTGAGGT CATTGAGAAG CCAGAOJATC 1440 
CTGAAACAAC CCACACCTGG GACCCCCAGT GCATAAGCAT GGAAGGCAAA ATCCCCTATT 1500 
TTCATGCTGGTGGATCCAAA TGTTCAACAT GGCCCTTACC ACAGCCCAGC CAGCACAACO 1560 
CCAGATCCTCTTACCACAAT ATTACTGATG TGTGTGAGCT GGCTGTGGGC CCTGCAGGTG 1620 
CACCGGCCAC TCTGTTG AAT OAAGCAGGTA AAGATGCTTT AJVAATCCTCT CA^CCATTA 1 680 
AGTCTAGAGA AG AGGGAAAG GCAACCCAGC AGAGGGAGGT AGAAAGCTTC CATTCTGAG A 1740 
CTGACCAGGA TACGCCCTGG CTGCTGCCCC AGGAGAAAAC CCCCTTTGGC TTCGCTAAAC 1800
CCTTGGATTA TGTGGAGATT CACAAGGTCA ACAAAGATGG TGCATTATCA TTGCTACCAA 1860
AACAG AG AG A G AACAGCGGC AAGCCCAAG A AGCCCGGGAC TCCTG AGAAC AA^GOWJT \M> 
ATGCCAAGGT GTCCGGGGTC ATGGATAACA ACATCCTGGT GTTGGTGCCA GATCCACATG 1980 
CTAAAAACGT GGCTTGCTTT G A AG AATCAG CCAAAGAGGC CCCACCATCA CTTG AACAG A 2040 
ATCAAGCTG A G AAAGCCCTG GCCAACTTCA CTGCAACATC AAG^GTGC AGGCT^GC 2100 
TGGGTGGTTT GGATTACCTG G ATCCCGCAT GTTTTACACA CTCCTTTCAC TGATAGCTTG 2160 
ACTAATGG AA TG ATTGGTTA AAATGTGATT TTTCTTCAGG TAACACTACA G AGTACGTGA 2220 
AATGCTCAAG AATGTAGTCA GACTG ACACT ACTAAAGCTC CCAGCTCCTT TCATGCTCCA 2280 
TTTTTAACCA CTTGCCTCTT TCTCCAGCAG CTGATTCCAG AACAAATCAT TATGTTTCCT 2340 
AACTGTG ATT TGTAGATTTA CTTTTTGCTG TTAGTTATAA AACTATGTGT TCAATGAAAT 2400 
AAAAG^ACAC TGCTTAGTAT TCTTGAGGG A CAATGCCAAT AGGTATATCC TCTGGAAAAG 2460 
GCTTTCATGA TTTGGCATGG G ACAG ACGGA AATG AAATTG TCAAAATTGT TTACCATAGA 2520 
AAGATGACAA AAGAAAATTT TCCACATAGG AAAATGCCAT G AAAATTGCT TTTG AAAAAC 2580 
AACTGCATAA CCTTTACACT CCTCGTCCAT TTTATTAGG A TTACCCAAAT ATAACCATTT 2640 
AAAG ^AAG TaA TGCATTCCAO AAC A AATTGT TTACATAAGT TCCTATACCT TACTGACAC A 27O0 
TTGCTGATAT GCAAGTAAGA AAT 

Seq ID NO: 40 Protein sequence:
Protein Accession #: NP_000940.1

1 11 21 31 41 51

MKENVASATV FTLLLFLNTC LLNGQLPPGK PEIFKCRSPN KETFTCWWRP GTDGGLPTNY 60
SLTYHREGET LMHECPDYIT GGPNSCHFGK QYTSMWRTYI MMVNATNQMG SSFSDELYVD 120
VTYIVQPDPP LELAVEVKQP EDRKPYLWIK WSPPTLIDLK TGWFTLLYEI RLKPEKAAEW 180
EIHFAGQQTE FKILSLHPGQ KYLVQVRCKP DHGYWSAWSP ATFIQIPSDF TMNDTTVWIS 240
VAVLSAVICL IIVWAVALKG YSMVTCIFPP VPGPKIKGFD AHLLEKGKSE ELLSALGCQD 300
FPPTSDYEDL LVEYLEVDDS EDQHLMSVHS KEHPSQGMKP TYLDPDTDSG RGSCDSPSLL 360
SEKCEEPQAN PSTFYDPEVI EKPENPETTH TWDPQCISME GKIPYFHAGG SKCSTWPLPQ 420
PSQHNPRSSY HNITDVCELA VGPAGAPATL LNEAGKDALK SSQTIKSREE GKATQQREVE 480
SFHSETDQDT PWLLPQEKTP FGSAKPLDYV EIHKVNKDGA LSLLPKQREN SGKPKKPGTP 540
ENNKEYAKVS GVMDNNILVL VPDPHAKNVA CFEESAKEAP PSLEQNQAEK ALANFTATSS 600
KCRLQLGGLD YLDPACFTHS FH



Seq ID NO: 41 DNA sequence

Nucleic Acid Accession #: none found, Eos cloned sequence

Coding sequence: 1-1572 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51
| | | | | |



ATGACCCAAA ATAAATTAAA GCTTTGTTCC AAAGCCAATG TGTATACTG A AGTGCCTGAT 60 
„ GGAGGATGGG GCTGGGCGGT AGCTGTTTCA TTTTTCTTCG JJ^^^STJ ^aatttaat in 
ATCATCAAGA CATTTGGTGT CTTCTTTAAT GACTTAATGG ACAGTTTTAA TGAA^AAT 180
AGCAGG ATCT CATGGATAAT CTCAATCTGT GTGTTTGTCT TAACATTTTC AGCTCCCCTC 240 
GCCACAGTCC TG AGCAATCG TTTCGG ACAC CGTCTGGTAG TGATGTTGGG GGGGCTACTT 3(K) 
GTCAGCACCG GG ATCGTGGC CGCCTCCTTC TCACAAGAGG TTTCTCATAT GTACGTCGCC 360 
ATCGGCATCA TCTCTGGTCT GGG ATACTGC TTTAGTTTTC TCCCAACTGT 
TCACAATATT TTGGCAAAAG ACGTTCCATA GTCACTGCAG TTGCTTCCAC AGGAGAATGT 480
TOG^CTGT SgCTTTC^ ACCAGCAATC ATGGCTCTGA AGGAGCGCAT TGGCTGGAGA 540 
TACA^CCTCC TCTTCGTGGG CCTACTACAG TTAAACATTG TCATCTTCGG AGCACTCCTC 600 
AGACCCATCT TTATCAGAGG ACCAGCGTCA CCGAAAATAG TCATCCAGGA AAATCGGAAA 660 
GAAG^CA^ ATATCCTTG A AAATGAGAAA ACACGAACCT ^^^^^^^^JJ?^^^ ™ 
35 GGAGTAG AAC TAACTACCTC ACCTAAAAAT ^^^^^^^^f^^^S^^^^^^^^uy 
CCGAAGGCCG ACATGCAGCA GGTCCTGGTG AAGACCAGCC CCAGGCCAAG CGAAAAGAAA 840 
GCCCCGCTAT TAG ACTTCTC CATTTTGAAA G AG AA AAGTT TTATTTGTTA TGCATTATTT 900 
GCTCTCTTTG CAACACTGGG ATTCTTTGCA CCTTCCTTGT ACATCATTCC TCTGGGCATT 960 
ACTCTG^G^A^H^ A^AGG A CCGCGCTGCT TTTTTATTAT CTACGATGGC ^TJGC^AA 1020 
GTTTTCGGAA GGATCGGAGC TGGTTTTGTC CTCAACAGGG AGCCCATTCG TAAGATTTAC 1080
ATTGAGCTCA TCTGCGTCAT CTTATTGACT GTGTCTCTGT TTGCCTTTAC TTTTGCTACT 1140

GTCTAATGTC ATGCAGCATA TTTTTTGGGT TTATGGTTGG AACAAT^GA 1200 
GGGACTCACA TTCCACTGCT TGCTG AGGAT GATGTCGTGG GCATTG AGAA GATGTOTCT 1260 
GCAGCTGGGG TCTACATCTT CATtCAGAGC ATAGCAGGAC TOGCTGGACC GCCCCTTCCA 320 
GGTTTGTTGG TGGACCAAAG TAAGATCTAC AGCAGGGCCT TCTACTCCTG CGCAGCTGGC 1380
ATGGCCCTGG CTGCTGTGTG CCTCGCCCTG GTGAGACCGT GTAAGATGGG ACTGTGCCAG 1440 
CATCATCACT SgGTCAAAC AAAGGTAGTG AGCCATCGTG GG^GACTTT A^GGACm 1500 
CCTGAAGACT TTCTGG AAAT GGATCTTGCA AAAAATGAGC ACAGAGTTCA CGTGCAAATG 1560 
GAGCCGGTAT GA



Seq ID NO: 42 Protein sequence: 

Protein Accession #: none found, Eos cloned sequence 



1 11 21 31 41 51 

MTQNKLKLCS KANVYTEVPD GGWGWAVAVS FFFVEVFTYG IIKTFGVFFN DLMDSFNESN 60
SRISWIISIC VFVLTFSAPL ATVLSNRFGH RLVVMLGGLL VSTGMVAASF SQEVSHMYVA 120
IGIISGLGYC FSFLPTVTIL SQYFGKRRSI VTAVASTGEC FAVFAFAPAI MALKERIGWR 180
YSLLFVGLLQ LNIVIFGALL RPIFIRGPAS PKIVIQENRK EAQYMLENEK TRTSIDSIDS 240
GVELTTSPKN VPTHTNLELE PKADMQQVLV KTSPRPSEKK APLLDFSILK EKSFICYALF 300
GLFATLGFFA PSLYIIPLGI SLGIDQDRAA FLLSTMAIAE VFGRIGAGFV LNREPIRKIY 360
IELICVILLT VSLFAFTFAT EFWGLMSCSI FFGFMVGTIG GTHIPLLAED DVVGIEKMSS 420
AAGVYIFIQS IAGLAGPPLA GLLVDQSKIY SRAFYSCAAG MALAAVCLAL VRPCKMGLCQ 480
HHHSGETKVV SHRGKTLQDI PEDFLEMDLA KNEHRVHVQM EPV



Seq ID NO: 43 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted ORF

Coding sequence: 1-1749 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51

ATGCTGTCTG GCTTCTTGAT GAGTCCCAGT ACCCAGCACA GAGCACAGTA CACTCCCGGA 60
^^^rr^GGGA^cnccAic GGTGCGCACA CCTCCCGAGG GCGAGGCAGC 120 

Sc^gg^agSaS 

GGGGAGGCGG AGAAGGGGAA CCGGGGCGAG CCGCCCGCCT GGATCCGCGC CCAGC^CAG 240
° c^OGGCCGCCGCCAGCTGG GCAGGCTCCC GGG ACTGCGG ™ 
CGCCTGCGTC CTGGACGTTC CCGGGGGAGG GTCCGGTTGC CAGTGAAACC TCCAGAGGCT 360 
TCCGGACGAC AGCCCCGGGG GCCTTCTGAC TGCATCCCGA GATTTCCATC AGCGAGTGCA 420
ACTCATAAGG CAGTCCCTAA GGGGACCGGG CCACCGGCTG AGGACGGGGA TGGCTTAGGA 480
GCTCCTGGAC CTAGGGCCCG GCGTCGTCGC CTCCTGGGCG TCGCGGCAGA GGGGAGTGGC 540
CCGCGCGGAA AGCGCCGCGG GACAGTCAGT GACGAGGCCC GGGGGTCGCC GGGGCCACGA 600
CTTCTCGGAG ACCGTCCTGC GCTCTCTGGA GACGCGCTGT CCGCGCCCAG GGTGGTGCCA 660
TGTGGGGCGC TCGCCGCTCG TCCGTCTCCT CATCCTGGAA CGCCGCTTCG CTCCTGCAGC 720
TGCTGCTGGC TGCGCTGCTG GCGGCGGGGG CGAGGGCCCA GCGGCGAGTA CTGCCACGGC 780
TGGCTGGACG CGCAGGGCGT CTGGCGCATC GGCTTCCAGT GTCCCGAGCG CTTCGACGGC 840
GGCGACGCCA CCATCTGCTG CGGCAGCTGC GCGTTGCGCT ACTGCTGCTC CAGCGCCGAG 900
GCGCGCCTGG ACCAGGGCGG CTGCGACAAT GACCGCCAGC AGGGCGCTGG CGAGCCTGGC 960
CGGGCGGACA AAGACGGGCC CCGACGGCTC GGCAGGGCTT CATGTCTTAG GGGTACCCAA 1020
GGAGACGGCG AGGGTGCGCC CCCACCCGTG AGGGCCTGGC AGCGGTGCTC CCCTGAAGGC 1080
TCCCCGAAAG GAAGGCAGCT CCTCAGGGCT TTCCCGGGGC TGCTGCCCCG TGCCAGACGC 1140
CGCGGATTCC CATCTTCTCC ACGCGGCGGC CCCTCTCCCC TGCAGCGGCC CGCCTTGCCC 1200
ATCTACGTGC CGTTCCTCAT TGTTGGCTCX GTGTTTGTCG CCTTTATCAT CTTGGGGTCC 1260

CTGGTGGCAG CCTGTTGCTG CAGATGTCTC CGGCCTAAGC AGGATCCCCA GCAGAGCCGA 1320 
GCCCCAGGGG GTAACCGCTT GATGGAGACC ATCCOCATGA TOCCCAGTGC CAGCACCTCC 1380 
CGGGGGTCGT CCTCACGCCA GTCCAGCACA GCTGCCAGTT CCAGCTCCAG CGCCAACTCC 1440 
GGGGCCCGGG CGCCCCCAAC AAGGTCACAG ACCAACTGTT GCTTGCCGOA AGGGACCATG 1500 

AACAACGTGT ATGTCAACAT GCCCACGAAT TTCTCTGTGC TGAACTGTCA GCAGGCCACC 1560
CAGATTGTGC CACATCAAGG GCAGTATCTG CATCCCCCAT ACGTGGGGTA CACGGTGCAG 1620 
CACGACTCTG TGCCCATG AC AGCTGTGCCA CCTTTCATGG ACGGCCTGCA GCCTGGCTAC 1680 
AGGCAGATTC AGTCCCCCTT CCCTCACACG AACAGTGAAC AGAAGATGTA CCCAGCGGTG 1740 
ACTGTATAAC CX3AG AGTCAC TGGTGGGTTC CTTTACTGAA GGGAG ACGAA OGCAGGGGTG 1 800 

GATTCTCGAG GTGGAAGTCC GCACATGTCG GTGGTATTTA TGGCACGATT CCTTTGGATG 1860
GCTTCATTTG CCCCCAGACT GTATGAAAAC ATCTCCGAAT TAGCATTTCT GGATATGTTT 1920 
CATCCAGGGT ATCATTG ATT TATGATGGAA AACCGGCCTC AGCTGOAG AT GACTGTG ATG 1 980 
TTGCTGATGG GTGTATAACA AATGCTTGAG TCCGAAGTGC CCTTGAGATA TGGTTGACGA 2040 
AAGAATTTTA TAAACTGATA AATTAAGGAT TTTTATTATG TTGTTATTAT TA TTTCTTTT 2100 

TTGTTGTTGA CTGCACAGGA TCAAAATGCC TGTTATCTCC CTTTTACTGG GACTTTTTTT 2160
TTTTTT1 n 1 TTTTn 1 1 ' A A TCAQ ACAGGG TCTTGCTCTG TTGCCCAGGC TGGAGTGCAG 2220 
TGGTGCGATC TCGGCTCACT GCAACTTCAG CCTCCTGGAT TCAGGC AACA CTCC TGCCTC 2280 
AGCCTCCCAC GTGGCTGGGA TTACAGGTGC CTGCCCCCAT GGCTAATTTT TTGTATTTTT 2340 
TGTAGAGATG GGGTTTCACC ATGTTGGCTG GGCTGGTCTC ACTCTCCTG A CCTCAAGCAA 2400 

TCTGCCTGTC TCAGCCTCCC AAAGTGCTGG GATTACAGGC GTGAGCCACC GCCCCCAGCC 2460
TGAGCCTTTT TTTTTTTCTA ATGCATCCAA GGTTAAGGGG AAG ACGCAAA TAACAGGACT 2520 
ATTCTAA AAG GAAACCTGTT TGAACTCTGT GAGATCAGTC ATCAGTCTCA GTATTCCACA 2580 
GGCACACCTT AATTTCATTG TAAAAAG ATA f ATATATTTT GTCTATTTTT G TGCT TTTGG 2640 
GGGCCTATTT TGTGCTTTTT TACCTTATGT AG AGATCTTA TTACAAAGTG ATTTTCTACA 2700 

TTAAAAAGAG ACTGAAATAA ATTGTATAGT TACTTAACTA ATGAAGACAT TTCAGAACTC 2760
TGGGATGATT TTAATCTTGA AGTAGTAGGT GGTATAGTCA TAAAACCATT CATCCCCTTC 2820 
TTG ATTGTAT CTTAATTTTC TGGCTTTAAQ GTGACATCTG AGA GGTAA TG CATTCTTTTT 2880 . 
TATATTG AAA TCATAAACTA TCACCCGCTG CTTCTCTGAG TTACTTTTAA TTTTGCCTTG 2940 
TGGTTATGGT TTGGCGTTTC CTTCTGTTTG GTTTTCAGAG CCCCATGTCT ATATAGTCCT 3000 

GAGTGCAAGT AATTACTATA CTTGTAAATG AAGATCAGTA TTTCTGCCTA GATCTGATAA 3060
AAAAATTTTC TTGTCTTAGT TATAAAAATT CAAAGAAATG TGTTACAAAG ATACTT AGTA 3120 
TAGCTCCTCA GCCATAACCT GAGACTTGGG ATGAAATTTA AAC CAOAT AC GATTTACTTT 3180 
GCAGATCATA AGGCTTTTTA TACTCTTGTT ATCAAAATGG CTTATTTTTC AGGCACTAAG 3240 
GATTGTTAAG AGAAAAGCTT TTCAACGAAG GATTGCCTTT CTTCTCCCAC ACTGTTCTTG 3300 

ATTTCCTCTC TCTTTCAGGC CTCAACAGGC ACTGTATTCA TTGCCAATGT TCCAAATTAT 3360
CA AATTCAAG TGAATTTATT TGTGTGTTCT TTACTTATAT AAAAA AAGAT AACTTTAAGG 3420 
ATGTGCAAGT ACATTTCCAA CTGCTAGCAC AACCAGTATT TTGTAATTAA ACAAATCGCT 3480 
GTATGGTATG GTCTTCTACA CATTTATGTC TATAGATATC TATCGATCAT CTTTCTATTC 3540 
TGTTTCATGA CTGAATAATG TAAAACCAGT GTTGGCAATT GGTATCATCA ATOA TACTCA 3600 

TTTTTTAATA ACCAAAGGCA GGGGAAAATC ATTTTACTTA TTAATAAATA TTTTATGATG 3660
TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA 

Seq ID NO: 44 Protein sequence:
Protein Accession #: FGENESH predicted

1 11 21 31 41 51

MLSGFLMSPS TQHRAQYTPG GKKLPWEASI GAHTSRGRGS DRERESRPEA AGLLWDRAAA 60
GEAEKGNRGE PPAWIRAQQQ PRPPPAGQAP GTAAGGAQDP RLRPGRSRGR VRLPVKPPEA 120
SGRQPRGPSD CIPRFPSASA THKAVPKGTG PPAEDGDGLG APGPRARRRR LLGVAAEGSG 180
PRGKRRGTVS DEARGSPGPR LLGDRPALSG DALSAPRVVP CGALAARPSP HPGTPLRSCS 240
CCWLRCWRRG RGPSGEYCHG WLDAQGVWRI GFQCPERFDG GDATICCGSC ALRYCCSSAE 300
ARLDQGGCDN DRQQGAGEPG RADKDGPRRL GRASCLRGTQ GDGEGAPPPV RAWQRCSPEG 360
SPKGRQLLRA FPGLLPRARR RGFPSSPRGG PSPLQRPALP IYVPFLIVGS VFVAFIILGS 420
LVAACCCRCL RPKQDPQQSR APGGNRLMET IPMIPSASTS RGSSSRQSST AASSSSSANS 480

GARAPPTRSQ TNCCLPEGTM NNVYVNMPTN FSVLNCQQAT QIVPHQGQYL HPPYVGYTVQ 540 
HDSVPMTAVP PFMDGLQPGY RQIQSPFPHT NSEQKMYPAV TV 

Seq ID NO: 45 DNA sequence
Nucleic Acid Accession #: NM_002285

Coding sequence: 55-3738 (underlined sequences correspond to start and stop codons)
WO 02/059377 PCT/US02/02242



1 11 21 31 41 51

GGCCGAGCCT CGGCGGCGGC GGTAGCGGCG GCGGCGACGC TGACACCTCC CACCATGGAC 60 
AGCTTCGACT TAGCCCTGCT CCAGGAATGG GACCTCGAGT CACTGTGTGT CTATGAACCA 120 
GATAGAAATG CATTACGGAG GAAAGAACGA GAAAGAAGAA ATCAAGAAAC TCAACAGGAT 180
GATGGCACGT TTAATTCTAG TTACTCTCTC TTCAGTGAGC CCTACAAGAC TAACAAGGGG 240
GATGAACTCT CCAACCGGAT CCAGAACACT TTAGGCAATT ATGATGAAAT GAAAGACTTT 300
TTAACTGATA GAACCAATCA GAGTCATCTC GTTGGAGTTC CCAAACCTGG GGTTCCTCAG 360
ACTCCTGTGA ACAAGATCGA TGAACATTTT GTTGCAGATT CAAGAGCCCA GAACCAGCCC 420
TCGTCTATCT GTAGCACTAC AACTTCCACA CCAGCAGCTG TCCCCGTGCA GCAGAGTAAA 480
AGAGGCACTA TGGGCTGGCA GAAGGCTGGG CACCCACCCT CTGACGGCCA ACAGAGAGCA 540 
ACACAACAGG GCTCTCTCAG GACCTTGCTT GGAGATGGTG TTGGCAGACA GCAGCCTCGG 600
GCCAAACAAG TGTGCAATGT GGAGGTGGGC CTTCAGACCC AGGAGAGGCC ACCTGCCATG 660 
GCGGCCAAGC ACAGCAGCAG CGGACACTGT GTTCAGAACT TTCCTCCATC CCTAGCTTCA 720 

AAACCCAGCC TGGTCCAGCA GAAACCGACC GCGTATGTGA GGCCAATGGA CGGCCAAGAT 780
CAGGCCCCTG ATGAGTCTCC TAAGCTGAAG TCGTCTTCGG AAACCAGCGT GCACTGCACA 840
TCATACAGGG GAGTCCCTGC CAGCAAGCCG GAGCCTGCCA GAGCCAAGGC CAAGCTCTCC 900 
AAGTTCAGCA TCCCCAAGCA GGGGGAGGAG AGTAGATCTG GAGAAACCAA CAGCTGTGTT 960 
GAAGAAATAA TCCGGGAGAT GACCTGGCTT CCACCACTTT CTGCTATTCA AGCACCTGGC 1020
AAAGTGGAAC CAACCAAATT TCCATTTCCA AATAAGGACT CTCAGCTTGT ATCCTCTGGA 1080
CACAATAATC CAAAGAAAGG TGATGCAGAG CCAGAGAGTC CAGACAATGG CACATCGAAT 1140
ACATCAATGC TGGAAGATGA CCTTAAGCTA AGCAGTGATG AAGAGGAGAA TGAACAGCAG 1200 
GCAGCTCAGA GAACGGCTCT CCGCGCTCTC TCTGACAGCG CCGTGGTCCA GCAGCCCAAC 1260 
TGCAGAACCT CGGTGCCTTC CAGCAAGGGC AGCAGCAGCA GCAGCAGCAG CGGCACGAGC 1320 

AGCTCCTCCA GCGACTCAGA GAGCAGCTCC GGATCTGACT CGGAGACCGA GAGCAGCTCC 1380
AGCGAGAGTG AGGGCAGCAA GCCCCCCCAC TTCTCCAGCC CCGAGGCTGA ACCGGCATCC 1440 
TCTAACAAGT GGCAGCTGGA TAAATGGCTA AACAAAGTTA ATCCCCACAA GCCTCCTATT 1500 
CTGATCCAAA ATGAAAGCCA CGGGTCAGAG AGCAATCAGT ACTACAACCC GGTGAAAGAG 1560
GACGTCCAGG ACTGTGGGAA AGTCCCCGAC GTTTGCCAGC CCAGCCTGAG AGAGAAGGAG 1620
ATCAAGAGCA CTTGCAAGGA GGAGCAAAGG CCAAGGACAG CCAACAAGGC CCCTGGGAGT 1680
AAAGGCGTGA AGCAGAAGTC CCCGCCCGCG GCCGTGGCCG TGGCGGTGAG CGCAGCCGCC 1740
CCGCCACCCG CAGTGCCCTG TGCGCCCGCG GAGAACGCGC CCGCGCCTGC CCGGAGGTCC 1800
GCGGGCAAGA AGCCCACCAG GCGCACCGAG AGGACCTCAG CCGGGGACGG CGCCAACTGC 1860 
CACCGGCCCG AGGAGCCCGC GGCCGCGGAC GCGCTGGGGA CGAGCGTGGT GGTCCCCCCG 1920 

GAGCCCACCA AAACCAGGCC CTGTGGCAAC AACAGAGCGA GCCACCGCAA GGAGCTGCGC 1980
TCCTCCGTGA CCTGCGAGAA GCGCCGCACG CGGGGGCTAA GCAGGATCGT CCCCAAATCC 2040
AAGGAGTTCA TTGAGACAGA GTCGTCATCT TCATCCTCCT CCTCGGACTC CGACCTGGAG 2100 
TCCGAGCAGG AGGAGTACCC TCTGTCCAAA GCACAGACCG TGGCTGCCTC TGCCTCCTCC 2160 
GGGAATGATC AGAGGCTGAA GGAGGCCGCT GCCAACGGGG GCAGTGGTCC TAGGGCCCCT 2220 

GTAGGCTCCA TCAACGCCAG GACCACCAGT GACATCGCCA AGGAGCTGGA GGAGCAGTTC 2280
TACACACTGG TCCCCTTTGG CCGGAACGAA CTTCTCTCCC CTCTAAAGGA CAGTGATGAG 2340
ATCAGGTCTC TCTGGGTCAA AATCGACCTG ACCCTCCTGT CCAGGATCCC AGAACACCTG 2400
CCCCAGGAGC CAGGGGTATT GAGCGCCCCT GCCACCAAGG ACTCTGAGAG CGCACCGCCC 2460
AGCCACACCT CGGACACACC TGCAGAAAAG GCTTTGCCAA AATCCAAGAG GAAACGCAAG 2520
TGTGACAACG AAGACGACTA CAGGGAGATC AAGAAGTCCC AGGGAGAGAA AGACAGCTCT 2580
TCAAGACTGG CCACCTCCAC CAGTAATACT TTGTCTGCAA ACCACTGCAA CATGAACATC 2640 
AACAGTGTGG CAATACCAAT AAATAAAAAT GAAAAAATGC TTCGGTCGCC CATCTCACCC 2700 
CTCTCTGATG CATCTAAACA CAAATACACC AGCGAGGACT TAACTTCTTC CAGCCGACCT 2760
AATGGCAACA GTTTGTTTAC TTCAGCCTCT TCCAGCAAAA AGCCTAAGGC CGACAGCCAG 2820 

CTGCAGCCTC ACGGCGGAGA CCTCACGAAA GCAGCTCACA ACAATTCTGA AAACATTCCC 2880
CTCCACAAGT CACGGCCGCA GACGAAGCCG TGGTCTCCAG GCTCCAACGG CCACAGGGAC 2940
TGCAAGAGGC AGAAACTTGT CTTCGATGAT ATGCCTCGCA GTGCCGATTA TTTTATGCAA 3000
GAAGCTAAAC GAATGAAGCA TAAAGCAGAT GCAATGGTGG AAAAGTTTGG AAAGGCTTTG 3060
AACTATGCTG AAGCAGCATT GTCGTTTATC GAGTGTGGAA ATGCAATGGA ACAAGGCCCC 3120
ATGGAATCCA AATCTCCTTA TTACCTGATG TATTCAGAAA CAGTAGAGCT CATCAGGTAT 3180
GCTATGAGAC TAAAAACCCA CTCAGGCCCC AATGCCACAC CAGAAGACAA ACAACTGGCT 3240
GCATTATGTT ACCGATGCCT GGCCCTCCTG TACTGGCGGA TGTTTCGACT CAAAAGGGAC 3300
CACGCTGTAA AGTATTCAAA AGCACTAATC GACTATTTCA AGAACTCATC TAAAGCCGCC 3360
CAAGCCCCAT CTCCGTGGGG GGCCAGTGGA AAGAGCACTG GAACCCCATC CCCCATTTCT 3420
CCCAACCCCT TTCCCGGCAG CTCCGTGGGG TCTCAGGGCA GCCTCTCCAA CGCCAGCGCC 3480
CTGTCCCCGT CGACCATCGT CAGCATCCCA CAGCGCATCC ACCAGATGGC GGCCAACCAC 3540
GTCAGCATCA CCAACAGCAT CCTGCACAGC TACGACTACT GGGAGATGGC CGACAACCTG 3600 
GCCAAGGAAA ACCGAGAATT CTTCAACGAC CTGGATCTGC TCATGGGGCC GGTCACCCTG 3660 
CACAGCAGCA TGGAGCACCT GGTCCAGTAC TCCCAACAGG GCCTGCACTG GCTGCGGAAC 3720 

AGCGCCCACC TGTCATAGGG ACCTCACCCT GGGGCCAGAG TGGGCTCTGG TCTCCACAGA 3780
TGGCTCAACG TTTTTGGACA CTGTGCTACT GAAACTCCCA GCCACAGCAT TTATAGACTG 3840
CGGTGAACAT TTCCTCA

Seq ID NO: 46 Protein sequence: 
Protein Accession #: NP_002276

1 11 21 31 41 51
| | | | | |

MDSFDLALLQ EWDLESLCVY EPDRNALRRK ERERRNQETQ QDDGTFNSSY SLFSEPYKTN 60 
KGDELSNRIQ NTLGNYDEMK DFLTDRTNQS HLVGVPKPGV PQTPVNKIDE HFVADSRAQN 120
QPSSICSTTT STPAAVPVQQ SKRGTMGWQK AGHPPSDGQQ RATQQGSLRT LLGDGVGRQQ 180
PRAKQVCNVE VGLQTQERPP AMAAKHSSSG HCVQNFPPSL ASKPSLVQQK PTAYVRPMDG 240
QDQAPDESPK LKSSSETSVH CTSYRGVPAS KPEPARAKAK LSKFSIPKQG EESRSGETNS 300
CVEEIIREMT WLPPLSAIQA PGKVEPTKFP FPNKDSQLVS SGHNNPKKGD AEPESPDNGT 360
SNTSMLEDDL KLSSDEEENE QQAAQRTALR ALSDSAVVQQ PNCRTSVPSS KGSSSSSSSG 420
TSSSSSDSES SSGSDSETES SSSESEGSKP PHFSSPEAEP ASSNKWQLDK WLNKVNPHKP 480
PILIQNESHG SESNQYYNPV KEDVQDCGKV PDVCQPSLRE KEIKSTCKEE QRPRTANKAP 540
GSKGVKQKSP PAAVAVAVSA AAPPPAVPCA PAENAPAPAR RSAGKKPTRR TERTSAGDGA 600
NCHRPEEPAA ADALGTSVVV PPEPTKTRPC GNNRASHRKE LRSSVTCEKR RTRGLSRIVP 660
KSKEFIETES SSSSSSSDSD LESEQEEYPL SKAQTVAASA SSGNDQRLKE AAANGGSGPR 720
APVGSINART TSDIAKELEE QFYTLVPFGR NELLSPLKDS DEIRSLWVKI DLTLLSRIPE 780
HLPQEPGVLS APATKDSESA PPSHTSDTPA EKALPKSKRK RKCDNEDDYR EIKKSQGEKD 840
SSSRLATSTS NTLSANHCNM NINSVAIPIN KNEKMLRSPI SPLSDASKHK YTSEDLTSSS 900
RPNGNSLFTS ASSSKKPKAD SQLQPHGGDL TKAAHNNSEN IPLHKSRPQT KPWSPGSNGH 960
RDCKRQKLVF DDMPRSADYF MQEAKRMKHK ADAMVEKFGK ALNYAEAALS FIECGNAMEQ 1020
GPMESKSPYY LMYSETVELI RYAMRLKTHS GPNATPEDKQ LAALCYRCLA LLYWRMFRLK 1080
RDHAVKYSKA LIDYFKNSSK AAQAPSPWGA SGKSTGTPSP ISPNPFPGSS VGSQGSLSNA 1140
SALSPSTIVS IPQRIHQMAA NHVSITNSIL HSYDYWEMAD NLAKENREFF NDLDLLMGPV 1200
TLHSSMEHLV QYSQQGLHWL RNSAHLS 

Seq ID NO: 47 DNA sequence 

Nucleic Acid Accession #: NM_033151

Coding sequence: 351-4499 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51 

ACTGGGATAA AGCAAG AAGA CTGATTTTAT G AGCAGGGGT TTGATACATC AAAGGAGATT 60 
GCCCAGGATC AAGGGTGCGG TGTTGGGGGT GGGTTGGGGA GGGTGGTTAG AGAAGGTTTC 120 
ACTAAGTGAT TTGGGCCTGA GGCCTGAGAA GATGTTTAAA AAGAGGGATC AAGCACAGGC 180
TAAGGAGAGG AAAGAGCAGG CACCCAAACC TCTGCATGGC CCCAATATGC TCCCTGCAGG 240 
GTAGTGCCCC CTCTTCTGGC TGCTCAAGGC GAGATCTAAG CTTCTTCTAA CTCCTGCTGT 300 
CTTTTCATAT TCTCTG ATTC TGGG AAACG A AG AATTGGCA GGAACTGAA A ATGACTAGGA 360 
AGAGGACATA CTGGGTGCCC AACTCTTCTG GTGGCCTCGT GAATCGTGGC ATCGACATAG 420 
GCGATGACAT GGTTTCAGG A CTTATTTATA AAACCTATAC TCTCCAAG AT GGCCCCTGGA 480 
GTCAGCAAGA OAGAAATCCT GAGGCTCCAG GGAGGGCAGC TGTCCCACCG TGGGGGAAGT 540 
ATGATGCTGC CTTGAGAACC ATGATTCCCT TCCGTCCCAA GCCGAGGTTT CCTGCCCCCC 600 
AGCCCCTGGA CAATGCTGGC CTGTTCTCCT ACCTCACCGT GTCATGGCTC ACCCCGCTCA 660 
TG ATCCAAAO CTTACGGAGT CGCTTAGATG AGAACACCAT CCCTCCACTG TCAGTCCATG 720 
ATGCCTCAGA CAAAAATGTC CAAAGGCTTC ACCGCCTTTG GGAAGAAGAA GTCTCAAGGC 780 
GAGGGATTG A AAAAGCTTCA GTGCTTCTGG TG ATGCTG AG GTTCCAG AGA ACA AGGTTG A 840 
TTTTCGATGC ACTTCTGGGC ATCTGCTTCT GCATTGCCAG TGTACTCGGG CCAATATTGA 900 
TTATACCAAA GATCCTGGAA TATTCAGAAG AGCAGTTGGG OAATGTTOTC CATGGAGTGG 960 
GACTCTGCTT TGCCCTTTTT CTCTCCGAAT GTGTGAAGTC TCTGAGTTTC TCCTCCAGTT 1020
GGATCATCAA CCAACGCACA GCCATCAGGT TCCGAGCAGC TGTTTCCTCC TTTGCCTTTG 1080 
AGAAGCTCAT CCAATTTAAG TCTGTAATAC ACATCACCTC AGGAGAGGCC ATCAGCTTCT 1140
TCACCGGTGA TGTAAACTAC CTGTTTG AAG GGGTGTGCTA TGGACCCCTA GTACTGATCA 1200 
CCTGCGCATC GCTGGTCATC TGCAGCATTT CTTCCTACTT CATTATTGGA TACACTGCAT 1260 
TTATTGCCAT CTTATGCTAT CTCCTGGTTT TCCCACTGGC GGTATTCATG ACAAGAATGG 1320 
CTGTGAAGGC TCAGCATCAC ACATCTGAGG TCAGCGACCA GCGCATCCGT GTGACCAGTG 1380 
AAGTTCTCAC TTGCATTAAG CTGATTAAAA TGTACACATO GGAGAAACCA TTTGCAAAAA 1440 
TCATTGAAGA CCTAAGAAGG AAGGAAAGGA AACTATTGGA GAAGTGCGGG CTTGTCCAGA 1500 
GCCTG ACAAG TATAACCTTG TTCATCATCC CCACAGTGGC CACAGCGGTC TGGGTTCTCA 1560 
TCCACACATC CTTAAAGCTG AAACTCACAG CGTCAATGGC CTTCAGCATG CTGGCCTCCT 1620 
TGAATCTCCT TCGGCTGTCA GTGTTCTTTG TGCCTATTGC AGTCAAAGGT C TCACG AATT 1680 
CCAAGTCTGC AGTGATGAGG TTCAAGAAGT TTTTCCTCCA GGAGAGCCCT GTTTTCTATG 1740 
TCCAG ACATT ACAAGACCCC AGCAAAGCTC TGGTCTTTGA GGAGGCCACC TTGTCATGGC 1800 
AACAGACCTG TCCCGGGATC GTCAATGGGG CACTGGAGCT GGAGAGGAAC GGGCATGCTT 1860
CTG AGGGGAT GACCAGGCCT AGAGATGCCC TCGGGCCAGA GGAAG AAGGG AACAGCCTGG 1920 
GCCCAGAGTT GCACAAGATC AACCTGGTGG TGTCCAAGGG GATGATGTTA GGGGTCTGCG 1980 
GCAACACGGG GAGTGGTAAG AGCAGCCTOT TGTCAGCCAT CCTGGAGG AG ATGCACTTGC 2040 
TCGAGGGCTC GGTGGGGGTG CAGGGAAGCC TGGCCTATGT CCCCCAGCAG GCCTG GATCG 2100 
TCAGCGGGAA CATCAGGGAG AACATCCTCA TGGGAGGCGC ATATGACAAG GCCCGATACC 2160 
TCCAGGTGCT CCACTGCTGC TCCCTGAATC GGGACCTGGA ACTTCTGCCC TTTGGAGACA 2220 
TGACAGAGAT TGGAGAGCGG GGCCTCAACC TCTCTGGGGG GCAGAAACAG AGGATCAGCC 2280 
TGGCCCGCGC CGTCTATTCC G ACCGTCAGA TCTACCTGCT GGACGACCCC CTGTCTGCTG 2340 
TGG ACGCCCA CGTGGGGAAG CACATTTTTG AGGAGTGCAT T AAGA AGACA CTCAGGGGGA 2400 
AG ACGGTCGT CCTGGTGACC CACCAGCTGC AGTACTTAGA ATTTTGTGGC CAGATCATTT 2460 
TGTTGGAAAA TGGG AAAATC TGTGAAAATG GAACTCACAG TGAGTTAATG CAGAAAAAGG 2520 
GGAAATATGC CCAACTTATC CAGAAGATGC ACAAGGAAGC CACTTCGGAC ATGTTGCAGG 2580 
ACACAGCAAA GATAGCAGAG AAGCCAAACG TAGAAAGTCA GOCTCTGGCC ACCTCCCTGG 2640
AAGAGTCTCT CAACGGAAAT GCTGTGCCGG AGCATCAGCT CACACAGGAG GAGGAGATGG 2700 
AAG AAGGCTC CTTGAGTTGG AGGGTCTACC ACCACTACAT CCAGGCAGCT GGAGGTTACA 2760 
TGGTCTCTTG CATAATTTTC TTCTTCGTGG TGCTGATCGT CTTCTTAACG ATCTTCAGCT 2820 
TCTGGTGGCT GAGCTACTGG TTGGAGCAGG GCTCGGGGAC CAATAGCAGC CGAGAGAGCA 2880 
ATGG AACCAT GGCAGACCTG GGCAACATTG CAGACAATCC TCAACTGTCC TTCTACCAGC 2940 
TGGTGTACGG GCTCAACGCC CTGCTCCTCA TCTGTGTGGG GGTCTGCTCC TCAGGGATTT 3000 
TCACCAAAGT CACGAGGAAG GCATCCACGG CCCTGCACAA CA AGCTCT TC AACAAGGTTT 3060 
TCCGCTGCCC CATG AGTTTC TTTG ACACCA TCCCAATAGG CCGGCTTTTG AACTGCTTCG 3120 
CAGGGGACTT GGAACAGCTG GACCAGCTCT TGCCCATCTT TTCAGAGCAG TTCCTGGTCC 3180
TGTCCTTAAT GGTG ATCGCC GTCCTGTTG A TTGTCAGTGT GCTGTCTCCA TATATCCTGT 3240 
TAATGGG AGC CATAATCATG GTTATTTGCT TCATTTATTA TATGATGTTC AAGAAGGCCA 3300 
TCGGTGTGTT CAAGAGACTG G AGAACTATA GCCGGTCTCC TTTATTCTCC CACATCCTCA 3360 
ATTCTCTGCA AGGCCTGAGC TCCATCCATG TCTATGGAAA AACTGAAGAC TTCATCAGCC 3420 
AGTTTAAGAG GCTGACTGAT GCGCAGAATA ACTACCTGCT GTTGTTTCTA TCTTCCACAC 3480 
GATGGATGGC ATTGAGGCTG GAGATCATG A CCAACCTTGT GACCTTGGCT GTTGCCCTGT 3540 
TCGTGGCTTT TGGCATTTCC TCCACCCCCT ACTCCTTTAA AGTCATGGCT GTCAACATCG 3600 
TGCTGCAGCT GGCGTCCAGC TTCCAGGCCA CTGCCCGG AT TGGCTTGG AG ACAG AGGCAC 3660 
AGTTCACGGC TGTAG AGAGG ATACTGCAGT ACATGAAG AT GTGTGTCTCG G AAGCTCCTT 3720 
TACACATGGA AGGCACAAGT TGTCCCCAGG GGTGGCCACA GCATGGGGAA ATCATATTTC 3780 
AGG ATTATCA CATGAAATAC AGAGACAACA CACCCACCGT GCTTCACGGC ATCAACCTGA 3840 
CCATCCGCGG CCACGAAGTG GTGGGCATCG TGGG AAGGAC GGGCTCTGGG AAGTCCTCCT 3900 
TGGGCATGGC TCTCTTCCGC CTGGTGG AGC CCATGGCAGG CCGGATTCTC ATTGACGGCG 3960 
TGGACATTTG CAGCATCGGC CTGGAGG ACT TGCGGTCCAA GCTCTCAGTG ATCCCTCAAG 4020 
ATCCAGTGCT GCTCTCAGGA ACCATCAG AT TCAACCTAGA TCCCTTTGAC CGTCACACTG 4080 
ACCAGCAGAT CTGGGATGCC TTGGAGAGGA CATTCCTGAC CAAGGCCATC TCAAAGTTCC 4140 
CCAAAAAGCT GCATACAGAT GTGGTGGAAA ACGGTGGAAA CTTCTCTGTG GGGGAGAGGC 4200 
AGCTGCTCTG CATTGCCAGG GCTGTGCTTC GCAACTCCAA GATCATCCTT ATCGATGAAG 4260 
CCACAGCCTC CATTGACATG GAG ACAGACA CCCTGATCCA GCGCACAATC CGTGAAGCCT 4320 
TCCAGGGCTG CACCGTGCTC GTCATTGCCC ACCGTGTCAC CACTGTGCTG AACTGTG ACC 4380 
ACATCCTGGT TATGGGCAAT GGGAAGGTGG TAGAATTTGA TCGGCCGGAG GTACTGCGG A 4440 
AGAAGCCTGG GTCATTGTTC GCAGCCCTCA TGGCCACAGC CACTTCTTCA CTGAGATAAG 4500
GAGATGTGGA GACTTCATGG AGGCTGGCAG CTGAGCTCAG AGGTTCACAC AGGTGCAGCT 4560 
TCGAGGCCCA CAGTCTGCGA CCTTCTTGTT TGGAGATGAG AACTTCTCCT GGAAGCAGGG 4620 
GTAAATGTAG GGGGGGTGGG GATTGCTGGA TGG AAACCCT GGAATAGGCT ACTTGATGGC 4680 
TCTCAAGACC TTAG AACCCC AG AACCATCT AAGACATGGG ATTCAGTGAT CATGTGGTTC 4740 
TCCTTTTAAC TTACATGCTG AATAATTTTA TAATAAGGTA AAAGCTTATA GTTTTCTG AT 4800 
CTGTGTTAGA AGTGTTGCAA ATGCTGTACT GACTTTGTAA AATATAAAAC TAAGGAAAAC 4860 
TC 

Seq ID NO: 48 Protein sequence: 
Protein Accession #: NP_149163.2 

MTRKRTYWVPNSSGGLVNRGIDIGDDMVSGLIYKTYTLQDGPWSQQERNPEAPGRAAVPPWGKYDAALRT

MIPFRPKPRFPAPQPLDNAGLFSYLTVSWLTPLMIQSLRSRLDENTIPPLSVHDASDKNVQRLHRLWEEE

VSRRGIEKASVIXVMLRFQRTRLIFDAIXGICFC1ASVLGPIUIPKILEYSEEQLGNW 

I^ECVKSL^FSSSWIINQRTAIRFRAAVSSFAFEKXIQFK^VIHITSGEAISFFTGDVKYLFEGVCYGPL 

VUTCASLVICSISSYFIIGYTAFIAIIXYLLVFPI^WMTRMAVKAQHHTSEVSDQPJRVTSF^LTCK 

UKMYTWEKPFAKIIEDUtRKERl^^ 

l^SLNUJU^VFFVPUVKGLTNSKSAVMRFKKFFLQESPVFTVQTLQDP 
VNGALELERNGHASEGMTRPRBAIGPEEEGNSLGPELJIKJNLVVSKGMMIXjVCGNTG 
MHLU^SVGVC^Si^YVFQQAWIVSGhnRE^ 
GLNLSGGQKQUSLMWWSDRQIYI^^ 

qullengkjcf^gthselmqkjcgkyaquq™hk^^ 

AVPEHOLTQEEEMEEGSLSWRVYIIHYIQAAGGYMVSCIIFFFVVUN^ 

RESNGTM ADLGNIADNPQLSFYQLVYG LN ALIXICV G VCSSG IFTKVTRKASTALHNKLFNKVFRCPMSF 

FDTIPIGRLLNCFAGDLEQLDQIXPiFSEQFLV^^ 

KKAIGVFKRLEhnfSRSPLFSHIIJ^SLQGLSSIHVYGKTEDnSQFKRLTO 

EJMTNLVTLAVAU^VAFGISSTPYSFKVMAVhnVI^l^SSFQATARIGLETEAQ 

EAPUMEGTSCPQGWPQHGEIIFQDYHMKYRDKTPTVL^^ 

LVEPMAGRIUDGVDICSIGLEDLRSKI^VIPQDPVIXSCT 

SKFPKKLFtTD WENG GNFS VG ERQLIXIARAVLRNS KIIIJDF^T ASIDMETDTUQRTIREAFQG CTV L 
VLAHRVTWLNCDHILVMGNGKVVEEDRPEVUIKKPGSIJAALMATATSSLR 

Seq ID NO: 49 DNA sequence 

Nucleic Acid Accession #: NM_033419

Coding sequence: 18-980 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51

CGAGCCAGGG AGAAAGGATG GCCGGCCTGG CGGCGCGGTT GGTCCTGCTA GCTGGGGCAG 60 
CGGCGCTGGC GAGCGGCTCC CAGGGCGACC GTGAGCCGGT GTACCGCGAC TGCGTACTGC 120 
AGTGCGAAGA GCAGAACTGC TCTGGGGGCG CTCTGAATCA CTTCCGCTCC CGCCAGCCAA 180 
TCTACATGAG TCTAGCAGGC TGGACCTGTC GGGACGACTG TAAGTATGAG TGTATGTGGG 240 
TCACCGTTGG GCTCTACCTC CAGGAAGGTC ACAAAGTGCC TCAGTTCCAT GGCAAGTGGC 300 
CCrTCTCCCG GTTCCTGTTC TTTCAAG AGC CGGCATCGGC CGTGGCCTCG TTTCTCAATG 360 
GCCTGGCCAG CCTGGTG ATG CTCTGCCGCT ACCGCACCTT CGTGCCAGCC TCCTCCCCCA 420 
TGTACCACAC CTGTGTGGCC TTCGCCTGGG TGTCCCTCAA TGCATGGTTC TGGTCCACAG 480 
TTTTCCACAC CAGGG ACACT GACCTCACAG AG AAAATGGA CTACTTCTGT GCCTCCACTG 540 
TCATCCTACA CTCAATCTAC CTGTGCTGCG TCAGGACCGT GGGGCTGCAG CACCCAGCTG 600 
TGGTCAGTGC CTTCCGGGCT CTCCTGCTGC TCATGCTGAC CGTGCACGTC TCCTACCTGA 660 
GCCTCATCCG CTTCGACTAT GGCTACAACC TGGTGGCCAA CGTGGCTATT GGCCTGGTCA 720 
ACGTGGTGTG GTGGCTGGCC TGGTGCCTGT GGAACCAGCG GCGGCTGCCT CACGTGCGCA 780 
AGTGCGTGGT GGTGGTCTTG CTGCTGCAGG GGCTGTCCCT GCTCGAGCTG CTTGACTTCC 840 
CACCGCTCTT CTGGGTCCTG GATGCCCATG CCATCTGGCA CATCAGCACC ATCCCTGTCC 900 
ACGTCCTCTT TTTCAGCTTT CTGGAAGATG ACAGCCTGTA CCTGCTG AAG GAATCAGAGG 960 
ACAAGTTCAA GCTGGACTGA AGACCTTGGA GCGAGTCTGC CCCAGTGGGG ATCCTGCCCC 1020
CGCCCTGCTG GCCTCCCTTC TCCCCTCAAC CCTTGAGATG ATTTTCTCTT TTCAACTTCT 1080 
TGAACTTGGA CATGAAGGAT GTGGGCCCAG AATCATGTGG CCAGCCCACC CCCTGTTGGC 1140
CCTCACCAGC CTTGGAGTCT GTTCTAGGG A AGGCCTCCCA GCATCTGGGA CTCGAGAGTG 1200 
GGCAGCCCCT CTACCTCCTG GAGCTGAACT GGGGTGGAAC TGAGTGTGCT CTTAGCTCTA 1260
CCGGGAGGAC AGCTGCCTGT TTCCTCCCCA TCAGCCTCCT CCCCACATCC CCAGCTGCCT 1320 
GGCTGGGTCC TG AAGCCCTC TGTCTACCTG GGAGACCAGG GACCACAGGC CTTAGGG ATA 1 380 
CAGGGGGTCC CCTTCTGTTA CCACCCCCCA CCCTCCTCCA GG ACACCACT AGGTGGTGCT 1440 
GGATGCTTGT TCTTTGGCCA GCCAAGGTTC ACGGCGATTC TCCCCATGGG ATCTTGAGGG 1500 

ACCAAGCTGC TGGGATTGGG AAGGAGTTTC ACCCTGACCA TTGCCCTAGC CAGGTTCCCA 1560
GG AGGCCTCA CCATACTCCC TTTCAGGGCC AGGGCTCCAG CAAGCCCAGG GCAAGG ATCC 1620 
TGTGCTGCTG TCTGGTTGAG AGCCTGCCAC CGTGTGTCGG GAGTGTGGGC CAGGCTGAGT 1680 
GCATAGGTGA CAGGGCCGTG AGCATGGGCC TGGGTGTGTG TGAGCTCAGG CCTAGGTGCX3 1740 
CAGTGTGGAG ACGGGTGTTG TCGGGGAAGA GGTGTGGCTT CAAAGTGTGT GTGTGCAGGG 1 800 

GGTGGGTGTG TTAGCGTGGG TTAGGGGAAC GTGTGTGCGC GTGCTGGTGG GCATGTGAGA 1860
TGAGTGACTG CCGGTGAATG TGTCCACAGT TGAGAGGTTG GAGCAGGATG AGGGAATCCT 1920 
GTCACCATCA ATAATCACTT GTGGAGCGCC AGCTCTGCCC AAGGCGCCAC CTGGGCGGAC 1980 
AGCCAGG AGC TCTCCATGGC CAGGCTGCCT GTGTGCATGT TCCCTGTCTG GTGCCCCTTT 2040 
GCCCGCCTCC TGCAAACCTC ACAGGGTCCC CACACAACAG TGCCCTCCAG AAGCA GCCCC 2100 

TCGGAGGCAG AGGAAGGAAA ATGGGGATGG CTGGGGCTCT CTCCATCCTC CTTTTCTCCT 2160
TGCCTTCGCA TGGCTGGCCT TCCCCTCCAA AACCTCCATT CCCCTGCTGC CAGCCCCTTT 2220 
GCCATAGCCT GATTTTGGGG AGGAGGAAGG GGCGATTTG A GGGAGAAGGG GAGAAAGCTT 2280 
ATGGCTGGGT CTGGTTTCTT CCCTTCCCAG AGGGTCTTAC TGTTCCAGGG TGGCCCCAGG 2340 
GCAGGCAGGG GCCACACTAT GCCTGCGCCC TGGTAAAGGT GACCCCTGCC ATTTACCAGC 2400 

AGCCCTGGCA TGTTCCTGCC CCACAGGAAT AGAATGGAGG GAGCTCCAGA AACTTTCCAT 2460
CCCAAAGGCA GTCTCCGTGG TTG AAGCAGA CTGGATTTTT GCTCTGCCCC TGACCCCTTG 2520 
TCCCTCTTTG AGGGAGGGGA GCTATGCTAG GACTCCAACC TCAGGGACTC GGGTGGCCTG 2580 
CGCTAGCTTC TTTTG ATACT GAAAACTTTT AAGGTGGG AG GGTGGCAAGG GATGTGCTTA 2640 
ATAAATCAAT TCCAAGCCTC AAAAAAAAAA AAAAAAAAAA AA 


Seq ID NO: 50 Protein sequence:
Protein Accession #: NP_219487.1

MKDVGPESCG QPTPCWPSPA LESVLGKASQ HLGLESGQPL YLLELNWGGT ECALSSTGRT 60
AACFLPISLL PTSPAAWLGP EALCLPGRPG TTGLRDTGGP LLLPPPTLLQ DTTRWCWMLV 120 
LWPAKVHGDS PHGILRDQAA GIGKEFHPDH CPSQVPRRPH HTPFQGQGSS KPRARILCCC 180 
LVESLPPCVG SVGQAECIGD RAVSMGLGVC ELRPRCAVWR RVLSGKRCGF KVCVCRGWVC 




Seq ID NO: 51 DNA sequence 

Nucleic Acid Accession #: XM_059098.1

Coding sequence: 178-618 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51 

GATGTACACT CTGAAGTG AG CACATTCCTG TTGGCAGGAC ATGACACCTT GGCAGCAAGC 60 
ATCTCCTGGA TCCTTTACTG CCTGGCTCTG AACCCTGAGC ATCAAGAGAG ATGCCGGGAG 120 

GAGGTCAGGG GCATCCTGGG GGATGGGTCT TCTATCACTT GGGACCAGCT GGGTGAGATG 180
TCGTAC ACCA CAATGTGCAT CAAGGAGACG TGCCGATTG A TTCCTGCAGT CCCGTCCATT 240 
TCCAGAGATC TCAGCAAGCC ACTTACCTTC CCAGATGGAT GCACATTGCC TGCAGGGATC 300 
ACCGTGGTTC TTAGTATTTG GGGTCTTCAC CACAACCCTG CTGTCTGGAA AAACCCAAAG 360 
GTCTTTGACC CCTTGAGGTT CTCTCAGGAG AATTCTGATC AGAGACACCC CTATGCCTAC 420 

TTACCATTCT CAGCTGGATC AAGGAACTGC ATTGGGCAGG AGTTTGCCAT GATTGAGTTA 480
AAGGTAACCA TTGCCTTGAT TCTGCTCCAC TTCAGAGTGA CTCCAGACCC CACCAGGCCT 540 
CTTACTTTCC CCAACCATTT TATCCTCAAG CCCAAGAATG GGATGTATTT GCACCTGAAG 600 
AAACTCTCTG AATGTTAGAT CTCAGGGTAC AATGATTA AA CGTACTTTGT TTTTCGAAGT 660 
TAAATTTACA GCTAATG ATC CAAGCAGATA GAAAGGGATC AATGTATGGT GGGAGG ATTG 720 

60 GAGGTTGGTG GGATAGGGOT CTCTGTGAAO AQATCCAAAA TCA TTTCTA G QTACACAGTG 780 
TGTCAGCTAG ATCTGTTTCT ATATAACTTT GGG AGATTTT CAG ATCTTTT CTGTTAAACT 840 
TTCACTACTA TTAATGCTGT ATACACCAAT AG ACTTTCAT ATATTTTCTG TTGTTTTTAA 900 
AATAGTTTTC AG AATTATGC AAGTAATAAG TGCATGTATG CTCACTGTCA AAAATTCCCA 960 
ACACTAGAAA ATCATGTAGA ATAAAAATTT TAAATCTCAC TTCACTTAGC CGACATTCCA 1020 

TGCCCTGACC AATCCTACTG CTTTTCCTAA AAACAGAATA ATTTGGTGTG CATTCTTTCA 1080
GACTTTTTCC TATACATTTT ATATGTAGAA ATGTAGCAAT GTATTTGTAT AGATGTGATC 1 140 
ATTCCTATAT TGTTATTG AT TTTTTTCACT TAATAAAAAT TCACCTTATT CCTT 

Seq ID NO: 52 Protein sequence: 
Protein Accession #: XP_059098.1

1 11 21 31 41 51 

MSYTTMCIKE TCRLIPAVPS ISRDLSKPLT FPDGCTLPAG ITVVLSIWGL HHNPAVWKNP 60 
KVFDPLRFSQ ENSDQRHPYA YLPFSAGSRN CIGQEFAMIE LKVTIALILL HFRVTPDPTR 120
PLTFPNHFIL KPKNGMYLHL KKLSEC 

Seq ID NO: 53 DNA sequence

Nucleic Acid Accession #: NM_030916

Coding sequence: 1-1533 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51 

ATGCCCCTGT COCTGGGAGC CG AG ATGTGG GGGCCTGAGG CCTGGCTGCT GCTGCTGCTA 60 
CTGCTGGCAT CATTTACAGG CCGGTGCCCC GCGGGTG AGC TGGAGACCTC AGACGTGGTA 120 
ACTGTGGTGC TGGGCCAGGA OGCAAAACTG CCCTGCTTCT ACCGAGGGGA CTCCGGCGAG 180 
CAAGTGGGGC AAGTGGCATG GGCTCGGGTG GACGCGGGCG AAGGCGCCCA GGAACTAGCG 240 
CTACTGCACT CCAAATACGG GCTTCATGTG AGCCCGGCTT ACGAGGGCCG CGTGGAGCAG 300 
CCGCCGCCCC CACGCAACCC CCTGGACGGC TCAGTGCTCC TGCGCAACGC AGTGCAGGCG 360 
GATGAGGGCG AGTACGAGTG CCGGGTCAGC ACCTTCCCCG CCGGCAGCTT CCAGGCGCGG 420 
CTGCGGCTCC GAGTGCTGGT GCCTCCCCTG CCCTCACTGA ATCCTGGTCC AGCACTAGAA 480 
GAGGGCCAGG GCCTGACCCT GGCAGCCTCC TGCACAGCTG AGGGCAGCCC AGCCCCCAGC 540 
GTGACCTGGG ACACGGAGGT CAAAGGCACA ACGTCCAGCC GTTCCTTCAA GCACTCCCGC 600 
TCTGCTGCCG TCACCTCAGA GTTCCACTTG GTGCCTAGCC GCAGCATGAA TGGGCAGCCA 660 
CTG ACTTGTG TGGTGTCCCA TCCTGGCCTG CTCCAGGACC AAAGGATCAC CCACATCCTC 720 
CACGTGTCCT TCCTTGCTGA GGCCTCTGTG AGGGGCCTTG AAG ACCAAAA TCTGTGGCAC 780 
ATTGGCAGAG AAGGAGCTAT GCTCAAGTGC CTGAGTGAAG GGCAGCCCCC TCCCTCATAC 840 
AACTGGACAC GGCTGGATGG GCCTCTGCCC AGTGGGGTAC GAGTGGATGG GGACACTTTG 900 
GGCTTTCCCC CACTGACCAC TGAGCACAGC GGCATCTACG TCTGCCATGT CAGCAATGAG 960 
TTCTCCTCAA GGGATTCTCA GGTCACTGTG GATGTTCTTG ACCCCCAGGA AGACTCTGGG 1020 
AAGCAGGTGG ACCTAGTGTC AGCCTCGGTG GTGGTGGTGG GTGTG ATCGC CGCACTCTTG 1080 
TTCTGCCTTC TGGTGGTGGT GGTGGTGCTC ATGTCCCGAT ACCATCGGCG CAAGGCCCAG 1 140 
CAGATGACCC AGAAATATGA GGAGGAGCTG ACCCTGACCA GGGAGAACTC CATCCGGAGG 1200 
CTGCATTCCC ATCACACGGA CCCCAGGAGC CAGOCGGAGG AGAGTGTAGG GCTGAGAGCC 1260 
G AGGGCCACC CTGATAGTCT CAAGGACAAC AGTAGCTGCT CTGTGATGAG TGAAGAGCCC 1320 
GAGGGCCGCA GTTACTCCAC GCTGACCACG GTGAGGGAGA TAGAAACACA GACTGAACTG 1380 
CTGTCTCCAG GCTCTGGGCG GGCCGAGGAG GAGGAAGATC AGGATGAAGG CATCAAACAG 1440 
GCCATGAACC ATTTTGTTCA GGAGAATGGG ACCCTACGGG CCAAGCCCAC GGGCAATGGC 1500 
ATCTACATCA ATGGGCGGGG ACACCTGGTC TGA 

Seq ID NO: 54 Protein sequence:
Protein Accession #: NP_112178.1

1 11 21 31 41 51

MPLSLGAEMW GPEAWLLLLL LLASFTGRCP AGELETSDVV TVVLGQDAKL PCFYRGDSGE 60
QVGQVAWARV DAGEGAQELA LLHSKYGLHV SPAYEGRVEQ PPPPRNPLDG SVLLRNAVQA 120
DEGEYECRVS TFPAGSFQAR LRLRVLVPPL PSLNPGPALE EGQGLTLAAS CTAEGSPAPS 180
VTWDTEVKGT TSSRSFKHSR SAAVTSEFHL VPSRSMNGQP LTCVVSHPGL LQDQRITHIL 240
HVSFLAEASV RGLEDQNLWH IGREGAMLKC LSEGQPPPSY NWTRLDGPLP SGVRVDGDTL 300
GFPPLTTEHS GIYVCHVSNE FSSRDSQVTV DVLDPQEDSG KQVDLVSASV VVVGVIAALL 360
FCLLVVVVVL MSRYHRRKAQ QMTQKYEEEL TLTRENSIRR LHSHHTDPRS QPEESVGLRA 420
EGHPDSLKDN SSCSVMSEEP EGRSYSTLTT VREIETQTEL LSPGSGRAEE EEDQDEGIKQ 480
AMNHFVQENG TLRAKPTGNG IYINGRGHLV 

Seq ID NO: 55 DNA sequence 

Nucleic Acid Accession #: AF007170.1 

Coding sequence: 73-1725 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51 

AAGGAGGCGG CCTCCGGGAA AAGCGACCGC AGGACTCCTG AGAGCAGCCT CCATGAGGCC 60 
CTGGACCAGT GCATGACCGC CCTGGACCTC TTCCTCACCA ACCAGTTCTC AGAAGCACTC 120 
AGCTACCTCA AGCCCAGAAC CAAGGAAAGC ATGTACCACT CACTGACATA TGCCACCATC 180 
CTGGAGATGC AGGCCATG AT GACCTTTGAC CCTCAGG ACA TCCTGCTTGC CGGCAACATG 240 
ATGAAGGAGG CACAG ATGCT GTGTCAG AGG CACCGG AGGA AGTCTTCTGT AACAG ATTCC 300 
TTCAGCAGCC TGGTG AACCG CCCCACGCTG GGCCAATTCA CTGAAGAAGA AATCCACGCT 360 
GAGGTCTGCT ATGCAGAGTG CCTGCTGCAG CGAGCAGCCC TGACCTTCCT GCAGGACGAG 420 
AAC ATGGTG A GCTTCATCAA AGGCGGCATC AAAGTTCGAA ACAGCTACCA GACCTACAAG 480 
GAGCTGGACA GCCTTGTTCA GTCCTCACAA TACTGCAAGG GTGAGAACCA CCCGCACTTT 540 
G AAGGAGG AG TGAAGCTTGG TGTAGGGGCC TTCAACCTGA CACTGTCCAT GCTTCCTACT 600 
AGGATCCTGA GGCTGTTGGA GTTTGTGGGG TTTTCAGGAA ACAAGGACTA TGGGCTGCTG 660 
CAGCTCGAGG AGGGAGCGTC AGGGCACAGC TTCCGCTCTG TGCTCTGTGT CATGCTCCTG 720 
CTGTGCTACC ACACCTTCCT CACCTTCGTG CTCGGTACTG GGAACGTCAA CATCGAGGAG 780 
GCCG AGAAGC TCTTGAAGCC CTACCTGAAC CGGTACCCTA AGGGTGCCAT CTTCCTCTTC 840 
TTTGCAGGGA GG ATTGAAGT CATTAAAGGC AACATTG ATG CAGCCATCCG GCGTTTCGAG 900 
GAGTGCTGTG AGGCCCAGCA GCACTGGAAG CAGTTTCACC ACATGTGCTA CTGGGAGCTG 960 
ATGTG GTGCT TCACCTACAA GGGCCAGTGG AAG ATGTCCT ACTTCTACGC CGACCTGCTC 1020 
AGCAAGG AGA ACTGCTGGTC CAAGGCCACC TACATTTACA TGAAGGCCGC CTAC^AGC 1080 
ATGTTTGGGA AGG AGG ACCA CAAGCCGTTC GGGG ACGACG AAGTGG AATT ATTTCGAGCT 1140 
GTGCCAGGCC TGAAGCTCAA GATTGCTGGG AAATCTCTAC CCACAGAGAA GTTTGCCATC 1200 
CGG AAGTCCC GGCGCTACTT CTCCTCCAAC CCTATCTCGC TGCCAGTGCC TGCTCTGGAA 1260 
ATGATGTACA TCTGGAACGG CTACGCCGTG ATTGGG AAGC AGCCGAAACT CACGGATGGG 1320 
ATACTTGAGA TTATCACTAA GGCTGAAGAG ATGCTGG AG A AAGGCCCAGA GAACGAGTAC 1 380 
TCAGTGGATG ACGAGTGCTT GGTGAAATTG TTGAAAGGCC TGTGTCTGAA ATACCTGGOC 1440
CGTGTCCAGG AGGCCGAGGA GAATTTTAGG AGCATCTCTG CCAATGAAAA GAAG ATTAAA 1500 
TATGACCACT ACTTGATCCC AAACGCCCTG CTGGAGCTGG c ^^^^^^^^^^ * ^ A 

GACAGAAACG AAGAGGCCAT CAAACTTTTG GAATCTGCCA AGCAAAACTA CA^GA^AC 1620 
TCCATGGAGT CAAGGACACA CTTTCGAATC CAGGCAGCCA CACTCCAAGC CAAGTOTCC 1680 
CTAGAGAACA GCAGCAGATC CATGGTCTCA TCAGTGTCCT TGTAGCTTTG TGCAC5CAGTTJ 1740 
CCGGGCTGGA AGACAGAGAC AGCTGGACAG AGCTCCTGAA AACATTTCAA AATACCCCCT 1800 
CCCCCTGCCC TGCCCTGCCT TTGGGGTCCA CCGGCACTCC AGTTGGATGG CACAACATAG 1 860 
TGTATCCGTG CAGAAGCCGA GCTGGCATTT TCACCAGTGT AGCCAAGGGC CTTTGCCAAG 1920 
GGCAGAGCAG GTGG AGCCCT CTGCCTGCCC TATCACACAT ACGGGTACTT GCTTTTCACT 1980 
GTGATGTTTA AOAGAATGTA TGAACAGTTT ACATTTTCCT TAGAAATACA TTGATGGGAT 2040 
CACAGTTGGC TTTAAAAACC AACAACAATC AACCACCTGT AAGTCTTTGT CTTCACCTAT 2 00 
TATCATCTGG AGGTAAATCT CTTTATATGA TGATGCCAAA GGGCAAATTG CI ^ C ^JJ 1 [^ n 
TCAGCAAGTT CTCAGCTTGT GTGACGGAAG GTCCTTCAGA GGACCTGAGG AATGCCTGGG 2220 
AGAGGCTAAG CCTCAGGCTT CAATGCTTCT GGGGTTGGGC ATGAGG ATGT ACACAGACAC 2280 
CCACTACCTT ACT ACTCACA CTTCATTTCA CTCCTTTTGT AAATTTCCAA TTT AAAAATCJ 1340 
AAGCACGTCT TTTT AGTG AG ATAAAATCTG AGCTCTTCTG TAGAAAAATC AATCTCTACC 2400 
AGTAG AAAAT GCCAGOGCTT GATGGAAGAG CTGTGTAGCC CTTTCTATGC CAAAGCCAGG 2460 
AAATTTGGCG GGCAGG AGGA GGTTCTCAGA ATCCAGTCTG TATCTTTGCT GTATGCCAAA 2520 
CTGAAACCAC TGGGAATAAT TTATGAAACA TAAAAATCTT CTGTACTTCA CTCCAAGGTA 2580 
CATTTGCTTA CTG ACAGCAT TTTTGTTAAA ACTGTTATTC TTG AAAAAAA AAAAAAAAAA 2640 
AA 

Seq ID NO: 56 Protein sequence:
Protein Accession #: AAC39582.1

1 11 21 31 41 51

MTALDLFLTN QFSEALSYLK PRTKESMYHS LTYATILEMQ AMMTFDPQDI LLAGNMMKEA 60 
QMLCQRHRRK SSVTDSFSSL VNRPTLGQFT EEEIHAEVCY AECLLQRAAL TFLQDENMVS 120
FIKGGIKVRN SYQTYKELDS LVQSSQYCKG ENHPHFEGGV KLGVGAFNLT LSMLPTRILR 180
LLEFVGFSGN KDYGLLQLEE GASGHSFRSV LCVMLLLCYH TFLTFVLGTG NVNIEEAEKL 240
LKPYLNRYPK GAIFLFFAGR IEVIKGNIDA AIRRFEECCE AQQHWKQFHH MCYWELMWCF 300
TYKGQWKMSY FYADLLSKEN CWSKATYIYM KAAYLSMFGK EDHKPFGDDE VELFRAVPGL 360
KLKIAGKSLP TEKFAIRKSR RYFSSNPISL PVPALEMMYI WNGYAVIGKQ PKLTDGILEI 420
ITKAEEMLEK GPENEYSVDD ECLVKLLKGL CLKYLGRVQE AEENFRSISA NEKKIKYDHY 480
LIPNALLELA LLLMEQDRNE EAIKLLESAK QNYKNYSMES RTHFRIQAAT LQAKSSLENS 540 
SRSMVSSVSL 

Seq ID NO: 57 DNA sequence 

Nucleic Acid Accession #: NM_006670.1

Coding sequence: 1-927 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

ATGCCTGGGG GGTGCTCCCG GGGCCCCGCC GCCGGGGACG GGCGTCTGCG GCTGGCGCGA 60 
CTAGCGCTGG TACTCCTGGG CTGGGTCTCC TCGTCTTCTC CCACCTCCTC GGCATCCTCC 120 
TrCTCCTCCT CGGCGCCGTT CCTGGCTTCC GCCGTGTCCG CCCAGCCCCC GCTGCCGGAC 180 
CAGTGCCCCG CGCTGTGCGA GTGCTCCGAG GCAGCGCGCA CAGTCAAGTG CGTTAACCGC 240 
AATCTG ACCG AGGTGCCCAC GGACCTGCCC GCCTACGTGC GCAACCTCTT CmACCGGC 300 
AACCAGCTGG CCAGCAACCA CTTCCTTTAC CTGCCGCGGG ATGTGCTGGC CCAACTGCCC 360 
AGCCTCAGGC ACCTGGACTT AAGTAATAAT TCGCTGGTGA GCCTGACCTA CGTGTCCTTC 420 
CGCAACCTG A CACATCTAG A AAGCCTCCAC CTGGAGGACA ATGCCCTCAA GGTCCTTCAC 480 
AATGGCACCC TGGCTGAGTT GCAAGGTCTA CCCCACATTA GGGTTTTCCT GGACAACAAT 540 
CCCTGGGTCT GCGACTGCCA CATGGCAGAC ATGGTGACCT GGCTCAAGGA AACAGAGGTA 600 
GTGCAGGGCA AAGACCGGCT CACCTGTGCA TATCCGGAAA AAATGAGGAA^TCGGGTCCTC 660 
TTGGAACTCA ACAGTGCTGA CCTGGACTGT GACCCGATTC TTC(XCCATC OTOCAAACC ^720 
TCTTATGTCT TCCTGGGTAT TGTTTTAGCC CTGATAGGCG CTATTTTCCT CCTGGTTTTG 780 
TATTTG AACC OCAAGGGGAT AAAAAAGTGG ATGCATAACA TCAG AG ATGC CTGCAGOGAT 840 
CACATGGAAG GGTATCATTA CAGATATG AA ATCAATGCGG ACCCCAGATT AACAAACCTC 900 
AGTTCTAACT CGG ATGTCCT CGAGTGA 

Seq ID NO: 58 Protein sequence
Protein Accession #: NP_006661.1



1 11 21 31 41 51 

MPGGCSRGPA AGDGRLRLAR LALVLLGWVS SSSPTSSASS FSSSAPFLAS AVSAQPPLPD 60 
QCPALCECSE AARTVKCVNR NLTEVPTDLP AYVRNLFLTG NQLASNHFLY LPRDVLAQLP 120
SLRHLDLSNN SLVSLTYVSF RNLTHLESLH LEDNALKVLH NGTLAELQGL PHIRVFLDNN 180
PWVCDCHMAD MVTWLKETEV VQGKDRLTCA YPEKMRNRVL LELNSADLDC DPILPPSLQT 240
SYVFLGIVLA LIGAIFLLVL YLNRKGIKKW MHNIRDACRD HMEGYHYRYE INADPRLTNL 300
SSNSDVLE 

Seq ID NO: 59 DNA sequence

Nucleic Acid Accession #: NM_024022

Coding sequence: 1-1362 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51 

ATGGGGGAAA ATGATCCGCC TGCTGTTGAA GCCCCCTTCT CATTCCGATC GCTTTTTGGC 60
CTTGATG ATT TGAAA ATAAG TCCTGTTGCA CCAGATGCAG ATGCTGTTGC TGCACAGATC 120 
CTGTCACTGC TGCCATTGAA GTTTTTTCCA ATCATCGTCA TTGGG ATCAT TGCATTGATA 180 
TTAGCACTGG CCATTGGTCT GGGCATCCAC TTCG ACTGCT CAGGG AAGTA CAGATGTCGC 240 
TCATCCTTTA AGTGTATCGA GCTG ATAGCT CGATGTGACG G AGTCTCGG A TTGCAAAGAC 300 

GGGGAGGACG AGTACCGCTG TGTCCGGGTG GGTGGTCAGA ATGCCGTGCT CCAGGTGTTC 360
ACAGCTGCTT CGTGG AAGAC CATGTGCTCC GATGACTGG A AGGGTCACTA CGCAAATGTT 420 
GCCTGTGCCC AACTGGGTTT CCCAAGCTAT GTGAGTTCAG ATAACCTCAG AGTGAGCTCG 480 
CTGGAGGGGC AGTTCCGGGA GGAGTTTGTG TCCATCGATC ACCTCTTGCC AGATGACAAG 540 
GTG ACTGCAT TACACCACTC AGTATATGTG AGGGAGGGAT GTGCCTCTGG CCACGTGGTT 600 

ACCTTGCAGT GCACAGCCTG TGGTCATAGA AGGGGCTACA GCTCACGCAT CGTGGGTGGA 660
AACATGTCCT TGCTCTCGCA GTGGCCCTGG CAGGCCAGCC TTCAGTTCCA GGGCTACCAC 720 
CTGTGCGGGG GCTCTGTCAT CACGCCCCTG TGG ATCATCA CTGCTGCACA CTGTGTTTAT 780 
GACTTGTACC TCCCCAAGTC ATGGACCATC CAGGTGGGTC TAGTTTCCCT GTTGG ACAAT 840 
CCAGCCCCAT CCCACTTGGT GG AGAAGATT GTCTACCACA GCAAGTACAA GCCAAAGAGG 900 

CTGGGCAATG ACATCGCCCT TATGAAGCTG GCCGGGCCAC TCACGTTCAA TGAAATGATC 960
CAGCCTGTGT GCCTGCCCAA CTCTGAAGAG AACTTCCCCG ATGGAAAAGT GTGCTGGACG 1020 
TCAGGATGGG GGGCCACAGA GGATGGAGGT GACGCCTCCC CTGTCCTGAA CCACGCGGCC 1080 
GTCCCTTTGA TTTCCAACAA GATCTGCAAC CACAGGGACG TGTACGGTGG CATCATCTCC 1 140 
CCCTCCATGC TCTGCGCGGG CTACCTGACG GGTGGCGTGG ACAGCTGCCA GGGGGA CAGC 1200 

GGGGGGCCCC TGGTGTGTCA AGAGAGGAGG CTGTGGAAGT TAGTGGGAGC GACCAGCTTT 1260
GGCATCGGCT GCGCAGAGGT GAACAAGCCT GGGGTGTACA CCCGTGTCAC CTCCTTCCTG 1320 
GACTGGATCC ACGAGCAGAT GGAGAGAGAC CTAAAAACCT GA

Seq ID NO: 60 Protein sequence 
Protein Accession #: NP_076927

1 11 21 31 41 51 

MGENDPPAVE APFSFRSLFG LDDLKISPVA PDADAVAAQI LSLLPLKFFP IIVIGIIALI 60
LALAIGLGIH FDCSGKYRCR SSFKCIELIA RCDGVSDCKD GEDEYRCVRV GGQNAVLQVF 120
TAASWKTMCS DDWKGHYANV ACAQLGFPSY VSSDNLRVSS LEGQFREEFV SIDHLLPDDK 180
VTALHHSVYV REGCASGHVV TLQCTACGHR RGYSSRIVGG NMSLLSQWPW QASLQFQGYH 240
LCGGSVITPL WIITAAHCVY DLYLPKSWTI QVGLVSLLDN PAPSHLVEKI VYHSKYKPKR 300
LGNDIALMKL AGPLTFNEMI QPVCLPNSEE NFPDGKVCWT SGWGATEDGG DASPVLNHAA 360
VPLISNKICN HRDVYGGIIS PSMLCAGYLT GGVDSCQGDS GGPLVCQERR LWKLVGATSF 420
GIGCAEVNKP GVYTRVTSFL DWIHEQMERD LKT 

Seq ID NO: 61 DNA sequence
Nucleic Acid Accession #: NM_006475

Coding sequence: 28-2538 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51

AACAG AACTG CAACGGAGAG ACTCAAGATG ATTCCCTTTT TACCCATGTT TTCTCTACTA 60 

TTGCTGCTTA TTGTTAACCC TATAAACGCC AACAATCATT ATGACAAGAT CTTGGCTCAT 120
AGTCGTATCA GGGGTCGGGA CCAAGGCCCA AATGTCTGTG CCCTTCAACA GATTTTGGGC 180 
ACCAAAAAGA AATACTTCAG CACTTGTAAG AACTGGTATA AAAAGTCCAT CTGTGGACAG 240 
AAAACG ACTG TTTTATATGA ATGTTGCCCT GGTTATATGA G AATGGAAGG AATGAAAGGC 300 
TGCCCAGCAG TTTTGCCCAT TGACCATGTT TATGGCACTC TGGGCATOGT GGGAGCCACC 360 

ACAACGCAGC GCTATTCTGA CGCCTCAAAA CTGAGGGAGG AGATCGAGGG AAAGGGATCC 420
TTCACTTACT TTGCACCGAG TAATGAGGCT TGGG ACAACT TGGATTCTG A TATCCGTAGA 480 
GGTTTGGAGA GCAACGTGAA TGTTGAATTA CTGAATGCTT TACATAGTCA CATGATTAAT 540 
AAG AGAATGT TGACCAAGG A CTTAAAAAAT GGCATGATTA TTCCTTCAAT GTATAACAAT 600 
TTGGGGCTTT TCATTAACCA TTATCCTAAT GGGGTTGTCA CTGTTAATTG TGCTCG A ATC 660 

ATCCATGGGA ACCAGATTGC AACAAATGGT GTTGTCCATG TCATTGACCG TGTGCTTACA 720
CAAATTGGTA CCTCAATTCA AGACTTCATT GAAGCAG AAG ATGACCTTTC ATCTTTTAGA 780 
GCAGCTGCCA TCACATCGGA CATATTGGAG GCCCTTGGAA GAGACGGTCA CTTCACACTC 840 
TTTGCTCCCA CCAATG AGGC TTTTGAGAAA CTTCCACGAG GTGTCCTAGA AAGGTTCATG 900 
GGAGACAAAG TGGCTTCCGA AGCTCTTATG AAGTACCACA TCTTAAATAC TCTCCAGTGT 960 

TCTGAGTCTA TTATGGGAGG AGCAGTCTTT GAGACGCTGG AAGGAAATAC AATTGAGATA 1020
GGATGTGACG GTGACAGTAT AACAGTAAAT GGAATCAAAA TGGTGAACAA AAAGGATATT 1080 
GTGACAAATA ATGGTGTGAT CCATTTGATT GATCAGGTCC TAATTCCTG A TTCTGCCAAA 1 140 
CAAGTTATTG AGCTGGCTGG AAAACAGCAA ACCACCTTCA CGGATCTTGT GGCCCAATTA 1200 
GGCTTGGCAT CTGCTCTGAG GCCAGATGGA GAATACACTT TGCTGGCACC TGTGAATAAT 1260 

GCATTTTCTG ATGATACTCT CAGCATGGTT CAGCGCCTCC TTAAATTAAT TCTGCAGAAT 1320
CACATATTGA AAGTAAAAGT TGGCCTTAAT GAGCTTTACA ACGGGCAAAT ACTGGAAACC 1380 
ATCG G AGGC A AACAG CTC AG AGTCTTCGTA TATCGTACAG CTGTCTGCAT TGAAAATTCA 1440 
TGCATGGAGA AAGGGAGTAA GCAAGGGAGA AACGGTGCGA TTCACATATT CCGC GAGATC 1500 
ATCAAGCCAG CAGAGAAATC CCTCCATGAA AAGTTAAAAC AAGATAAGCG CTTTAGCACC 1560 

TTCCTCAGCC TACTTGAAGC TGCAGACTTG AAAGAGCTCC TGACACAACC TGGAGACTGG 1620
ACATTATTTG TGCCAACCAA TGATGCTTTT AAGGGAATGA CTAGTGAAGA AAAAG AAATT 1680 
CTGATACGGG ACAAAAATGC TCTTCAAAAC ATCATTCTTT ATCACCTGAC ACCAGGAGTT 1740 
TTCATTGGAA AAGGATTTGA ACCTGGTGTT ACTAACATTT TAAAGACCAC ACAAGGAAGC 1800
AAAATCTTTC TGAAAGAAGT AAATGATACA CTTCTGGTGA ATGAATTGAA ATCAAAAGAA 1860 
TCTGACATCA TGACAACAAA TGGTGTAATT CATGTTGTAG ATAAACTCCT CTATCCAGCA 1920 
GACACACCTG TTGGAAATGA TCAACTGCTG GAAATACTTA ATAAATTAAT CAAATACATC 1980 
CAAATTAAGT TTGTTCGTGG TAGCACCTTC AAAGAAATCC CCGTGACTGT CTATACAACT 2040
AAAATTATAA CCAAAGTTGT GGAACCAAAA ATTAAAGTGA TTGAAGGCAG TCTTCAGCCT 2100 
ATTATCAAAA CTGAAGGACC CACACTAACA AAAGTCAAAA TTGAAGGTGA ACCTGAATTC 2160 
AG ACTGATTA AAGAAGGTG A AACAATAACT GAAGTGATCC ATGGAGAGCC AATTATTAAA 2220 
AAATACACCA AAATCATTG A TGGAGTGCCT GTGGAAATAA CTGAAAAAGA GACACGAGAA 2280 

GAACGAATCA TTACAGGTCC TGAAATAAAA TACACTAGGA TTTCTACTGG AGGTGGAGAA 2340
AC AG AAG AA A CTCTGAAOAA ATTGTTACAA GAAGAGGTCA CCAAGGTCAC CAAATTCATT 2400 
GAAGGTGGTG A^TGGTCATTT ATTTGAAGAT GAAGAAATTA AAAGACTGCT TCAGGGAG AC 2460 
ACACCCGTGA GGAAGTTGCA AGCCAACAAA AAAGTTCAAG GTTCTAGAAG ACGATTAAGG 2520 
GAAGGTCGTT CTCAGTGAAA ATCCAAAAAC CAGAAAAAAA TGTTTATACA ACCCTAAGTC 2580 

1 5 AATAACCTGA CCTTAG AAAA TTGTGAGAGC CAAGTTGACT TCA GGAACTG AAACATCAGC 2640 
ACAAAGAAGC AATCATCAAA TAATTCTGAA CACAAATTTA ATATTTTTTT TTCTG AATG A 2700 
GAAACATGAG GGAAATTGTG GAGTTAGCCT CCTGTGGTAA AGGAATTGAA GAAAATATAA 2760 
CACCTTACAC CCTTTTTCAT CTTG ACATTA AAAGTTCTGG CTA ACTTTGG AATCCATTAG 2820 
AG AAAAATCC TTGTCACCAG ATTCATTACA ATTCAAATCG AAGAGTTGTG AACTGTTATC 2880 

20 CCATTGAAAA GACCG AGCCT TGTATGTATG TTATGG ATAC ATAAAATGCA CGCAAGCCAT 2940 
TATCTCTCCA TGGGAAGCTA AGTTATAAAA ATAGGTGCTT GGTGTACAAA ACTT TTTAT A 3000 
TCAAAAGGCT TTGCACATTT CTATATGAGT GGGTTTACTG GTAAATTATG TTATTTTTTA 3060 
CAACTAATTT TGTACTCTCA GAATGTTTGT CATATGCTTC TTGCAATGCA TATTTTTTAA 3120 
TCTCAAACGT TTCAATAAAA CCATTTTTCA GATATAAAG A GAATTACTTC AAATTGAGTA 3 180 

25 ATTCAGAAAA ACTCAAGATT TAAGTTAAAA AGTGGTTTGG ACTTGGGAA 

Seq ID NO: 62 Protein sequence:
Protein Accession #: NP_006466

1 11 21 31 41 51

MIPFLPMFSL LLLLIVNPIN ANNHYDKILA HSRIRGRDQG PNVCALQQIL GTKKKYFSTC 60
KNWYKKSICG QKTTVLYECC PGYMRMEGMK GCPAVLPIDH VYGTLGIVGA TTTQRYSDAS 120
KLREEIEGKG SFTYFAPSNE AWDNLDSDIR RGLESNVNVE LLNALHSHMI NKRMLTKDLK 180
NGMIIPSMYN NLGLFINHYP NGVVTVNCAR IIHGNQIATN GVVHVIDRVL TQIGTSIQDF 240
IEAEDDLSSF RAAAITSDIL EALGRDGHFT LFAPTNEAFE KLPRGVLERF MGDKVASEAL 300
MKYHILNTLQ CSESIMGGAV FETLEGNTIE IGCDGDSITV NGIKMVNKKD IVTNNGVIHL 360
IDQVLIPDSA KQVIELAGKQ QTTFTDLVAQ LGLASALRPD GEYTLLAPVN NAFSDDTLSM 420
VQRLLKLILQ NHILKVKVGL NELYNGQILE TIGGKQLRVF VYRTAVCIEN SCMEKGSKQG 480
RNGAIHIFRE IIKPAEKSLH EKLKQDKRFS TFLSLLEAAD LKELLTQPGD WTLFVPTNDA 540
FKGMTSEEKE ILIRDKNALQ NIILYHLTPG VFIGKGFEPG VTNILKTTQG SKIFLKEVND 600
TLLVNELKSK ESDIMTTNGV IHVVDKLLYP ADTPVGNDQL LEILNKLIKY IQIKFVRGST 660
FKEIPVTVYT TKIITKVVEP KIKVIEGSLQ PIIKTEGPTL TKVKIEGEPE FRLIKEGETI 720
TEVIHGEPII KKYTKIIDGV PVEITEKETR EERIITGPEI KYTRISTGGG ETEETLKKLL 780
QEEVTKVTKF IEGGDGHLFE DEEIKRLLQG DTPVRKLQAN KKVQGSRRRL REGRSQ

Seq ID NO: 63 DNA sequence 
Nucleic Acid Accession #: NM_020974

Coding sequence: 81-3080 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

GGCGTCCGCG CACACCTCCC CGCGCCGCCG CCGCCACCGC CCGCACTCCG CCGCCTCTGC 60 
CCGCAACCGC TGAGCCATCC ATGGGGGTCC CGGGCCGCAA CCGTCCCGGG GCGGCCTGGG 120 
CGGTGCTGCT GCTGCTGCTG CTGCTGCCGC CACTGCTGCT GCTGGCGGGG GCCGTCCCGC 1 80 
CGGGTCGGGG CCGTGCCGCG GGGCCGCAGG AGG ATGTAGA TG AGTGTGCC CAAGGGCTAG 240 
ATGACTGCCA TGCCGACGCC CTGTGTCAGA ACACACCCAC CTCCTACAAG TGCTCCTGCA 300 
AGCCTGGCTA CCAAGGGGAA GGCAGGCAGT GTGAGGACAT CGATGAATGT GGAAATGAGC 360 
TCAATGG AGG CTGTGTCCAT GACTGTTTGA ATATTCCAGG CAATTATCGT TGCACTTGTT 420 
TTOATGGCTT CATGTTGGCT CATGACGGTC ATAATTGTCT TGATGTGGAC GAGTGCCTGG 480 
AGAACAATGG CGGCTGCCAG CATACCTGTG TCAACGTCAT GGGGAGCTAT GAGTGCTGCT 540 
GCAAGGAGGG GTTTTTCCTG AGTGACAATC AGCACACCTG CATTCACCGC TCGGAAGAGG 600 
GCCTGAGCTG CATG AATAAG GATCACGGCT GTAGTCACAT CTGCAAGGAG GCCCCAAGGG 660 
GCAGCGTCGC CTGTGAGTGC AGGCCTGGTT TTG AGCTGGC CAAGAACCAG AGAG ACTGCA 720 
TCTTGACCTG TAACCATGGG AACGGTGGGT GCCAGCACTC CTGTGACGAT ACAGCCGATG 780 
GCCCAGAGTG CAGCTGCCAT CCACAGTACA AGATGCACAC AGATGGGAGG AGCTGCCTTG 840 
AGCGAG AGGA CACTGTCCTG GAGGTGACAG AGAGCAACAC CACATCAGTG GTGGATGGGG 900 
ATAAACGGGT GAAACGGCGG CTGCTCATGG AAACGTGTGC TGTCAACAAT GGAGGCTGTG 960 
ACCGCACCTG TAAGGATACT TCGACAGGTG TCCACTGCAG TTGTCCTGTT GGATTCACTC 1020 
TCCAGTTGGA TGGGAAGACA TGTAAAGATA TTGATGAGTG CCAGACCCGC AATGGAGGTT 1080 
GTGATCATTT CTGCAAAAAC ATCGTGGGCA GTTTTGACTG CGGCTGCAAG AAAGGATTTA 1 140 
AATTATTAAC AGATGAGAAG TCTTGCCAAG ATGTGGATGA GTGCTCTTTG GATAGGACCT 1200 
GTGACCACAG CTGCATCAAC CACCCTGGCA CATTTGCTTG TGCTTGCAAC CGAGGGTACA 1260 
CCCTGTATGG CTTCACCCAC TGTGGAGACA CCAATGAGTG CAGCATCAAC AACGGAGGCT 1320 
GTCAGCAGGT CTGTGTGAAC ACAGTGGGCA GCTATGAATG CCAGTGCCAC CCTGGGTACA 1380 
AGCTCCACTG GAATAAAAAA GACTGTGTGG AAGTGAAGGG GCTCCTGCCC ACAAGTGTGT 1440 
CACCCCGTGTGTCCCTGCAC TGCGGTAAGA GTGGTGGAGG AGACGGGTGC TTCCTCAGAT 1500 
GTCACTCTGG CATTCACCTC TCTTCAGATG TCACCACCAT CAGGACAAGT GTAACCTTTA 1560
AGCTAAATGA AGGCAAGTGT AGTTTGAAAA ATGCTGAGCT GTTTCCCGAG GGTCTGCGAC 1620 
CaSa^I^ AG AG AAGCAC AGCTCAGTAA AAGAGAGCTT CCGCTACGTA AACCTTACAT 1680 
GCAGCTCTGG CAAGCAAGTC CCAGG AGCCC CTGGCCGACC AAGCACCCCT AAGG AAATGT 1740 
5 ^AYCACTGYT^GTTTGAG CTTGAAACTA ACGVAAAGGAGGTGACAGCTTCTTGTGACC 1800 
TCAGCTGCaVcOT^ ACCGAGAAGC GGCTCCGTAA AGCCATCCGC ACGCT^GA^ 1 860 
AGGCCGTCCA CAGGG AGC AG TTTCACCTCC AGCTCTCAGG CATGAACCTC GACGTGGCTA 1920 
CAG AACATCT GAACGCCAGG CAGAGTCCTG TGGAGTGGGC CAGGGTCATG 1 980 
CAGAAAACCA ATGTGTCAGT TGCAGGGCTG GGACCTATTA TGATGGAGCA CGAGAACGCT 2040 

10 GCA^TG TCCAAATGGA ACCTTCCAAA ATGAGGAAGG ACAAATGACT TGT^CCAT 2100 
GCCCAAGACC AGGAAATTCT GGGGCCCTGA AGACCCCAGA AGCTTGGAAT ATGTCTCAAT 2160 
GTCGAGGTCT GTGTCAACCT GGTGAATATT CTGCAGATGG CTTTGCACCT TGCCAGCTCT 2220 
GTGCCCTGGG CACGTTCCAG CCTGAAGCTG GTCGAACTTC CTGCTTCCCC TGTGGAGGAG 2280 
gStTGCCAC CAAACATCAG GGAGCTACTT CCTTTCAGGA CTGTGAAACC AGAGTTCAAT 2340 

15 G^SIS ACATTTCTAC AACACCACCA CTCACCGATG TATTCGTTGC CCAGTGGGAA 2400 
CATACCAGCC TGAATTTGG A AAAAATAATT GTGTTTCTTG CCCAGGAAAT ACTACG ACTG 2460 
ACTTTC ATGG CTCCACAAAC ATAACCCAGT GTAAAAACAG AAGATGTGG A GGGGAGCT^^ 2520 
GAGATTTCAC TGGGTACATT GAATCCCCAA ACTACCCAGG CAATTACCCA GCCAACACCG 2580 
AGTGTACGTG GACCATCA AC CCACCCCCCA AGCGCCGCAT CCTGATCGTG GTCCCTGAG A 2640 

20 TCrrCCTGCC CATAGAGGAC GACTGTGGGG ACTATCTGGT GATGCGGAAA ACCOTCAT 2700 
CCAATTCTGT GACAACATAT G AAACCTGCC AGACCTACG A ACGCCCCATC GCCTTCACCT 2760 
CCAGGTCAAA GAAGCTGTGG ATTCAGTTCA AGTCCAATG A AGGGAACAGC GCTAGAGGGT 2820 
TCCAG^G^TCCC ATACGTG ACA TATGATG AGG ACTACCAGGA ACTCATTG AA G ACATAGTTC 2880 

25 TCAAGGCTCT GTTTGATGTC CTGGCCCATC CCCAGAACTA TTTCAAGTAC ACAGCCCAGG 3000 
AGTCCCGAGA GATGTTTCCA AGATCGTTCA TCCGATTGCT ACCTTCCAAA GTGTCCAGGT 3060 
TT^TG AG ACC TTACAAATG A CTCAGCCCAC GTGCCACTCA ATACAAATGT TCTGCTATAG 3120 

CCG TATCAGT GACTCATTAG AGTTCAATTT TTATAGATAA TACAG ATATT TTGGTAAATT 3240 
30 G^ACTTGGTT T^TCTTTCXX AGCATCGTGG ATGTAGACTG AGAATGGCTT ^^JP^^L.^^. 
CAGCTTCTCA CTGCTGTGGG CGGATGTCTT GGATAGATCA CGGGCTGGCT GAGCTG^CT 3360 
TTGGTCAGCC TAGGTG AG AC TCACCTGTCC TTCTGGGGTC TTACTCCTCC TCAAGG AGTC 3420 
TGTAG^GGA^^G^iAGGCC^^ AG AATAAGCT GCTTATTCTG AAACTTCAGC TTCCTCTAGC 3480 
CCGGCCCTCT CTAAGGGAGC CCTCTGCACT CGTGTGCAGG CTCTG ACCAG GCAG AACAGG 3540 
35 ^f^^^OGAGACCCCTGCAGG CTCCCTCCAC CCACCTTG AG A^TGjGGAGG 3600 
^ A^cTgWcCACAGCCT^^ 

AGTTCTAAGC AGTGCTCGTG AAAAAAAAAA GCAGAAAGAA TTAGAAATAA ATAAAAACTA 3720 
AGCACTTCTG GAGACAT 

Seq ID NO: 64 Protein sequence:

Protein Accession #: NP_066025.1

1 11 21 31 41 51

45 MGVAGRNRPG AAWAVLLLLL LLPPLLLLAG AVPPGRGRAA GPQEDVDECA QGLDDCHADA 60 
^Sr^C^YQGE GRQCEDIDEC GNELNGGCVH DCLNIPGNYR CTCFDGFMLA 120 
^HNOWC ^CLH^NGGCQ HTCVNVMGSY ECCCKEGFFL SDNQHTCIHR SEEGLSCMNK J80 
^CSMC^ APRGSVACEC RDCILTCKHG NGGCQHSCDD TADGPECSCH 240 

50 CTGWCSOV GFTLQLDGKT CKDIDECQTR NGGCDHFCKN IVGSFDCGCK KGFKJXTDEK 360 

TVG SYEOQCH PG YKLHWNKK DCVEVKGLLP TSVSPRVSLH CGKSGGGDGC FLRCHSGI^ «» 
SSDVTTIRTS VTFKLNEGK.C SLKNAELFPE GLRPALPEKH ^SV^^^^ N^J^^GKO^ 5 i? 
PGAPGRPSTP KEMFITVEFE LETNQKEVTA SCDLSCIVKR TEKRLRKAIR TLXUCAVHREQ 600 

55 ffiS. DVAKKPPRTS EROAESCGVG ^^^ S ^J^J)^^^ ™ 
TFQNEEGQMT CEPCPRPGNS GAUCTPEAWN MSECGGLCQP ^S^^vS^^T^ 
PEAGRTSCFP CGGGLATKHQ G ATSFQDCET RVQCSPGHFY NTTTHRCIRC PVG^PEFG 780 
KKNCVSCPGN TTTDFDGSTN rrQCKNRRCG GELGDFTGYIE^NYPGNYPVANTEC^^ 
PPPKRRILIV VPEIFLPIED DCGDYLVMRK TSSSNSVTTY ETCQTYERPI AFTSRSKKLW 900 

60 SSgNS ARGFQVPYYT YDEDYQELIE DIVRDGRLYA SENKQEILKD KKLDCALFDV 960 
LAHPQNYFKY TAQESREMFP RSFIRLLRSK VSRFLRPYK 

Seq ID NO: 65 DNA sequence 
Nucleic Acid Accession #: NM_007210
Coding sequence: 1-1869 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51 

ATGAGGCTCC TCCGCAG ACG CCACATGCCC CTGCX3CCTGG CCATG^TGGG ™COCCm 60 
70 F^rrnTrr TCTTCCTCCT GCATAGGGAT GTGAGCAGCA GAGAGGAGGC CACAGAGAAG 120 - 

ATGAACAACC TTAOAG ATTC AATGCCCAAG CICCAAATCA J^^^^p .(i 
ACTCTGTTCT CCATAAACCA GTCCTOCCTC CCTGGGTTCT ATACTC^GC TGAACTOAAG 300 
SmCTCGO AAOGGCCACC ACAGGACCCC AATGCCCCTG GGGCAGATGG AAAAGCATTT 360 
75 CAGAAGAGCA AGTGGACCCC CCTGGAGACC CAGGAAAAGG AAGA^GCTA_ TAAGAAGCAC^ 420 
TCTTTCAATG CCTTTGCCAG CGACCGGATC TCCCTGCAG A C ^™?S°^£?£r «0 
CGACCACCTG AGTGTGTGGA CCAGAAGTTC CGGCGCTGCC CCCCACTGGC CACCACCAGC 540 
GTGATCATTG TGTTCCACAA CGAAGCCTGG TCCACACTGC TGCGAACAGT GTACAGCGTC 600
CTACACACCA CCCCTGCCAT CTTGCTCAAG GAGATCATAC TGGTGGATGA TGCCAGCACA 660 
GAGGAGCACC TAAAGGAGAA GCTGGAGCAG TACGTGAAGC AGCTGCAGGT GGTGAGGGTG 720 
GTGCGGCAGG AGGAGCGGAA GGGGTTGATC ACCGCCCGGC TGCTGGGGGC CAGCGTGGCA 780 
5 CAGGCGGAGG TGCTCACGTT CCTGGATGCC CACTGTGAGT GCTTCCACGG CTGGCTGGAG 840 
CCCCTCCTGG CTCGAATCGC TG AGGACAAG ACAGTGGTGG TGAGCCCAGA CATCGTCACC 900 
ATCGACCTTA ATACTTTTGA GTTCGCCAAG CCCGTCCAGA GGGGCAGAGT CCATAGCCGA 960 
GGCAACTTTG ACTGGAGCCT GACCTTCGGC TGGGAAACAC TTCCTCCACA TG AGAAGCAG 1020 
AGGCGCAAGG ATGAAACATA CCCCATCAAA TCCCCGACGT TTGCTGGTGG CCTCTTCTCC 1080 
1 0 ATCCCCAAGT CCTACTTTGA GCACATCGGT ACCTATGATA ATCAGATGG A G ATCTGGGGA 1 1 40 
GGGGAGAACG TGGAAATGTC CTTCCGGGTG TGGCAGTQTQ GGGGCCAGCT GGAGATCATC 1200 
CCCTGCTCTG TCGTAGGCCA TGTGTTCCGG ACCAAGAGCC CCCACACCTT CCCCAAGGGC 1260 
ACTAGTGTCA TTGCTCGCAA TCAAGTGCGC CTGGCAGAGG TCTGGATGGA CAGCTACAAG 1320 
AAGATTTTCT ATAGGAGAAA TCTGCAGGCA GCAAAG ATGG CCCAAGAGAA ATCCTTCGGT 1380 
15 GACATTTCGG AACGACTGCA GCTGAGGGAA CAACTGCACT GTCACAACTT TTCCTGGTAC 1440 
CTGCACAATG TCTACCCAGA GATGTTTGTT CCTGACCTGA CGCCCACCTT CTATGGTGCC 1 500 
ATCAAGAACC TCGGCACCAA CCAATGCCTG GATGTGGGTG AGAACAACCG CGGGGGGAAG 1560 
CCCCTCATCA TGTACTCCTG CCACGGCCTT GGCGGCAACC AGTACTTTGA GTACACAACT 1620 
CAGAGGGACC TTCGCCACAA CATCGCAAAG CAGCTGTGTC TACATGTCAG CAAGGGTGCT 1680 
20 CTGGGCCTTG GGAGCTGTCA CTTCACTGGC AAGAATAGCC AGGTCCCCAA GGACG AGGAA 1740 
TGGG AATTGG CCCAGGATCA GCTCATCAGG AACTCAGGAT CTGGTACCTG CCTGACATCC 1 800 
CAGGACAAAA AGCCAGCCAT GGCCCCCTGC AATCCCAGTG ACCCCCATCA GTTGTGGCTC 1 860 
TTTGTCTAGG ACCCAGATCA TCCCCAGAGA GAGCCCCCAC AAGCTCCTCA GGAAACAGGA 1920 
TTGCTGATGT CTGGGAACCT GATCACCAGC TTCTCTGGAG GCCGTAAAGA TGGATT TCTA 1980 
25 AACCCACTGG GTGGCAAGGC AGGACCTTCC TAATCCTTGC AACAACATTG G GCCCA TTTT 2040 
CTTTCCTTCA CACCGATGGA AGAGACCATT AGGACATATA TTTAGCCTAG CGTTTTCCTG 2100 
TTCTAGAAAT AGAGGCTCCC AAAGTAG G G A AGGCAGCTGG GGGAGGGTTC AGGGCAGCAA 2160 
TGCTGAGTTC AAGAAAAGTA CTTCAGGCTG GGCACAGTGG CTCATGCCTG AAATCCTAGC 2220 
ACTTTGGGAA GACAATGTGG G AG AATGGCT TG AGCCCAGG AGTTCAAGAC CGGCCTGAGC 2280 
3 0 AACATAGTGA GG ATCCCATC TCTACGCCCA CCCTCCCCCC GGCAAAAAAA AAAGCTGGGT 2340 
ATGGTGGCTT ATGCCTGTAG TCGCAGCTAC TCAG AAGGCT GAGGTGGGAG GATTGCTTGT 2400 
TCCCCGGAGG TTGAAGCTAC AGTG AGCCTT GATTGTGTCA CTGCACTCCA GCCTGGGCAA 2460 
CAGGTAAGAC TCTGTCTCAA AAAAAAAACA A AAAAGAAGA AGAAAAGTAC TTCTACAGCC 2520 
ATGTCCTATT CCTTGATCAT CCAAAGCACC TGCAGAGTCC AGTGAAATGA TATATTCTGG 2580 
3 5 CTGGGCACAG TGGCTCACAC CTGTAATCCT AGCACTTTGG GAGGCCAAGG CAGGTGG ATC 2640 
ACCTGAGGTC AGAAGTTTGA AACCAGCCTG GACTACATGG TGAAACTCCA TCTCTACTAA 2700 
AAGTACAAAA ATTAGCTGGG CATGATGGCA CGCACCTGCA GTCCCAGCTA CTTGGGAGGC 2760 
TGAGGCAGGA GAATCACTCG AACCCAGGAG GCAGAGGTTG CAGTGAGCCA AGACAGCACC 2820 
ATTGCACCCC AGCCTGAGCA ACAAGAGCGA AACTCCATCT CAGG AAAAAA AAAAAAAAAA 2880 



Seq ID NO: 66 Protein sequence: 
Protein Accession #: NP_009141

1 11 21 31 41 51

MRIXRRRHMP LRLAMVGCAF VLFLFLLHRD VSSREEATEK PWUCSLVSRK DHVLDLMLEA 60 
MNNLRDSMPK LQIRAPEAQQ TLFSINQSCL PGFVTPAEUC PFWERPPQDP NAPGADGKAP 120 
QKSKWTPLET QEKEEGYKKH CFNAFASDRI SLQRSLGPDT RPPECVDQKF RRCPPLATTS 180

VIIVFHNEAW STLLRTVYSV LHTTPAILUC EULVDDAST EEHLKEKLEQ YVKQLQWRV 240
VRQEERKGLI TARLLGASVA QAEVLTFLDA HCECFHGWLE PIXARIAEDK TVWSPDIVT 300 
IDLNTFEFAK PVQRGRVHSR GNFDWSLTFG WETLPPHEKQ RRKDETYPDC SPTFAGGLFS 360 
IPKSYFEFDG TYDNQMEIWG GENVEMSFRV WQCGGQLEH PCSVVGHVFR TKSPHTFPKG 420
TSVIARNQVR LAEVWMDSYK KIFYRRNLQA AKMAQEKSFG DISERLQLRE QLHCHNFSWY 480

LHNVYPEMFV PDLTPTFYGA IKNLGTNQCL DVGENNRGGK PLIMYSCHGL GGNQYFEYTT 540
QRDLRHNIAK QLCLHVSKGA LGLGSCHFTG KNSQVPKDEE WELAQDQLIR NSGSGTCLTS 600
QDKKPAMAPC NPSDPHQLWL FV 

Seq ID NO: 67 DNA sequence
Nucleic Acid Accession #: NM_014112

Coding sequence: 600-4484 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51

TTCCTCCGCG AAGGCTCCTT TGATATTAAT AGTGTTGGTO TCTTGAAACT OACGTAATGC 60
GCGGAGACTG AGGTCCTGAC AAGCGATAAC ATTTCTGATA AAGACCCGAT CTTACTGCAA 120
TCTCTAGCGT (XTCTTTTTT GGTGCTGCTG GTTTCTCCAG ACCTCGCGTC CTCTCGATTG 180
CTCTCTCGCC TTCCTATTTC TTTTTTTTTT TTTTAAACAA AAAACAACAC CCCCTCCCCT 240
CTCCCACCCG GCACCGGGCA CATCCTTGCT CTATTTCCTT TCTCTTTCTC TCTCTCTCTC 300

TCTCTTTTTT AATAAGGGTG GGGGAGGGAA AGGGGGGGGA GGCAGGAAAG ACCTTTTTCT 360
CTCCCCCCCG CAATAATCCA AGATCAACTC TGCAAACAAC AGAAGACGGT TCATGGCTTT 420
GGCCGCCGCG CCACCATCTT TCGGGCTGCC GAGGGTGTTC TTGACGATTA ATCAACAGAT 480
GTACAGATCA GCTCTCAAAA TGTCTTCTGT GTCTTCTGAG CGTCTTCTAA GACAATTGCA 540
TTAGCCTCCT GCTAGTTGAC TAATAGAATT AATAATTGTA AAAAGCACTC TAAAGCCACA 600

TGCCTTATGA AGTCAATGCT GGGTATGATT TTACAAATAT GGTCCGGAAA AAGAACCCCC 660
CTCTGAGA^CGTTGC^GTGAAGGCXjAGOGCCAGATCCTGGAGCCTATAGGTACAGAAA 720 
GCAAGGTATC TGGAAAGAAC AAAGAATTCT CTGCAGATCA GATGTCAGAA AATACGGATC 780






.^.^rr-ATr-r- -rnrAfiAACTA AATCATAAGO AGGAACATAG CTTGCATGTT CAAGATCCAT 840 
m^G^G TA^GAAGG^C TTC3AAAAGCG CAGTTCTGAG TGAGAAGGCT GGCTTCAATT 900 
^T^r^ r^T/^GGGA QGMMXTVC CCTCCTTTCC GCATO ATG AG GTGACAGACA 960 
rl^^M^^CAm^AGCTG CTGGGGGAGT CIGTGAGCCC TTG AAGTCTC 1020 
CG^S^ AG^SaSSg ATATGGCCTG CACCCCCTCA GGOOACTCAC IMO 
TrrVrAPA A A GGAAGATCAG AAGATGTCAC CAAAGGCTAC AGAGGAAACA GGGCAAGCAC 1 MO 
3 J££%£ AG^aTtoT C^GGTTTGA GCCCAGTTTC AGTGGCCTCA AAAAACCCAC 1200 
MctSc^S^CTOTAAGaS-GA ATAAATCCAA AACTGACTTA CTGCTGAATG 1260 
AcScC^GArcCGGCACCT CTGTCTCCAG AGCTTCAGGA CTTTAAATGC AATATCTGTG 1320 
r a™t£ttta rTACGGCAAC GACCCCACAG ATCTGATTAA GCACTTCCGA AAGTATCACT 1380 

wggI^taaS^ 

ItaacSg^gttcagccattccaaag acttccagaa GGTCAACCGT TCTGTGTTTT 500 
CTGGTCTGCT GCAGGACATC AATTCTTCAA GGCCTGTTTT ACTAAATGGG ACCTATGATC 1560
TGCAGGTCACTTCAGGTGGA ACATTCATTG GCATTGGACG G AAAAC A^A GATTGCC AAG 1620 
G^CACCAA GTATTTCCGC TGTAAATTCT GCAATTTCAC TTATATGGGC AACTCATCCA 1680 
^A^TAGA ACAACATTTT CTTCAG ACTC ACCCAAACAA AAT AAA AGCT TCTCTCCCCT 1740 
CCTCTOAGCTTGCAj^AACCT TCAGAGAAAA ACTCTAACAA GTCCATCCCT KACTTCAAT 1800 
SaCTGATTC TGGAGACTTG GGAAAATGGC AGGACAAGAT AACAGTCAAA GCAGGAGATG 1860 
A^ACTCCTGT TGGGTACTCA GTGCCCATAA AGCCCCTCGA TTCCTCTAG A CAAAATGGTA 1920 
90 C^SaC^GTTACTAC TGGTGTAAAT TTTGTAGTTT CAGCTGTGAG TC*TCTAGCT 1980 
CA^a7a^GCTAGAACAT TATGGCAAGC AGCACGGAGC AGTGCAGTCA GGCGGCCTTA 2040 
A^SaGAGTT AAATGATAAG CTTTCCAGGG GCTCTOTCATTAATCAOAAT GATCTAGCCA 2100 
^ T^rTTr ArA A^GAGAGACA ATG ACCAAGA CAGACAAGAG CTCGAGTGGG GCTAAAAAGA 2160 

a^aJttct^wg^ggg^^^ 

9 C f-Tf ACTTCCG ATATTCCAAAAGCCATGGOC CTGATGTAAT TGTAGTGGGG CCACTTCTCC 2280 
CTCaSItca ACAGCTCCAT AACATTCACA AGTGTACCAT TAAACACTGT C^CTCTC 2340 

ccagSctScccag^ 

pi ii a a fJTA a TTfiTTCCCAC TGTGCACTCT TGCTTCTGCA CTTGTCTCCT UUUULUUC 1 u/*tw 
C a^ctS2£ AGTCAaXcAT CAGTCCCATC AGTGTTCATT CACCACCCCT GACGTAGATC 2520 
O ft Sa^St^ TCACTATGAA ACTGTGCATG AGTCCCAAGC ATCGGATGTC AAACAAGAAG 2580 
3 JaaIt^OTGCAAGGATCGGATGGGCAGC AGTCTGTCMGGAAAGCAAAGAACACTCAT 2640 

^^A^ TOA^AtTaCCCAAGTGG AAGAAG AGAT TTCCCGACAC TACAGGAGAG 2700 

a S^SSctG ^CGCCTGTGT CTGGCACCCA AGAGCAGACA AAGACTCTAA 3120 
A^CAGTAT^TC TMIaTCGGG A G AAAACAAGT CCAAGG ATGA ATOCCAO^CC^TOTXjJ^OOAKKIO 

l^^r^^^AT-S-^^QCAG^GlCCT GGTCAGCCAA ACTCTGGATA 3720 
50 SSSSmO~W 3780 

55 TGAGTCACGT GCCTGGCCTA CC AAATCCTT ^^^^^j^f^^J^^^^^^fP^QQQ^ 

ATCTGCCTCC TCATTTTTCA GCTGTTGGAT CAGACAATGA CATTCCTCTA ^GATTTGGCGA 4140 
TriAGCATTC CAGACCTGGG CCAACTGCAA ACGGTGCCTC CAAGGAGAAA ACGAAGGCAC 4200 
? AO^lS aJS^TC^ GGTCCCTTGA ATGTAGTAAA AACAGAGAAA GTTGATAGAA 4260 

GC^GCATCTTTGCACGGACAAATATGACTTCACAACACATATC^ 

^aJaYJ r I a AfiAG ACCAAA ACTGTCTAGA ATTTGAAAGG GTTTACATAT TATTATACTA 4860 
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ATAGATGQAT ACAAATGTCA GACCGTGGGT TTAATTTGTT ™ G ^^I?S?*JJ^™ 
CACAAGGTAACCTGCTGTATTTAmAmT^ 

GTGGTCAGGC AGCGTCTAAG GTTACGTTAC CACAGACTG A CAOTGGTAT AT^CCAGC 5580 
m CAATCCCTTC ATTAAATGTA TACAGATTTA GTTAAGTAGC ATTAAATAGG ATTOTAG AA 5640 
5 • GTATGTCCTC ATAG AACTTT TAATACTTAA GGCTTTGTAA AAACTATCCA TGA AGCG^A 5700 
GCTCCTCAGC ATAACTGCTC AGGGAAATAG GGCTAAATAA CTG A ACATTA AAJAATJGGT 5760 
TAAAGGTGCT GTTAGTCGAG CCTCAATGCT TGCTACAAGG ATGTATGTAC^GGACTGAC 5820 
TTTAATAATT TGCATTATAT TGTCCCAACC AGTAGTTTAT 7TTTTGCCAC GGAGATGTAG S3S0 
AAgTtaTTAC AAGCTACTGG ATGCACTGTC AGATTAACTT ATTTCA^^ AGAAGTTGGG 5940 
AGAACAAATA GGAAAAAAAA AACTTATTTT TCTAGTAAAT ATTAATGTAT TACATTTCAA 6000
A?A^T^GCCYGAC^^ 

TTCCATCATG CATTAGAGTC AGTTCTGGCT CTGCCTAGCT GTTTACATTT GCAAATGTAG 6120
S^CAAG^^^^ 

GTGCATTAAA GTTGTAAACG GTAACATG AA ACAAATGAAA GTTCTTGCTA TMTGGTATG 6240 
1 5 GAAAACAAGA AGGAAATGAA AATATTTTTA TGCCTACTTA GG AAAAAAAC JAGCACTT 6300 
ATTCATTCCA AGTACTTTTT TTTTTTTAAT TTTTAAGCTC TTAACTCACA TTGTTATGCT 6360 
TAAGATGATA AACATATATC CTCTTTTTAT TGCTTTGTCT ATGTTTCATA TGAAACATTT 6420
CAGAAA^TTTTGATAAGTGTTGCTGGAATCTGCA^^ 

TAGTCGCATT TGCACTCCAT TTTTACATTA ATTCGCAGTT GCTTTGTATC ATTGTTTTGT 6540
TTGGGTTTTG TTTCTTTTTC ACAGTGCCGG GTCTTCGTTT CTTAAAGTTG GATGGCAGGT 6600

2U ™ag™c^^ 

TTGTGTTGTC ATTTTCATTT TTGCATTTTT TTGTTTGCAT ATTAAAAAAA GAGAA^GAG 6720 
AAAGCAAGAG ACAGAAATCA GGACTAAGTC CTCTGCTTCA GTTTCATTGT TAACGGGCCT 6780
TATTCT^ATC TCACCTGTCG CGTAGCTCTA ATATTCACAT AAACTGAAAT AAAGAAGTGG 6840 

25 IItoagga^ 

GGATGCATCT CAAAATGTAT AGCCAGACTT GAG AjGGTGAC A^A^AT CTA^AGA 6*60 
GAGGAGATTC CCCCAAACAA CAATATTTAA TTTTCTTAGT A^AAAG AAT A A^G AAT G C A 7020 
TCGTGGCAAT CCTTAAGCAA CATTATCTAT GTGGACTGCT TAAATCAGCA AAACACCAGA 7080
AGTTTGGT^A ACTTGGGCAA TATGACAAGT ATTACTTTTT GGGCAAAACT ACTCATTAAG 7140 
CAATTTCTCT AGTGTGTCGG ACACAAATAG GTTCTTTATT TTTGGCATGT ATGCCTTTTT 7200

AATACATATC ACTATTCTTG GAATATTTAT GGTCAGTCTA CTTTTTAGTA AAATATTTTT 7320 
GGATAGCGTT GACACGATAG ATCTTATTCC ATACTTCTTT ATTATTGATA ATTTTATTTT 7380
PAT iiUGC TTTCATTATT ATACATATTT TGGTGGAGAA GAGGTTGGGC TTTTTTGAAA 7440 

CATCTCTCAA GAT AT ATT AT AAAATTTATT TTTTTAATTT A AGATTTCT O AATTATTTO 756^0 
TCTTAAATTG TGATTTTAAA CG AGCTATTA TGGTACGG AA CTTTTTTTAA TGAQGAATTT 7620 

CATGATG ATT TAGGAATTTT CTCTCTTGGA ^ GQ ^^^J^^^J^AT^ T740 
CCAGCTAAAA TTGTGTGCCA TTTAAAAACT GAAAATATTT TAAAATTATT TGTCTATATT 7740

40 CT A^ATTGAG CTTTG G ATCA AACTTTAGGC C>GGACGAGC TCATGCGTO TCA^ 

TTTTCTCACT CTTTCTCTCA TCACTCACCT CTGTATTCAT TCTGTTGTTT CGGATAGAAA 7860
AATCATAAAG AGCCAACCCA TCTCAGAACG TTGTGGATTG AGAGAGACAC TACATCACTC 7920
CAAGTATATG AGAAAAGGAC AGAGCTCTAA TTGATAACTC ^^^JJ^^^^^^^t^ 80 
GTATGCCCAA TTCTCTCTAC ATGACATATT GAGATTTTTT TTAATCAACT TTTAAGATAG 8040 

TCATCTTCTG TTCTAAACTG TTCTGTTTTA GTGAAGGTAG ATTTTTATAA AACAAGCATG 8100
GGGATTCTTT TCTAAGGTAA TATTAATGAG AAGGGAAAAA AGTATCTTTA ACAGCTCTTT 8160
GTTGAAGCCT GTGGTAGCAC ATTATGTTTA TAATTGCACA TGTGCACATA ATCTATTATG 8220
ATCCAATGCA AATACAGCTC CAAAAATATT AAATGTATAT ATATTTTAAA ATGCCTG AGO 8280 
AAATACATTT TTCTTAATAA ACTGAAGAGT CTCAGTATGG CTATTAAAAT AATTATTAGC 8340 

50 CTC^^TT^T G^G^CTGCAA AACATCACAA AGTGACCGGT CTTGAGACCT 9^AACTCCT 84* 
GCCCTGTTTA GTAAATAAAA TTAATGCATT TCTAGAGGGG G^TATCTGC CATCCAGTGG 8460 
TGGAAATGTGGAGTAAAGAAGCTGGTGGTCTGCT^^ 

TTAAGTTG AG AGGAGGTCAA CTTTAGCTAC TGTCTTTGGT TTGAGAGCCA TCK3CAAAAAA 8580 
A^iA^SiAjS^AA AAAAAG ATCA AGTCGTCTTT GGTGAGCCAG ^GTG AAA GCTTGCTG AC 8640 
5 5 TGTCCAAGGC ACAAGAGAAA ATTGAGGAAT TG AAATGCA A 9^^A^ATC AAACTAAATA ^700 
TTCTAATCAA AGGTAGGTAC TGTTAGGTGG AATTCTATCA ^AGGCAACT GCAAATG AGA 8760 
AGAAGATAGA AGGACGCCCG TCGGG ACTTT GGAGGGCATT ^G^TTTTCC ^ Q T ^ GA M ? 
CGGCCAAGGG CAGAGGCATG GATTCTTTGC AGAGC^CTTC a ™^^ ™AGTACTG 8880 
TTTCATAGAC AGTGGGCTCA CATGTTCCTG ATAGTGCTGC AOTTGCTTAG AAAGCATCCC 8940 

60 a!£aa£gc^ 

CACATGAAGA AAATGTGAAA TTCAAGAGTA ATCCACACGT GAGAAACTAG ACAATGTACA 9060

St^ 

GAAAAGCAGG AATAACTTTA CCGTGGAAAT AATGTTTAGC TTTTATCAGA GAAAATTGTC 9180
CTTCTAGAGC ATAGAGTCCC AAAACTCAAT TCTGGTTTTC CCCTGTTTTT TTTTTTTTTT 9240

65 t^c^ A ^ 

CCTGTAAAAT TGAAAAATAT ATGTATTAAT AATATTATTA ATAATAATAA TGCTAATGTA 9360
GTACTTGTTT GTAAAGCACT TTGAGATCCT TGGTTGAAAG GCACCATAGG AGTGCCAAGT 9420 
ATTATTATGT GGCCA AGGGG GTTATTT AAA CTGTCAGTTC CCAAAGGCCA GGAAAGGTTO 9480 

70 TG AAG ATGCA A A ACT ATT AC TAGGCTGATA AAATCATAGT TTOTAATGC ^AC^TAA **» 
GGCAAATATC ACAATAATAA ACGCCAAATT CCTTAGGGCG GAC^TTTGA ^S*^™ 9 ™ 
GAAAACTTTG GGGGAGGCAT G AGGGGGGAA CATCTCAAAA JSP^^J^^^J]} 0 
TACAGCAATA TTCACCAGCA GAAAATGTCT TTCATATGGA ATGATTTCAT GTTGCTAAGA 9780

75 TCTTA.CAACA CATG AAATTT TTTCGTTTTA TTTTATTTTG "^JJ^J^^^^^^S^7. ^?J^ n 
TTCTACTCAC AAACATGTTC TTGGTGTATT TCTTATGCAA ACAATCTTCA GGCAGCAAAG 9960
ATGTCTGTTA CATCTAAACT TGAATAATAA AGTTTTACCA CCAGTTACAC A

Seq ID NO: 68 Protein sequence:
Protein Accession #: NP_054831

1 11 21 31 41 51

MPYEVNAGYD FTNMVRKKNP PLRNVASEGE GQILEPIGTE SKVSGKNKEF SADQMSENTD 60
QSDAAELNHK EEHSLHVQDP SSSSKKDLKS AVLSEKAGFN YESPSKGGNF PSFPHDEVTD 120 
RNMLAFSFPA AGGVCEPLKS PQRAEADDPQ DMACTPSGDS LETKEDQKMS PKATEETGQA 180 

QSGQANCQGL SPVSVASKNP QVPSDGGVRL NKSKTDLLVN DNPDPAPLSP ELQDFKCNIC 240
GYGYYGNDPT DLIKHFRKYH LGLHNKTRQD AELDSKILAL HNMVQFSHSK DFQKVNRSVF 300
SGVLQDINSS RPVLLNGTYD VQVTSGGTFI GIGRKTPDCQ GNTKYFRCKF CNFTYMGNSS 360
TELEQHFLQT HPNKIKASLP SSEVAKPSEK NSNKSIPALQ SSDSGDLGKW QDKITVKAGD 420
DTPVGYSVPI KPLDSSRQNG TEATSYYWCK FCSFSCESSS SLKLLEHYGK QHGAVQSGGL 480

NPELNDKLSR GSVINQNDLA KSSEGETMTK TDKSSSGAKK KDFSSKGAED NMVTSYNCQF 540
CDFRYSKSHG PDVIVVGPLL RHYQQLHNIH KCTIKHCPFC PRGLCSPEKH LGEITYPFAC 600
RKSNCSHCAL LLLHLSPGAA GSSRVKHQCH QCSFTTPDVD VLLFHYESVH ESQASDVKQE 660
ANHLQGSDGQ QSVKESKEHS CTKCDFITQV EEEISRHYRR AHSCYKCRQC SFTAADTQSL 720
LEHFNTVHCQ EQDriTANG E EDGHAISTIK EEPKIDFRVY NLLTPDSKMG EPVSESWKR 780 

EKLEEKDGLK EKVWTESSSD DLRNVTWRGA DILRGSPSYT QASLGLLTPV SGTQEQTKTL 840
RDSPNVEAAH LARPIYGLAV ETKGFLQG AP AGGEKSGALP QQYPASGENK SKDESQSLLR 900 
RRRGSGVFCA NCLTTKTSLW RKNANGGYVC NACGLYQKLH STPRPLNUK Q^QDRR 960 
RTRKRLNPEA LQAEQLNKQQ RGSNEEQVNG SPLERRSEDH LTESHQREIP LPSLSKYEAQ 1020 
GSL^SAQO?VLVSQTLC TlHKRMQPLHI QDCSPQESTG DPGNSSSVSE GKGSSERGSP 1080 

IEKYMRPAKS PNYSPPGSPI EKYQYPLFGL PFVHNDFQSE ADWLRFWSKY KLSVPGNPHY 1140
CQ^YPTF NLPPHFSAVG SDNDIPLDLA nCHSRPGPTA ^AS^nCA1200 
PPNVKNEGPL NVVKTEKVDR STQDELSTKC VHCGIVFLDE VMYALHMSCH GDSGPFOCS1 1260 
CQHLCTDKYD FTTHIQRGLH RNNAQVEKNG KPKE 

Seq ID NO: 69 DNA sequence

Nucleic Acid Accession #: XM_073879

Coding sequence: 1-387 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51

ATGGGGTTTG GAGACCAGGG AACGGTGGAA GGGAGCCTAG GAACGTCGAA AAAACCACCT 60
GAAGTGAAAA TGTTTGGAGC CAGTCAAGGT TTGCTGACAA TGGAAACAAA CCAGTCCCTG 120
GC^cIaG^CAGG^ AGTAGTAAAG GTAG ACACTG TTCTCTTCGA GAGTTTATAC 180 
CA^GCG^^G^CACGGGAQCGTGAJG CACTGCCTTG GGGATGATCA CCCCCAGGAA 240 

40 G ACAG G A^G CTC^^CTC TGCCCCAGTT GCAGCCATCG CCTCTCCAGC ACCG ACTCCT 300 
GTCTGTCCTC cScAO^CTC AACACAG AGC ATCTGCCAGT TTCTACAGCA CTGCAGGCAG 360 
AACACTCACT TGCAGGCTGC TAACTAA

Seq ID NO: 70 Protein sequence:
Protein Accession #: XP_073879

1 11 21 31 41 51 

MGFGDQGTVE GSLGTSKKPP EVKMFGASQG IXTMETNQSL AQGTGC^^ VDTVLFESLY 60
HCGFEPIGSVM HCLGDDHPQE DRKAHFSAPV AAIASPAPTP VCPAHHSTQS ICQFLQHCRQ 120
NTHLQAAN 

Seq ID NO: 71 DNA sequence 
Nucleic Acid Accession #: AB033064
Coding sequence: 826-1986 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51

GGGGAACAGC AAATTCAGTC ACAG ACAATC CTCCACTCGG TC A A GA^A CTmCTOT 60 

60 CCTGCCTTGC CCCCCCGCAG GGGGTAAGGA ACTGAGCGTT TAATCT^GCCGOTGGCT 120 
ACCAGCTAAA ATTCTACTTA TCTTAGTTTC TAGTGGATAG CTTTCTTATT TTGCCCATGT 180 
SgmShA ATATACTTTTGTCAGTAGTA GTATCTAGGA GTAGCAGGGA 240 
GAGTG ACAAT AAATTAGCCC CrrCTTTTTT CCXTTatCATTCAGGOOOn ^CCTCTCC 300 
AGAGGGAAAT TACCAGTAAA CTCTTCTAAA TCTTCCACCC CTTCTCAGTC ATACTGTGAA 360

65 GA^AC^AAAGTGGACA TTATTTGACC AGTGAACACG ^CC^CTTC^A^GGjaO 
TTTGTTGTGG CACATGGAGA AACATCTCTT TTAAAATATC TCCCAATT^CCTTTrcACA 480 
ATTTGTATCC ACCTAGGATT TGCTGCTGGG GTAAGTCACT AGATTTATTT CTCAAAGCTC 540 
cSctC^GAGCTGAAAG ACTGACCAAC CATGAACACT AGTAGGGGAT GGGGAAAGGG 600 
GACAGAGCAG AGCCAGTTGT TCCACACTTT GGGAAGCAGG AGTAGCTTTT ATCATCTTCC 660

TCTCGGGAGC AGGCATAGAG ACATAAACTG AGTGAAAATG GGTGGAGGAA GAACTTCTAT 720
I^CGAAC AACATGTGAA GAG AG AGAAC CAAACATAAA GTAAGGAGGG TAGACGTTAC 780 
ATCCAAGAGG AAATAATCCA GGCAAGGAAG CACAAGCTGA TCAAGATGTG TAGTTCTGTG 840 
GCTGCCAAGT TCTCGTTTTT GACAGATCGT CGCATCAGGG AAGACTATCC TCAAAAAGAG 900
A^T^^G^^CATTG AAGGC CAAATGTTGT GAGGAGGAAC TGGACTTTAG G^jCTGTGGTG 960 

75 a^atcaggtcgtgctgac AATCGAGCAA GGAAACCTGG GTCTGCGGAT CAATGGAGAG 1020 
CTAATCACTG CCTACCCACA AGTGGTGGTA GTCAGAGTAC CAACCCCTTG GGTGCAAAGT 1080
GA^GTCACATCACTGTTTT GCGCCATCTA GAGAAGATGG GATGTCGGTT AATGAACCGA 1140
CCTCAAGCCA TCCTGAACTG CGTTAATAAG TTCTGGACAT TTCAAGAGTT GGCTGGCCAT 1200
GGTGTTCCTC TGCCGGATAC TTTCTCTTAT GGTGGCCACG AAAATTTTGC TAAAATGATT 1260
GATGAGGCTG AAGTTCTGGA GTTCCCAATG GTAGTAAAGA ATACGCGGGG TCACAGAGGT 1320
AAAGCTGTTT TCTTGGCTCG AGATAAGCAC CATTTGGCTG ATCTAAGCCA TCTTATTCGC 1380
CATGAAGCGC CATACCTGTT CCAGAAGTAT GTTAAAGAGT CTCATGGACG GGATGTACGT 1440
GTCATTGTCG TGGGAGGCCG TGTGGTTGGC ACCATGTTAC GTTGTTCAAC AGATGGGAGA 1500
ATGCAAAGCA ACTGCTCATT AGGTGGTGTG GGGATGATGT GCTCATTGAG TGAACAAGGG 1560 
AAGCAGCTAG CTATCCAGGT GTCTAATATC CTGGGGATGG ATGTGTGTGG CATTGACCTG 1620 
CTGATGAAAG ATGACGGCTC CTTCTGCGTC TGTGAGGCCA ATGCAAATGT AGGTTTCATC 1680
GCCTTTGATA AGGCTTGTAA TCTAGATGTA GCTGGTATCA TAGCAGACTA TGCCGCCTCC 1740
CTTCTACCCT CTGGCCCGCT CACCCGGCGT ATGTCCCTGC TCTCCGTGGT GTCCACTGCC 1800
AGTGAGACTA GTGAGCCGGA GCTGGGTCCC CCAGCCAGCA CTGCTGTTGA CAACATGAGT 1860
GCAAGTTCCA GCTCTGTTGA CAGCGACCCT GAAAGCACGG AGCGAGAGCT GCTCACCAAG 1920
CTCCCAGGGG GCCTGTTCAA CATGAACCAG CTGCTAGCCA ATGAAATCAA ACTACTGGTG 1980
GACTGACTCC ACTGGTAATT AACCAACAAA ACCCTTGTAA AACTTTCTTT CTTCTTTTCT 2040

ATTTTTAAAA CCAACTTGCA ATGCTGTTCA TGGAGGATGC TCAGGAAGAT GAGAGAAAAT 2100 
TAGTAGGATT AGTTGGAGAG AGTGGGAGAT AGATGAGACC TCTGCTAGTA AGATGTTACT 2160 
TTCATTTACA AATCCTACAA ATAGAGAGGC AGAATAGGTG GGGTATAGAA AAATGTCAGG 2220 
CTCTCATAGT TACCCTTTTA AATTGCTAAA AAATGTGTAT GCTCATAGGC CATGAGGAAC 2280
AAATACTTTT TTTTTTTCAT GGTCCCTTGC TTTTGTTTTT GTACAAAAAA AAATGGTTTT 2340

GCTACAAATA TCCAAGTAGC ATAACTTCAC ATTGTGTTGG AAGATTTGf C ATCAGTGAGG 2400 
AAAACATCTG CTTAAATTAC AGGAATTTTT GTATTATACA GCTCTGAAAA TTCTGCCATT 2460
TCCTTATTAA CTAGCAGCTT TAGTTTGTAG TTTATGAAAT CTTGAGGGGC TCTTTTACTG 2520
GGATTTCTTA TTTTTTTGTT TTTTCCCGCT TAATTTGGTG GGAGGTCAAA TTGAATATAA 2580
CCCAATAAAG GCTTCTTAAT GACAAAATTG GCATGTTTGC ATGATGAAAT GGAAATGAAC 2640
AGTATTGCAA TGTCCGGTAT ACAAAATAAC ATTAATTCAA TGTAGATAAA ATTACACTAG 2700
TTTAAAATAT GTGCATTCAC TTGTATTTGT TAGTGTTTTA GTCTTTTTTG AAAGATGTGC 2760
TCTGTTAATG TTGCTTTTTT TTTTTTTTTT TAATACATGC TAGTCTAACA TTTCCTGCTC 2820
TATGCCTGCA TCTTTAACAA TGGCCAAAGT GAAGAAAATG CTACCTTTTT TGTTAACAAG 2880
ACACTGACTT GAAACATGTA CATTTAAAGC CTTTTATTTT TTCCCTTTTT GTTTTGGTAG 2940

TTGGGCATTT AAATAAGGAC AAGGAAAAAT ATTTTTGGGG GCAAATCAAG AGCCTATGAG 3000
TTCTAAGTAT AAAGCTGAAG TGATTTCGAA TGCCAGCGTT ATATATTTGC ATTTTTCACA 3060
TTTTACGAGG GAGTATATGT GTATGTGTGT GCACGCATGC ATGTGTATGT GTTTTGCTTT 3120
TTGTTTCCAT CAACTAATCA AAAAGGATAA TTTAGAAAAT GG AG CATG AT (K3GAAACAGA 3180 
GTTTTTGACT TTAAAAAACA GATGAGTTGT TTTCATAAGT AGACTCCACT GGGGTAGAGG 3240
TATTCACCTT AAAACATAGG GTGAGTAGAT GCTTTTTTAG GCCTTTTTGT GTATATGTAC 3300
GTTGTTTGTT TTTTTCCTTT TGTTTCTAGC CTGTTCAGTG TACAGTTTAT TCAAGGCTAC 3360
ATGCTTTTCT TTAATGCTTC TGGCTATGCA TTTTCTCTTT TTACATATAG GATTTGGGAT 3420
TGGGGGTGGG TTGGATGTTT TTGTTTGGGG ACTTATTTAG TAGTATTGAG TCTCTTATAG 3480
CCCTACTCTT AAGCCTTCAA TACTGTCCAC TCTTTATATT CCTTTACTTG CAGAATTTAT 3540

AAAAGCCCCC AAACTGCATA TAATATGAGC CTTTAAAACA TGGGTAAAAC TAATCCCATT 3600 
GATGGGTTTG GATGGTATGT TAAGAAATGG AGATGCTGCA GAGCCCAACG TAATTTTTTA 3660 
AACAGCAAGT TTTCCATCTC CCTACG AATC CTCTGAAGCT TTTACCCAAG (XCTTTCTTG 3720 
CCTCTCCAGT GCTATTTTCC TTCAGATGGA CCTTAAACAT AATTTCTTGG ACACTACTAG 3780 
AGAGACTTCG AGGCAATAAT AAAAGATCAG TATTAACCAG CTATAACAGA GGTTTGATCA 3840
TGCTTACTTG TACAGTTTTT CCCCCGTTTT AAAAAGGAAT GTAATAAAAT TTGTTTTTTC 3900
CATAGAATTA AATAATATTA AAATTGAGTG AAAGGTTGAT TGTTGATGAA TAGAATACTA 3960
CCTCTCATCT GTGCAGTGTC TCATTTCACC TCAGAGAAAA GGATACATAA GAGGAGTTTG 4020
TAATTTATCT TAGGATATTC TAATTGCATT TAAAAGAACT TATCTTGCGC AGGGTAAATO 4080 
GGGGACTCAC ATACATATAT TAATACCTCT GACTCATTAA CAGAAAGAAA TACTTGGTAC 4140
TTCTTTCGCT GAATG ACCAT ACTGTGO AGO ATGCATACTA TTTGGTATAG AGAAATAAAT 4200 
GAGGAAGAAA G AACTGCTTA ATTAAATTAT CATTCATATG TTCATATAGA ^ ACCATCTGG 4260 
TTGCCATGTG TATTATGACA CATACACTTT GAATAGTTAC ATATCACAAG TATGTAGTTC 4320
ATGTTTGTGT TGGTGGGGTA AGGCATCAGG AAAAATGTAG TTAGTCTTTT CTTAACTTAT 4380 
55 ACCAAATTAA CCAACTATAT TATAGGAAAT ATGTGAAATT AGTTCATTAG CTTTA^CAC ^4440 
TATTATGCAT TCACATGATA TTAAAACGTA CACTCACATG TTAGAATGAA AAGAGCAGTA 4500
GTTATCTTAG ATTTTAAAAA CATGGATATC TTCTTG AATT CCTTCAAGAT TG AGGJAG AG 4 560 
AATAAGAGCA AATCATTCTG GAAGTACCTT AAGGAAACAA ACAGCAGCAG ATATTTAGGT 4620
TAAACTTATT TTCATAATTG TTTAATAACT TTTGTATAAT CTTCATTGCT ATTATGAGAG 4680
AGAATGTATA TATCAAATAT GTGTAATGAT AAAATCTGAA TTGTAAAATT TTTGTATATT 4740
GTTAAAATTG TAATTCTAAA TTGTATTTCA AAAATGATTA TTTCTGATAT TGTTTTTATG 4800
TCACCCATG A TG AAAACTGG ACTTTATATA TCTAAACATA CAAGTATGAA CJ^CTATT 4860 
TAAAATTTTT AATAGTTTTT TTCTTTTTTG GTGCCTATAA TTGATTGGTC ATTT^CTJ3 4920 
GCTTTTCTCC AATGAACATT GAAATCTTCC TGTATATGTT ACCAATAAGA AAACTACCCT 4980
GGAACAGTAG AAAAACCCAA CAAGAGACTT GGCATTCATC AAGCACATTA TCAGACTTTG 5040
AGAACATATT GAAGGCATTG ACTTTGAAAA TCATCTCTTT TTCTCAAGAA GAAAGCAATG 5100
GAGAAGCAAA TTTGTTTCAT TCAGTGAATC CCCAGTTTGG GGCTTGTGGG GCTTAGAGAC 5160
ATTGTGAAAT CAAATCTTGT GTTATACTTT TCTCCTGGCT CACrrTTTTT a M^™JH A S ™ n 
ATGGGCTATT TGGCTGGTGA GACACGATCC CCTCCTAAGA AAATGTAGGT GCTCAGACAG 5280 
GTAACCACTG CTGCTACTGT TTTTATTTGT TTGTTTGTTC AATTTTATTT AAGATTTGTT 5340
TTTGTTGTAC TAGGATTTTA AAAAATGTAA TATATTGCAG GATTTATAAC CAG 






Seq ID NO: 72 Protein sequence:
Protein Accession #: BAA86552

1 11 21 31 41 51
| | | | | |
MCSSVAAKLW FLTDRRIRED YPQKEILRAL KAKCCEEELD FRAVVMDEVV LTIEQGNLGL 60
RINGELITAY PQVVVVRVPT PWVQSDSDIT VLRHLEKMGC RLMNRPQAIL NCVNKFWTFQ 120
ELAGHGVPLP DTFSYGGHEN FAKMIDEAEV LEFPMVVKNT RGHRGKAVFL ARDKHHLADL 180
SHLIRHEAPY LFQKYVKESH GRDVRVIVVG GRVVGTMLRC STDGRMQSNC SLGGVGMMCS 240
LSEQGKQLAI QVSNILGMDV CGIDLLMKDD GSFCVCEANA NVGFIAFDKA CNLDVAGIIA 300
DYAASLLPSG RLTRRMSLLS VVSTASETSE PELGPPASTA VDNMSASSSS VDSDPESTER 360
ELLTKLPGGL FNMNQLLANE IKLLVD 

Seq ID NO: 73 DNA sequence 
Nucleic Acid Accession #: XM_040080.2

Coding sequence: 159-1104 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51 

CTGAGTGGGG GCGGGGACTG CTGGAGTTGC GGGGCCTGCC TGGGGTAGGG CGGGGCAGGA 60
CAGCTTGGAG ATAGGGCCCG GAATTGCGGG CGTCACTCTG CTCCTGCGAC CTAGCCAGGC 120
GTGAGGGAGT GACAGCAGCG CATTCGCGGG ACGAGAGCGA TGAGTGAGAA CGCCGCACCA 180
GGTCTGATCT CAGAGCTGAA GCTGGCTGTG CCCTGGGGCC ACATCGCAGC CAAAGCCTGG 240
GGCTCCCTGC AGGGCCCTCC AGTTCTCTGC CTGCACGGCT GGCTGGACAA TGCCAGCTCC 300 

TTCGACAGAC TCATCCCTCT TCTCCCGCAA GACTTTTATT ACGTTGCCAT GGATTTCGGA 360
GGTCATGGGC TCTCGTCCCA TTACAGCCCA GGTGTCCCAT ATTACCTCCA GACTTTTGTG 420
AGTGAGATCC GAAGAGTTGT GGCAGCCTTG AAATGGAATC GATTCTCCAT TCTGGGCCAC 480
AGCTTCGGTG GCGTCGTGGG CGGAATGTTT TTCTGTACCT TCCCCGAGAT GGTGGATAAA 540 
CTTATCTTGC TGGACACGCC GCTCTTTCTC CTGGAATCAG ATGAAATGGA GAACTTGCTG 600 

ACCTACAAGC GGAGAGCCAT AGAGCACGTG CTGCAGGTAG AGGCCTCCCA GGAGCCCTCG 660

CACGTGTTCA GCCTGAAGCA GCTGCTGCAG AGGTTACTGA AGAGCAATAG CCACTTGAGT 720 
GAGGAGTGCG GGGAGCTTCT CCTGCAAAGA GGAACCACGA AGGTGGCCAC AGGTCTGGTT 780 
CTGAACAGAG ACCAGAGGCT CGCCTGGGCA GAGAACAGCA TTGACTTCAT CAGCAGGGAG 840 
CTGTGTGCGC ATTCCATCAG GAAGCTGCAG GCCCATGTCC TGTTGATCAA AGCAGTCCAC 900

GGATATTTTG ATTCAAGACA GAATTACTCT GAGAAGGAGT CCCTGTCGTT CATGATAGAC 960

ACGATGAAAT CCACCCTCAA AGAGCAGTTC CAGTTTGTGG AAGTCCCAGG CAATCACTGT 1020 
GTCCACATGA GCGAACCCCA GCACGTGGCC AGTATCATCA GCTCCTTCTT ACAGTGCACA 1080 
CACATGCTCC CAGCCCAGCT GTAGCTCTGG GCCTGGAACT ATGAAGACCT AGTGCTCCCA 1140
GACTCAACAC TGGGACTCTG AGTTCCTGAG CCCCACAACA AGGCCAGGGA TGGTGGGGAC 1200 
AGGCCTCACT AGTCTTGAGG CCCAGCCTAO GATGGTAGTC AGGGGAAGGA GCGAGATTCC 1260
AACTTCAACA TCTGTGACCT CAAGGGGGAG ACAGAGTCTG GGTTCCAGGG CTGCTTTCTC 1320
CTGGCTAATA ATAAATATCC AGCCAGCTGG AGGAAGGAAG GGCAGGCTGG GCCCACCTAG 1380
CCTTTCCCTG CTGCCCAACT GGATGGAAAA TAAAAGGTTC TTGTATTCTC A

Seq ID NO: 74 Protein sequence:

Protein Accession #: XP_040080.1

1 11 21 31 41 51 

45 MSENAAPGUSELKIjWPWgUaAKAWGSL^ 60 
YVAMDFGGHG LSSHYSPGVP YYLQTFVSEI RRVVAALKWN RFSILGHSFG GVVGGMFFCT 120
FPEMVDKLIL LDTPLFLLES DEMENLLTYK RRAIEHVLQV EASQEPSHVF SLKQLLQRLL 180
KSNSHLSEEC GELLLQRGTT KVATGLVLNR DQRLAWAENS IDFISRELCA HSIRKLQAHV 240
LLIKAVHGYF DSRQNYSEKE SLSFMIDTMK STLKEQFQFV EVPGNHCVHM SEPQHVASII 300

SSFLQCTHML PAQL

Seq ID NO: 75 DNA sequence 
Nucleic Acid Accession #: NM_005794 

Coding sequence: 434-1276 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

GGTTCCCTTC CACGCTGTGA AGCTTTGTTC TTTTGGTCTT CATGATAAAT CTTGCTGCTG 60 
CTCACTCGTT GGGTCCGTGC CACCTTTAAG AGCTGTAACA CTCACCGCGA AGGTCTGCAA 120 
CTTCACTCCT GGGGCCAGCA AGACCACGAA TGCACCGAGA GGAATGAACA ACTCTGGACA 180
CACCATCTTT AAGAACCGTA ATACTCACCG CAAGGGTCTG CAACTTCATT CTTGAAGTCA 240
GTGAGGCCAA GAACCCATCA ATTCCGTACA CATTTTGGTG ACTTTGAAGA GACTGTCACC 300 
TATCACCAAG TGGTGAGACT ATTGCCAAGC AGTGAGACTA TTGCCAAGTG GTGAGACCAT 360 
CACCAAGCGG TGAGACTATC ACCTATCGCC AAGTGGCCTG ATTCAGCAGG AAGCATCTCA 420 
GACACCAACC ACTATGCTGT CAGCAGTTGC CCGGGGCTAC CAGGGCTGGT TTCATCCCTG 480
TGCTAGGCTT TCTGTGAGGA TGAGCAGCAC CGGGATAGAC AGGAAGGGCG TCCTGGCTAA 540
CCGGGTAGCC GTGGTCACGG GGTCCACCAG TGGGATCGGC TTTGCCATCG CCCGACGTCT 600 
GGCCCGGGAC GGGGCCCACG TGGTCATCAG CAGCCGGAAG CAGCAGAACG TGGACCGGGC 660 
CATGGCCAAG CTGCAGGGGG AGGGGCTGAG TGTGGCGGGC ATTGTGTGCC ACGTGGGGAA 720 
GGCTGAGGAC CGGGAGCAGC TGGTGGCCAA GGCCCTGGAG CACTGTGGGG GCGTCGACTT 780 
CCTGGTGTGC AGCGCAGGGG TCAACCCTCT GGTAGGGAGC ACTCTGGGGA CCAGTGAGCA 840 
GATCTGGGAC AAGATCCTAA GTGTGAACGT GAAGTCCCCA GCCCTGCTGC TGAGCCAGTT 900
GCTGCCCTAC ATGGAGAACA GGAGGGGTGC TGTCATCCTG GTCTCTTCCA TTGCAGCTTA 960 
TAATCCAGTA GTGGCGCTGG GTGTCTACAA TGTCAGCAAG ACAGCGCTGC TGGGTCTCAC 1020 
TAGAACACTG GCATTGGAGC TGGCCCCCAA GGACATCCGG GTAAACTGCG TGGTTCCAGG 1080 
AATTATAAAA ACTGACTTCA GCAAAGTGTT TCATGGGAAT GAGTCTCTCT GGAAGAACTT 1140
CAAGGAACAT CATCAGCTGC AGAGGATTGG GGAGTCAGAG GACTGTGCAG GAATCGTGTC 1200
CTTCCTGTGC TCTCCAGATG CCAGCTACGT CAACGGGGAG AACATTGCGG TGGCAGGCTA 1260
CTCCACTCGG CTCTGAOAGO AGTGGGGGCG GCTGCGTAGC TGTGGTCCCA GCCCAGGAGC 1320 
CTGAGGGGGT GTCTAGGTGA TCATTTGGAT CTGGAGCAGA GTCTGCCATT CTGCCAGACT 1380 
AGCAATTTGG GGGCTTACTC ATGCTAGGCT TGAGGAAGAA GAAAAACGCT TCGGCATTCT 1440 
CC 

Seq ID NO: 76 Protein sequence:
Protein Accession #: NP_005785

1 11 21 31 41 51
| | | | | |

MLSAVARGYQ GWFHPCARLS VRMSSTGIDR KGVLANRVAV VTGSTSGIGF AIARRLARDG 60
AHVVISSRKQ QNVDRAMAKL QGEGLSVAGI VCHVGKAEDR EQLVAKALEH CGGVDFLVCS 120
AGVNPLVGST LGTSEQIWDK ILSVNVKSPA LLLSQLLPYM ENRRGAVILV SSIAAYNPVV 180
ALGVYNVSKT ALLGLTRTLA LELAPKDIRV NCVVPGIIKT DFSKVFHGNE SLWKNFKEHH 240 
QLQRIGESED CAGIVSFLCS PDASYVNGEN IAVAGYSTRL 



Seq ID NO: 77 DNA sequence

Nucleic Acid Accession #: NM_002425

Coding sequence: 26-1453 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51

AAAGAAGGTA AGGGCAGTGA OAATGATGCA TCTTGCATTC CTTGTGCTGT TGTGTCTGCC 60
AGTCTGCTCT GCCTATCCTC TGAGTGGGGC AGCAAAAGAG GAGGACTCCA ACAAGGATCT 120 
TGCCCAGCAA TACCTAGAAA AGTACTACAA CCTCGAAAAG GATGTGAAAC AGTTTAGAAG 180 
AAAGGACAGT AATCTCATTG TTAAAAAAAT CCAAGGAATG CAGAAGTTCC TTGGGTTGGA 240
GGTGACAGGG AAGCTAGACA CTGACACTCT GGAGGTGATG CGCAAGCCCA GGTGTGGAGT 300
TCCTGACGTT GGTCACTTCA GCTCCTTTCC TGGCATGCCG AAGTGGAGGA AAACCCACCT 360
TACATACAGG ATTGTGAATT ATACACCAGA TTTGCCAAGA GATGCTGTTG ATTCTGCCAT 420
TGAGAAAGCT CTGAAAGTCT GGGAAGAGGT GACTCCACTC ACATTCTCCA GGCTGTATGA 480
AGGAGAGGCT GATATAATGA TCTCTTTCGC AGTTAAAGAA CATGGAGACT TTTACTCTTT 540
TGATGGCCCA GGACACAGTT TGGCTCATGC CTACCCACCT GGACCTGGGC TTTATGGAGA 600
TATTCACTTT GATGATGATG AAAAATGGAC AGAAGATGCA TCAGGCACCA ATTTATTCCT 660
CGTTGCTGCT CATGAACTTG GCCACTCCCT GGGGCTCTTT CACTCAGCCA ACACTGAAGC 720
TTTGATGTAC CCACTCTACA ACTCATTCAC AGAGCTCGCC CAGTTCCGCC TTTCGCAAGA 780
TGATGTGAAT GGCATTCAGT CTCTCTACGG ACCTCCCCCT GCCTCTACTG AGGAACCCCT 840
GGTGCCCACA AAATCTGTTC CTTCGGGATC TGAGATGCCA GCCAAGTGTG ATCCTGCTTT 900
GTCCTTCGAT GCCATCAGCA CTCTGAGGGG AGAATATCTG TTCTTTAAAG ACAGATATTT 960
TTGGCGAAGA TCCCACTGGA ACCCTGAACC TGAATTTCAT TTGATTTCTG CATTTTGGCC 1020
CTCTCTTCCA TCATATTTGG ATGCTGCATA TGAAGTTAAC AGCAGGGACA CCGTTTTTAT 1080 
TTTTAAAGGA AATGAGTTCT GGGCCATCAG AGGAAATGAG GTACAAGCAG GTTATCCAAG 1140
AGGCATCCAT ACCCTGGGTT TTCCTCCAAC CATAAGGAAA ATTGATGCAG CTGTTTCTGA 1200 
CAAGGAAAAG AAGAAAACAT ACTTCTTTGC AGCGGACAAA TACTGGAGAT TTGATGAAAA 1260 
TAGCCAGTCC ATGGAGCAAG GCTTCCCTAG ACTAATAGCT GATGACTTTC CAGGAGTTGA 1320
GCCTAAGGTT GATGCTGTAT TACAGGCATT TGGATTTTTC TACTTCTTCA GTGGATCATC 1380
ACAGTTTGAG TTTGACCCCA ATGCCAGGAT GGTGACACAC ATATTAAAGA GTAACAGCTG 1440
GTTACATTGC TAGGCGAGAT AGGGGGAAGA CAGATATGGO TGTTTTTAAT AAATCTAATA 1500 
ATTATTCATC TAATGTATTA TGAGCCAAAA TGGTTAATTT TTCCTGCATG TTCTGTGACT 1560
GAAGAAGATG AGCCTTGCAG ATATCTGCAT GTGTCATGAA GAATGTTTCT GGAATTCTTC 1620 
ACTTGCTTTT GAATTGCACT GAACAGAATT AAGAAATACT CATOT GCAA T AGGTGAGAGA 1680 
ATGTATTTTC ATAGATGTGT TATTACTTCC TCAATAAAAA GTTTTATTTT GGGCCTGTTC 1740
CTT 
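
The "Coding sequence:" coordinates in these records appear to be 1-based and to include the stop codon: in Seq ID NO: 77 above, positions 26-28 read ATG and positions 1451-1453 read TAG. A minimal Python sketch of how such coordinates could be applied to listing rows follows. The helper names (clean, extract_cds, looks_like_cds) and the short toy sequence are illustrative assumptions and are not part of the listing.

```python
# Sketch only: helper names and the toy sequence are hypothetical.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(raw: str) -> str:
    """Keep nucleotide letters only, dropping spaces and position counters."""
    return "".join(ch for ch in raw.upper() if ch in "ACGTN")


def extract_cds(sequence: str, start: int, end: int) -> str:
    """Return the span start..end, treating both as 1-based and inclusive."""
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]


def looks_like_cds(cds: str) -> bool:
    """Check for whole codons, an ATG start, and a terminal stop codon."""
    return len(cds) % 3 == 0 and cds[:3] == "ATG" and cds[-3:] in STOP_CODONS


# Toy row in the listing's layout: three 10-base blocks and a counter.
toy_row = "AAAGAAGGTA ATGCATCTTT AGGCGAGATA 30"
toy_seq = clean(toy_row)
toy_cds = extract_cds(toy_seq, 11, 22)
print(toy_cds, looks_like_cds(toy_cds))
```

A check of this kind only confirms that a coordinate pair is self-consistent with the printed rows; it cannot detect single-base misreads inside the span.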

Seq ID NO: 78 Protein sequence:
Protein Accession #: NP_002416

1 11 21 31 41 51

MHLAFLVLLC LPVCSAYPLS GAAKEEDSNK DLAQQYLEKY YNLEKDVKQF RRKDSNLIVK 60
KIQGMQKFLG LEVTGKLDTD TLEVMRKPRC GVPDVGHFSS FPGMPKWRKT HLTYRIVNYT 120
PDLPRDAVDS AIEKALKVWE EVTPLTFSRL YEGEADIMIS FAVKEHGDFY SFDGPGHSLA 180
HAYPPGPGLY GDIHFDDDEK WTEDASGTNL FLVAAHELGH SLGLFHSANT EALMYPLYNS 240
FTELAQFRLS QDDVNGIQSL YGPPPASTEE PLVPTKSVPS GSEMPAKCDP ALSFDAISTL 300
RGEYLFFKDR YFWRRSHWNP EPEFHLISAF WPSLPSYLDA AYEVNSRDTV FIFKGNEFWA 360
IRGNEVQAGY PRGIHTLGFP PTIRKIDAAV SDKEKKKTYF FAADKYWRFD ENSQSMEQGF 420
PRLIADDFPG VEPKVDAVLQ AFGFFYFFSG SSQFEFDPNA RMVTHILKSN SWLHC

Seq ID NO: 79 DNA sequence 

Nucleic Acid Accession #: NM_000493.1

Coding sequence: 1-2043 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51

ATGCTGCCAC AAATACCCTT TTTGCTGCTA GTATCCTTGA ACTTGGTTCA TGGAGTGTTT 60 
TACGCTGAAC GATACCAAAC GCCCACAGGC ATAAAAGGCC CACTACCCAA CACCAAGACA 120
CAGTTCTTCA TTCCCTACAC CATAAAGAGT AAAGGTATAG CAGTAAGAGG AGAGCAAGGT 180
a^Sttt^ CAOCAGOCOC i TGCTGGACCT CGAGGGCACC CAGGTCCTTC TGGACCACCA240 
GgSS^CC^G GCwSiG^G TrcTCG ACTC CAAGGAGAGC CAGGGTTGCC AGGACCACOJ 300 
GGACCATCAG CTGTAGGGAA ACCAGGTGTG CCAGGACTCC CAGGAAAACC AGGAGAGAGA 360
5 GGACCATATC gTcCAAAAGO AGATGTTGGA CCAGCTGGCC TACCAGGACC O^GGCCCA 420 

r ArrnACCCA CAGGAGCCCC AGGACCCAGG GGCTTTCCTG GAGAAAAGGG TGCACCAGuA 5W 
S^OTATCaTtcSo^GOGGAA ATGGGATATG CTGCTOTGG raTOCAGGT 
GAGAGGGGTC TTCCAGGCCC TCAGGGTCCC ACAGGACCAT CTCGCCCTCC TGGAGTGGGA 660 
AAAAGAGGTG AAAATGGGGT TCCAGGACAG CCAGGCATCA AAGGTGATAG AGGTTTTCCG 720
fifiAfiAAATGG GACCAATTGG CCCAOCAGGT CCCCAAGGCC CTCCTGGGGA ACGAGGGCCA 780 
GAAGG^TO GaSgoSgG AGCTGCTGGA GCCCCAGGCC AGCCAGGGATTCCAGGAACA «0 
a AArGTrrre CTGGGGCTCC AGGAATAGCT GGGCCCCCAG GGCCTCCTGG CTTTGGGAAA 900 
CCAGGCTTGC CAGGCCTCAA GGGAGAAAGA GGACCTGCTG GCCTTCCTGG GGGTCCAGGT 960 
1 c AAfifi^ A>U^V^GGCC AGCAGGTCTT CCTGGGAAGC CAGGTCTGAC TGGACCCCCT 1020 

15 r rrAAT ATGG GACCCCAAGG ATCAAAAGCC ATCCCGGGTA GCCATGGTCT CCCAGGCCCT 1080 
A^GCTG^A CA^M^^Sc TCGGCCTGCA GG ATACCCTG GGG ^^^?,2J5^TrwA Ar*i inn 
TfVY^raGmn^AGATGGAAA ACCAGGGTAC CCAGGAAAAC CAGGTCTCGA TGGTCCTAAG 1200 
r^AACTCAG GGTC^CAGG TCCAAAAGGT GATCCTGG AG TTGGAGGACC TCCTGGTCTC 1260 
90 ^I^^O TCGGCCCAGC AGGAGCAAAG GGAATGCCCG GACACAATGG AG AGGCTGGC 1320 
20 ^t^^O C£OTGG AAT A^AGGTACT AGAGGCCCTA TTGGGCCACC AGGCATTCCA 1380 

g^SIg^a^g^ 

rr * irra agg gcctcaatgg acccaccggg ccaccagggc ctccaggtcc aagaggcccc 1s00 

?c^Sg!gC C^TOTCC AG^CCCCCT GGGCCTCCAG GCCCACCAGG TC^GCAGTC .560 
25 ItIScT^AGG GTTTTATAAA GGCAGGCCAA AGGCCCAGTC TTTCTGGGACCCCTCTTGTr 620 
25 AG^CCAG^TAACAGGAATGOTWG^ 

cXTGr C C^A C GAC^G C ™ 

ta rr ACGTGC ATGTG AAAG G GACTCATGTT TGGGTAGGCC TGTATAAG AA TGGC ACCCCT 1860 
"2.0 ^ta^TGtI™ CCTATGATGA ATACACCAA A GGCTACCTGG ATCAGGCTTC AGGGAGTGCC 1920 
3U ^^^t^a^^aCAGAAAATGA^AGGTGTGGCTCCAGCTTCCCAATGC CGAGTCAAAT 1980 
rr^CTATACT CCTC^G^G^TC'TCCACT^CTCnTCTCAG GATTCCTAGT GGCTCCAATG 2040 
Tr ATT ACACC (X^iCAGACK^ V^TCTAAATC TTGTGCTAG A AAAAGCATTC TCTAACTCTA 2100 
CCCCACCCTA CAAAATGCAT ATCGAGGTAG GCTGAAAAGA ATGTAATTTT TATTTTCTGA 2160
o c ^^r^rtTXH^MCA GACCAACAAA CCTTCCCCCT GAAAAGTGAG CAGCAACGTA 2220 
AAAACCTATG TC AAGCXTCT CTTG AATITC TAGTTAGCAA J^J/^^^Fj^Tl^^^j^ 2 ^^ 
•rrrrrr A ATA TTAAAAAATA TCACCAAAG A AGTCCTGCTA TGTTAAAAAC AAACAACAAA 

?a£™aa£™^ 
T g£tatt£™a^^ 

TriTfirnxyr ATGGCCCCTT TCATCAATAT CTATTCAAAT ATACAGGTGC ATATATACTT 2640 
rTT A AAGCTC TTATA^A^ AACHXCCAAA ATATTGAAGT TCATCTGAAA TGCAAGGTGC 2700 
^^^rArrT-^AAA^T^C TATGATTGCA G AG AAG CTTT TT ATAT ACCC 2760 

45 A^TAi^ 

m^A^CT TTAATTCCTT ATTGAATCTT ATGTG AT ATG ATTTTCTGGA ™A°^2880 

a^ag™^ttgtgcctg™^ »J 






ATTAGCACAT GTACCTTGTG CCTCCCATTC AAGTGAAGTT ATAA 1 1 1 ALA uuAuumi. 
paa AATTMA CTAGAAGTGG AGATATATTA TTTATTTATG CACTGTACTG TATTTTTATA 3000 
5t^S GCTGTGCCIC ACTTATTAAAGCACAAAATGTTrTACCTAC 3060 
tSSaTtTa C^CA^ AAATAACATC AATAGATTTT TAGC^GAAT TAATTTGAAA 3!20 
GCA^AATTT GCTGTTCTCA ACCATTCTTT CAAGGCTTTT CATTCGACAC AATAAAAT AA 3180 
CATCAATAGA TTTTTAGG 



Seq ID NO: 80 Protein sequence:
Protein Accession #: NP_000484.1

1 11 21 31 41 51
MLPQIPFLLL VSLNLVHGVF YAERYQTPTG IKGPLPKTKT QFFIPYTIKS KGIAVRGEQG 60

™gp^prg^potppg^gy^ no 

V^rvJrnvr v A^PGPRGP PGPPGIPGPA G1SVPGKPGQ QGPTGAPGPR GFPGEKGAPQ 180 
PO^PPGERGP^GroCTGA^APGQPGIPGT KGLPGAPGIA GPPGPPGFGK 300 
rtS p^lKgSg^G^^^ 360 

kg^gpag^g^ 

roPWPAGAK G^mraEAG PRGAPGIPGT RGPIGPPGIP GFPGSKGDPG SPGPPGPAGI 480 

GLYSSEYVHS SFSGFLVAPM 
Seq ID NO: 81 DNA sequence

Nucleic Acid Accession #: NM_000786

Coding sequence: 332-1861 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
CGCGATTCTC AGGGATTGAT CCGCCTCTTC AGGTAAGTTA TCTTCCGGCC CCGTACCACT 60
GTGCCACAGG CGCAGCCCGC TTCCTCAGGT GCCCTATCCC GCGCAGAAGA CCACGGCTTC 120 
ACAGAGTGTT ATTTAAGGGC GTGGCCAGCG GAACATCCCG CCCCATTCTG TGACGCACGG 180 
GGTGGCGCGC GTGGGACCCG AGGGGTGGGG CTGGGTTTAG TAGGAGACCT GGGGCAAGGC 240
CCCCTGTGG A CGACCATCTG CCAGCTTCTC TCG1TCCGTC G ATTGGGAGG AGCGGTGGCG 300 
ACCTCGGCCT TCAGTGTTTC CGACGGAGTG AATGGCGGCG GCGGCTGGGA TGCTGCTGCT 360
GGGCTTGCTG CAGGCGGGTG GGTCGGTGCT GGGCCAGGCG ATGGAGAAGG TGACAGGCGG 420
CAACCTCTTG TCCATGCTGC TGATCGCCTG CGCCTTCACC CTCAGCCTGG TCTACCTGAT 480
CCGTCTGGCC GCCGGCCACC TGGTCCAGCT GCCCGCAGGG GTGAAAAGTC CTCCATACAT 540
TTTCTCCCCA ATTCCATTCC TTGGGCATGC CATAGCATTT GGGAAAAGTC CAATTGAATT 600
TCTAGAAAAT GCATATGAGA AGTATGGACC TGTATTTAGT TTTACCATGG TAGGCAAGAC 660
ATTTACTTAC CTTCTGGGGA GTGATGCTGC TGCACTGCTT TTTAATAGTA AAAATGAAGA 720
CCTGAATGCA GAAGATGTCT ACAGTCGCCT GACAACACCT GTGTTTGGGA AGGGAGTTGC 780
ATACGATGTG CCTAATCCAG TTTTCTTGGA GCAGAAGAAA ATGTTAAAAA GTGGCCTTAA 840
CATAGCCCAC TTTAAACAGC ATGTTTCTAT AATTGAAAAA GAAACAAAGG AATACTTTGA 900
GAGTTGGGGA GAAAGTGGAG AAAAAAATGT GTTTGAAGCT CTTTCTGAGC TCATAATTTT 960
AACAGCTAGC CATTGTTTGC ATGGAAAGGA AATCAGAAGT CAACTCAATG AAAAGGTAGC 1020
ACAGCTGTAT GCAGATTTGG ATGGAGGTTT CAGCCATGCA GCCTGGCTCT TACCAGGTTG 1080
GCTGCCTTTG CCTAGTTTCA GACGCAGGGA CAGAGCTCAT CGGGAAATCA AGGATATTTT 1140
CTATAAGGCA ATCCAGAAAC GCAGACAGTC TCAAGAAAAA ATTGATGACA TTCTCCAAAC 1200
TTTACTAGAT GCTACATACA AGGATGGGCG TCCTTTGACT GATGATGAAG TAGCAGGGAT 1260
GCTTATTGGA TTACTCTTGG CAGGGCAGCA TACATCCTCA ACTACTAGTG CTTGGATGGG 1320
CTTCTTTTTG GCCAGAGACA AAACACTTCA AAAAAAATGT TATTTAGAAC AGAAAACAGT 1380
CTGTGGAGAG AATCTGCCTC CTTTAACTTA TGACCAGCTC AAGGATCTAA ATTTACTTGA 1440
TCGCTGTATA AAAGAAACAT TAAGACTTAG ACCTCCTATA ATGATCATGA TGAGAATGGC 1500
CAGAACTCCT CAGACTGTGG CAGGGTATAC CATTCCTCCA GGACATCAGG TGTGTGTTTC 1560
TCCCACTGTC AATCAAAGAC TTAAAGACTC ATGGGTAGAA CGCCTGGACT TTAATCCTGA 1620
TCGCTACTTA CAGGATAACC CAGCATCAGG GGAAAAGTTT GCCTATGTGC CAmGGAGC 1680 
30 TGGGCGTCAT CGTTGTATTG GGGAAAATTT TGCCTATGTT CAAATTAAGA CAATTTGGTC 1740 
CACTATGCTT CGTTTATATG AATTTG ATCT CATTGATGGA TACTTTCCCA CTGTGAATTA 1800 
TACAACTATG ATTCACACCC CTGAGAACCC AGTTATCCGT TACAAACGAA GATCAAAATS 1860 
AAAAAGGTTG CAAGGAACGA ATATATGTGA TTATCACTGT AAGCCACAAA GGCATTCGAA 1920 
GAGAATG AAG TGTACAAAAC AACTCTTGTA GTTTACTGTT TTTTTAAGTG TGTAATTCTA 1980 
3 5 AAAGCCAGTT TATG ATTTAG G ATTTTGTTA ACTG A ATGGT TCTATCA AAT ATAAT AGCAT 2040 
TTGACACATT TTCTAATAGT TATGATACTT ATACATGTGC TTTCAGGAAG TTCCTTGGTG 2100 
AAACAATTGT TGAGGGGGG A TCTAGGTAAT TGGCAGATTC TAAATAATAT AATTTCCAGA 2160 
TAGTAATTTT AAG AGTACTC ATCGCTCTTG CCAAAT AAGT TCAGGGTATT CAAATCTTGG 2220 
ACTAGTCCTG CAAGGTATAA AG A ATA A AAA TCCCAGTGAG ATACTTGGAA ACCACAGTTT 2280 
40 ATTATTATTT ATCTGGGCAA TTATTGTGTG TGTGAGGATG GAAGGGTAGG GAATAATCGA 2340 
ACATCTAAAG CCTTGAATAA GAGAATACTA ATTGTTTTGG TATGATGATA CTCAGAAATG 2400 
GAGATATTAT AGO AAAAAG A AATCCTTTGG AATTTTAACT AAAATCACTG CATATGGGAA 2460 
ATTAAG AG AT CCAGG ACCAT ATTTG ATAAG AGTTCCTAAA AATAATGTAA TTATTAATGC 2520 
TAAAGACTGC TCATGTATCT TGATCTAATT ACTAAATAAA TTACATATTT ATTTACCTGA 2580 
45 TAAATATGTA TCTAGTTCTA CAAGGTCACA TTTATGTGGA AGTCCAAAGT CAAGTCCT TA 2640 
GGG GAT AATT TTGTTTTGGG CTCAGTTGTT CCCTGCTTCC TTTTTTTTTT TTTTTTTTTT 2700 
TTGAGATGGA GTCTCGCTCT GTTGCCCAGG CTGGAGTGCA GTGGTGCGAT CTCAGCTCAC 2760 
TGCATCCTCT GCCTCCCGGG TTCAAG CA AT TCTCTGCCTC AGCCTCCCAA GTAGTTGGGA 2820 
TTACAGGCAC CTGCCACCAT GCCTGGCTAA TTTTTTGTAT TTTTAGTAGA G ACGGGGGTT 2880 
5 0 TCACTATGTT GGCTAGGCTG GTCTTGAACT CCTG AGCCTC GTGAGTCCAC CCGCCTTGGC 2940 
CTCCCAAAGT GCTGGGATTA CAGGCATGAG CCACCGCACC TGGCCTTCCC TGCTTCCTCT 3000 
CTAGAATCCA ATT AGG GATG TTTGTTACTA CTCATATTG A TT AAAACAGT TAACAAACTT 3060 
TTTTCTTrn AAAATGTGAG ATCAGTGAAC TCTGGTTTTA AGATAATCTG AAACAAGGTC 3120 
CTTGGGAGTA ATAAAATTGG TCACATTCTG TAAAGCACAT TCTGTTTAGG AATCAACTTA 3180 
5 5 TCTCAAATTG TAACTCGGGG CCTAACTATA TGAGATGGCT G AAAAA AT AC CACATCGTCT 3240 
GTTTTCACTA GOTO ATGCCA AAATATTTTG CTTTATGTAT ATTACAGTTC T TTTTAAA AC 3300 
ACTG G AAG AC TCATGTTAAA CTCT AATTG T G AAGGCAG AA TCTCTGCTAA TTTTTCAGAT 3360 
TAAAATTCTC TTTGAAAAAA T 

Seq ID NO: 82 Protein sequence:

Protein Accession #: NP_000777

1 11 21 31 41 51

MAAAAGMLLL GLLQAGGSVL GQAMEKVTGG NLLSMLLIAC AFTLSLVYLI RLAAGHLVQL 60
PAGVKSPPYI FSPIPFLGHA IAFGKSPIEF LENAYEKYGP VFSFTMVGKT FTYLLGSDAA 120
ALLFNSKNED LNAEDVYSRL TTPVFGKGVA YDVPNPVFLE QKKMLKSGLN IAHFKQHVSI 180
IEKETKEYFE SWGESGEKNV FEALSELIIL TASHCLHGKE IRSQLNEKVA QLYADLDGGF 240
SHAAWLLPGW LPLPSFRRRD RAHREIKDIF YKAIQKRRQS QEKIDDILQT LLDATYKDGR 300
PLTDDEVAGM LIGLLLAGQH TSSTTSAWMG FFLARDKTLQ KKCYLEQKTV CGENLPPLTY 360
DQLKDLNLLD RCIKETLRLR PPIMIMMRMA RTPQTVAGYT IPPGHQVCVS PTVNQRLKDS 420
WVERLDFNPD RYLQDNPASG EKFAYVPFGA GRHRCIGENF AYVQIKTIWS TMLRLYEFDL 480
IDGYFPTVNY TTMIHTPENP VIRYKRRSK

Seq ID NO: 83 DNA sequence

Nucleic Acid Accession #: NM_006551.2
Coding sequence: 64-336 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51

AATTCTAGAA GTCCAAATCA CTCATTGTTT GTGAAAGCTG AGCTCACAGC AAAACAAGCC 60
ACCATGAAGC TGTCGGTGTG TCTCCTGCTG GTCACGCTGG CCCTCTGCTG CTACCAGGCC 120
AATGCCGAGT TCTGCCCAGC TCTTGTTTCT GAGCTGTTAG ACTTCTTCTT CATTAGTGAA 180
CCTCTGTTCA AGTTAAGTCT TGCCAAATTT GATGCCCCTC CGGAAGCTGT TGCAGCCAAG 240
TTAGGAGTGA AGAGATGCAC GGATCAGATG TCCCTTCAGA AACGAAGCCT CATTGCGGAA 300
GTCCTGGTGA AAATATTGAA GAAATGTAGT GTGTGACATG TAAAAACTTT CATCCTGGTT 360
TCCACTGTCT TTCAATGACA CCCTGATCTT CACTGCAGAA TGTAAAGGTT TCAACGTCTT 420
GCTTTAATAA ATCACTTGCT CTAC

Seq ID NO: 84 Protein sequence:
Protein Accession #: NP_006542.1

1 11 21 31 41 51

MKLSVCLLLV TLALCCYQAN AEFCPALVSE LLDFFFISEP LFKLSLAKFD APPEAVAAKL 60
GVKRCTDQMS LQKRSLIAEV LVKILKKCSV


Seq ID NO: 85 DNA sequence

Nucleic Acid Accession #: NM_002317.1

Coding sequence: 231-1484 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51

GGGOCAGGAC TG AG AAAGGG GAAAGGGAAG GGTGCCACGT CCGAGCAGCC GCCTTGACTG 60 
GGGAAGGGTC TGAATCCCAC CCTTGGCATT GCTTGGTGGA GACTGAGATA CCCGTGCTCC 120 
GCTCGCCTCC TTGGTTG AAG ATTTCTCCTT CCCTCACGTG ATTTGAGCCC CGTTTTTATT 1 80 
3 0 TTCTGTG AGC CACGTCCTCC TCG AGCGGGO TCAATCTGGC AAAAGG AGTG ATGCGCTTCG 240 
CCTGG ACCGT GCTCCTGCTC GGGCCTTTGC AGCTCTGCGC GCTAGTGCAC TGCGCCCCTC 300 
- CCGCCGCCGGCCAACAGCAGCCCX:CGCGCGAGCCGCCGG 360 
AGCAGATCCA ATGGGAGAAC AACGGGCAGG TGTTCAGCTT GCTGAGCCTG GGCTCACAGT 420 
ACCAGCCTCA GCGCCGCCGG GACCCGGGCG CCGCCGTCCC TGGTGCAGCC AACGCCTCCG 480 
35 CCCAGCAGCC CCGCACTCCG ATCCTGCTGA TCCGCGACAA CCGCACCGCC GCGGGGCGAA 540 
CGCGGACGGC CGGCTCATCT GGAGTCACCG CTGGCCGCCC CAGGCCCACC GCCCGTCACT 600 
GGTTCCAAGC TGGCTACTCG ACATCTAGAG CCCGCG AAGC TGGGCCCTCG CGCGCGG AG A 660 
ACCAG ACAGC GCCGGG AGAA GTTCCTGCTC TCAGTAACCT GCGGCCGCCC AGCCGCGTGG 720 
ACGGCATGGT GGGCGACGAC CCTTACAACC CCTACAAGTA CTCTGACGAC AACCCTTATT 780 
40 ACAACTACTA CGATACTTAT GAAAGGCCCA GACCTGGGGG CAGGTACCGG CCCGGATACG 840 
GCACTGGCTA CTTCCAGTAC GGTCTCCCAG ACCTGGTGGC CGACCCCTAC TACATCCAGG 900 
CGTCCACGTA CGTGCAGAAG ATGTCCATGT ACAACCTGAG ATGCGCGGCG GAGGAAAACT 960 
GTCTGGCCAG TACAGCATAC AGGGCAGATG TCAGAGATTA TGATCACAGG GTGCTGCTCA 1020 
GATTTCCCCA AAGAGTGAAA AACCAAGGGA CATCAGATTT CTTACCCAGC CGACCAAGAT 1080 
45 ATTCCTGGGA ATGGCACAGT TGTCATCAAC ATTACCACAG TATGGATG AG TTTAGCCACT 1 140 
TGTACCTGCT TGATGCCAAC ACCCAGAGGA GATGGGCTGA AGGCCACAAA GCAAGTTTCT 1200 
GTCTTGAAGA CACATCCTGT GACTATGGCT ACCACAGGCG ATTTGCATGT ACTGCACACA 1260 
CACAGGGATT GAGTCCTGGC TGTTATG ATA CCTATGGTGC AGACATAG AC TGCCAGTGGA 1320 
TTG ATATTAC AGATGTAAAA CCTGGAAACT ATATCCTAAA GGTCAGTGTA AACCCCAGCT 1380 
5 0 ACCTGGTTCC TGAATCTGAC TATACCAACA ATGTTGTGCG CTGTG ACATT CG CTAC AC AG 1440 
GACATCATGC GTATGCCTCA GGCTGCACAA TTTCACCGTA TJAfiAAGGCA AAGCAAAACT 1500 
CCCAATGGAT AAATCAGTGC CTGGTGTTCT GAAGTGGGAA AAAAT AG ACT AACTTCAGTA 1560 
GGATTTATGT ATTTTGAAAA AGAGAACAGA AAACAACAAA AGAATTTTTG TTTGGACTGT 1620 
TTTCAATAAC AAAGCACATA ACTGGATTTT GAACGCTTAA GTCATCATTA CTTGGGAAAT 1680 
55 TTTTAATGTT TATTATTTAC ATCACTTTGT GAATTAACAC AGTGTTTCAA TTCTGTAATT 1740 
ACATATTTGA CTCTTTCAAA AAAAAAAAAA AAAAAAAAAA 

Seq ID NO: 86 Protein sequence:
Protein Accession #: NP_002308.1

1 11 21 31 41 51

MRFAWTVLLL GPLQLCALVH CAPPAAGQQQ PPREPPAAPG AWRQQIQWEN NGQVFSLLSL 60
GSQYQPQRRR DPGAAVPGAA NASAQQPRTP ILLIRDNRTA AGRTRTAGSS GVTAGRPRPT 120
ARHWFQAGYS TSRAREAGPS RAENQTAPGE VPALSNLRPP SRVDGMVGDD PYNPYKYSDD 180
NPYYNYYDTY ERPRPGGRYR PGYGTGYFQY GLPDLVADPY YIQASTYVQK MSMYNLRCAA 240
EENCLASTAY RADVRDYDHR VLLRFPQRVK NQGTSDFLPS RPRYSWEWHS CHQHYHSMDE 300
FSHLYLLDAN TQRRWAEGHK ASFCLEDTSC DYGYHRRFAC TAHTQGLSPG CYDTYGADID 360
CQWIDITDVK PGNYILKVSV NPSYLVPESD YTNNVVRCDI RYTGHHAYAS GCTISPY

Seq ID NO: 87 DNA sequence
Nucleic Acid Accession #: NM_006419.1

Coding sequence: 91-420 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51

TTCGGCACTT GGGAGAAGAT GTTTGAAAAA ACTGACTCTG CTAATGAGCC TGGACTCAGA 60
GCTCAAGTCT GAACTCTACC TCCAGACAGA ATGAAGTTCA TCTCGACATC TCTGCTTCTC 120
ATGCTGCTGG TCAGCAGCCT CTCTCCAGTC CAAGGTGTTC TGGAGGTCTA TTACACAAGC 180
TTGAGGTGTA GATGTGTCCA AGAGAGCTCA GTCTTTATCC CTAGACGCTT CATTGATCGA 240
ATTCAAATCT TGCCCCGTGG GAATGGTTGT CCAAGAAAAG AAATCATAGT CTGGAAGAAG 300
AACAAGTCAA TTGTGTGTGT GGACCCTCAA GCTGAATGGA TACAAAGAAT GATGGAAGTA 360
TTGAGAAAAA GAAGTTCTTC AACTCTACCA GTTCCAGTGT TTAAGAGAAA GATTCCCTGA 420
TGCTGATATT TCCACTAAGA ACACCTGCAT TCTTCCCTTA TCCCTGCTCT GGATTTTAGT 480
TTTGTGCTTA GTTAAATCTT TTCCAGGGAG AAAGAAC1TC CCCATACAAA TAAGGCATGA 540
GGACTATGTG AAAAATAACC TTGCAGGAGC TGATGGGGCA AACTCAAGCT TCTTCACTCA 600
CAGCACCCTA TATACACTTG GAGTTTGCAT TCTTATTCAT CAGGGAGGAA AGTTTCTTTG 660
AAAATAGTTA TTCAGTTATA AGTAATACAG GATTATTTTG ATTATATACT TGTTGTTTAA 720
TGTTTAAAAT TTCTTAGAAA ACAATGGAAT GAGAATTTAA GCCTCAAATT TGAACATGTG 780
GCTTGAATTA AGAAGAAAAT TATGGCATAT ATTAAAAGCA GGCTTCTATG AAAGACTCAA 840
AAAGCTGCCT GGGAGGCAGA TGGAACTTGA GCCTGTCAAG AGGCAAAGGA ATCCATGTAG 900
TAGATATCCT CTGCTTAAAA ACTCACTACG GAGGAGAATT AAGTCCTACT TTTAAAGAAT 960
TTCTTTATAA AATTTACTGT CTAAGATTAA TAGCATTCGA AGATCCCCAG ACTTCATAGA 1020
ATACTCAGGG AAAGCATTTA AAGGGTGATG TACACATGTA TCCTTTCACA CATTTGCCTT 1080
GACAAACTTC TTTCACTCAC ATCTTTTTCA CTGACTTTTT TTGTGGGGGC GGGGCCGGGG 1140
GGACTCTGGT ATCTAATTCT TTAATGATTC CTATAAATCT AATGACATTC AATAAAGTTG 1200
AGCAAACATT TTACTT

Seq ID NO: 88 Protein sequence:
Protein Accession #: NP_006410.1

1 11 21 31 41 51

MKFISTSLLL MLLVSSLSPV QGVLEVYYTS LRCRCVQESS VFIPRRFIDR IQILPRGNGC 60 
PRKEIIVWKK NKSIVCVDPQ AEWIQRMMEV LRKRSSSTLP VPVFKRKIP 

Seq ID NO: 89 DNA sequence

Nucleic Acid Accession #: NM_002652

Coding sequence: 37-477 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51

CTTCTGTGGG ACACATTGCC TTCTGTTTTC TCCAGCATGC GCTTGCTCCA GCTCCTGTTC 60
AGGGCCAGCC CTGCCACCCT GCTCCTGGTT CTCTGCCTGC AGTTGGGGGC CAACAAAGCT 120
CAGGACAACA CTCGGAAGAT CATAATAAAG AATTTTGACA TTCCCAAGTC AGTACGTCCA 180
AATGACGAAG TCACTGCAGT GCTTGCAGTT CAAACAGAAT TGAAAGAATG CATGGTGGTT 240
AAAACTTACC TCATTAGCAG CATCCCTCTA CAAGGTGCAT TTAACTATAA GTATACTGCC 300
TGCCTATGTG ACGACAATCC AAAAACCTTC TACTGGGACT TTTACACCAA CAGAACTGTG 360
CAAATTGCAG CCGTCGTTGA TGTTATTCGG GAATTAGGCA TCTGCCCTGA TGATGCTGCT 420
GTAATCCCCA TCAAAAACAA CCGGTTTTAT ACTATTGAAA TCCTAAAGGT AGAATAATGG 480
AAGCCCTGTC TGTTTGCCAC ACCCAGGTGA TTTCCTCTAA AGAAACTTGG CTGGAATTTC 540
TGCTGTGGTC TATAAAATAA ACTTCTTAAC ATGCTT

Seq ID NO: 90 Protein sequence: 
Protein Accession #: NP_002643.1

1 11 21 31 41 51

MRLLQLLFRA SPATLLLVLC LQLGANKAQD NTRKIIIKNF DIPKSVRPND EVTAVLAVQT 60
ELKECMVVKT YLISSIPLQG AFNYKYTACL CDDNPKTFYW DFYTNRTVQI AAVVDVIREL 120
GICPDDAAVI PIKNNRFYTI EILKVE






Seq ID NO: 91 DNA sequence

Nucleic Acid Accession #: AK000341

Coding sequence: 85-975 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51

G ATAGCGCCG GGCAGAGGGA CCCGGCTACC CTGGACAGCG CATCGCCGCC CGCCCGGGTC 60 
GCCGCGCCAC AGCCGCTGCG GATCATGOAA CATCTAAAGG CCTTTGATGA TGAAATCAAT 120 
GCTTTTTTGG ACAATATGTT TGGACCGCGA GATTCTCGAG TCAGAGGGTG GTTCACGTTG 180 

65 GACTCTTACC TTCCTACCTT TTTTCTTACT GTCATGTATC TGCTCTCAAT ATGGCTGGGT 240 
AACAAGTATA TGAAGAACAG ACCTGCTCTT TCTCTCAGGG GTATCCTCAC CTTGTATAAT 300 
CTTGGAATCA CACTTCTCTC CGCGTACATG CTGGCAGAGC TCATTCTCTC CACTTGGGAA 360 
GG AGGCTACA ACTTACAGTG TCAAGATCTT ACCAGCGCAG GGGAAGCTG A CATCCGGGTA 420 
GCCAAGGTGC TTTGGTGGTA CTATTTCTCC AAATCAGTAG AGTTCCTGGA CACAATTTTC 480 

70 TTCGTTTTGC GGAAAAAAAC GAGTCAGATT ACTTTTCTTC ATGTATATCA TCATGCTTCT 540 
ATGTTTAACA TCTGGTGGTG TGTCTTGAAC TGG ATACCTT GTGG ACAAAG TTTCTTTGGA 600 
CCAACACTGA ACAGTTTTGT CCACATTCTT ATGTACTCCT ACTATGG ACT TTCTGTGTTT 660 
CCATCTATGC ACAAGTATCT TTGGTGGAAG AAATATCTCA CACAGGCTCA GCTGGTGCAG 720 
TTCGTGCTCA CCATCACGCA CACCATGAGC GCCGTCGTGA AACCGTGTGG CTTCCCCTTC 780 

75 QGTTGTCTCA TCTTCCAGTC ATCTTATATG CTAACGTTAG TCATCCTCTT CTTAAATTTT 840 

TATGTTCAGA CATACCG AAA AAAGCCAATG AAGAAAGATA TGCAAGAGCC ACCTGCAGGG 900 
AAAG AAGTGA AGAATGGTTT TTCCAAAGCC TACTTCACTG CAGCAAATGG AGTGATGAAC 960 
AAGAAAGCAC AATAAAAATG AGTAACAGAA AAAGCACATA TACTAGCCTA ACAGATTGGC 1020
TTGTTTTAAA GCAAAGACTG AATTGAAGGT TACATGTTTT AGGATAAACT AATTTCTTTT 1080 
GAGTTCATAA ATCATTTGTA CCCAGAATGT ATTAATATAT TGCTATTAGG TTAATCTGTT 1 140 
AACTGAATGC TTTGATCAGC ATTGAGGTGA TGCTCACCTC CGAGGACCTC AGAACTGGTG 1200 
CAGCTTCTCT CTCCCTCCCT CCCACAGACT GAACCTTTCG CCAGAAGCTG TCCTTATAAC 1260 
GCCTTATACG CATACACAGC CAGGAAACGT GGAGCATTGT TTCTCACAGA GAGTCTCCAA 1320 
ATAAAAAGGG TTTTGTTCAG ATTAAAATGT TTACAACAAA ATGTTAATTA TATTCTAAAT 1380 
ACAGGGTATG TTCTAATCTA TATTAAGCAA TAATGCCAGT GCATAATCAT TCCATTTGTT 1440 
CCTTTAGCAA TCAACCCCAG AAAATATTAA AATGGGATCA TACACAGAAG ATAG AAAAAT 1500 
CTAGCAAAAC TTCTCTTTCT GTAAGCCAGA GTCTTGTCTA TCAGATTCCC ACAACCACTC 1560 
CTGATTCTAA ATTTAGTGAT ATGGTAATGA AATTGGTATT TATTTTAAAT ATTAGTTATT 1620 
CTAAGGAGAA AAAAATGCTT CTGCAAGATT TTCATAATTC AGGGGCTGTG GATAGGATTG 1680 
TTCCTCTGTT TCCCTAATCA TTCATCTGTT CATGTCTCCC TCTTGTGCCA GTCAGCCTAG IV 740 
GTTATACAGA TGCCATGCTC CACACCACGA GCAGTGTACA AATCTGGCTG CCCGTTTACT 1800 
TTCTG AGCAA GCACTGGAGT CCACTCCGAC C1U11C11T GAACATGCAT GCTGCTGGAA 1 860 
TATGTATAAA TCAGAACTAG CAGAAGTAGC AGAGTGATGG GAGCAAAATA GGCACTGAAT 1920 
TCGTCAACTC TTTTTTGTG A GCCTACTTGT GAATATTACC TCAGATACCT GTTGTCACTC 1980 
TTCACAGGTT ATTTAAGTTC TTGAAGCTGG GAGGAAAAAG ATGGACTAGC TTGGAAAGAT 2040 
TCCAGCACTG AGCCGTGAGC CGGTCATGAG CCACGATAAA AAATGCCAGT TTGGCAAACT 2100 
CAGCACTCCT GTTCCCTGCT CAGGTATATG CGATCTCTAC TGAGAAGCAA GCACAAAAGT 21 60 
AG ACCAAAGT ATTAATGAGT ATTTCCTTTC TCCATAAGTG CAGGACTGTT ACTCACTACT 2220 
AAACTCTACC AAGAATGGAA ACCAAGAATA TTTTCTGAAG ATTTTTTTGA AGATTAATTT 2280 
ATACCCTATA AAATAAAACT TGTTAGCTTC GATGAAGTCA AAAAAAAAAA AAAA AAAAAA 

Seq ID NO: 92 Protein sequence:
Protein Accession #: BAA91096.1

1 11 21 31 41 51 

MEHLKAFDDE INAFLDNMFG PRDSRVRGWF TLDSYLPTFF LTVMYLLSIW LGNKYMKNRP 60
ALSLRGILTL YNLGITLLSA YMLAELILST WEGGYNLQCQ DLTSAGEADI RVAKVLWWYY 120
FSKSVEFLDT IFFVLRKKTS QITFLHVYHH ASMFNIWWCV LNWIPCGQSF FGPTLNSFVH 180
ILMYSYYGLS VFPSMHKYLW WKKYLTQAQL VQFVLTITHT MSAVVKPCGF PFGCLIFQSS 240
YMLTLVILFL NFYVQTYRKK PMKKDMQEPP AGKEVKNGFS KAYFTAANGV MNKKAQ

Seq ID NO: 93 DNA sequence

Nucleic Acid Accession #: NM_000044

Coding sequence: 1115-3874 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51 

CGAGATCCCG GGGAGCCAGC TTGCTGGGAG AGCGGQACGG TCCGGAGCAA GCCCACAGGC 60 
AGAGGAGGCG ACAGAGGGAA AAAGGGCCG A GCTAGCCGCT CCAGTGCTGT ACAGGAGCCG 120 
AAGGG ACGCA CCACGCCAGC CCCAGCCCGG CTCCAGCGAC AGCCAACOCCTCTTCCAGCG 180 
CGGCGGCTTC GAAGCCGCCG CCCGGAGCTG CCCTTTCCTC TTCGGTGA AG TTT^ AAAAG 240 
CTGCTAAAGA CTCGGAGGAA GCAAGGAAAG TGCCTGGTAG GACTGACGGC TGCCTTOTC 300 
CTCCTCCTCT CCACCCCGCC TCCCCCCACC CTGCCTTCCC CCCCTCCC^G^CTCTC 360 
CCGCAGCTGC CTCAGTCGGC TACTCTCAGC CAACCCCCCT CACCACCCTT CTCCCCACCC 420 
GCCCCCCCGC CCCCGTCGGC CCAGCGCTGC CAGCCCGAGT TTGCAGAGAG GTAACTCCCT 480 
TTGGCTGCGA GCGGGCG AGC TAGCTGCACA TTGCAAAGAA GGCTCTTAGG AGCCAGGCG A 540 
CTGGGGAGCG GCTTCAGCAC TGCAGCCACG ACCCGCCTGG ™GAATTCCGGCGGAGAGA 600 
ACCCTCTGTT TTCCCCCACT CTCTCTCCAC CTCCTCCTGC CTTCCCCACC CCG AGTGCGG 660 
AGCAGAGATC AAAAGATGAA AAGGCAGTCA GGTCTTCAGT AGCCAAAAAA CAAA ACAAAC 720 
AAAAACAAAA AAGCCG AAAT AAAAGAAAAA GATAATAACT CAOTTCTTAT TTGa^CTAC 780 
TTCAGTGGAC ACTGAATTTG GAAGGTGGAG GATTTTGTTT TTTTCTITTA AGATCTCGGC ^840 
ATCTTTTGAA TCTACCCTTC AAGTATTAAG AGACAGACTG TGAGCCTAGC AGGGCAGATC 900 
TTGTCCACCG TGTGTCTTCT TCTGCACGAG ACTTTGAGGC TGTCAG AGCG CTTTTIGCGT 960 
GGTTGCTCCC GCAAGTTTCC TTCTCTGGAG CTTCCCGCAG GTGGGCAGCT AGCTGCAGCG 1020 
ACTACCGCAT CATCACAGCC TGTTGAACTC TTCTGAGCAA ^ G ^ G ^^^^ l< Z n 
GGGAAGTAGG TGGAAG ATTC AGCCAAGCTC AAGGATGGAA C3TGCAGTTAG GGCTCGGAAG U40 
GGTCTACCCT CGGCCGCCGT CCAAGACCTA CCGAGGAGCT TTCCAGAATC 7^CCAGA^G1200 
CGTGCGCGAA GTGATCCAGA ACCCGGGCCC CAGGCACCCA G AGGttCG A GttC AGC ACC 1260 
TCXXX3GCGCC AGTTTGCTGC TGCTGCAGCA GCAGCAGCAG CAGCAGCAGC AGCAGCAGCA 1320 
GCAGCAGCAG CAGCAGCAGC AGCAGCAAG A GACTAGCCCC AGGCAGCAGCAGCAGCAGCA H 
GGGTGAGGAT GGTTCTCCCC AAGCCCATCG TAGAGGCCCC ACAGGCTACC TCCKCCTGG A 1440 
TGAGG AACAG CAACCTTCAC AGCCGCAGTC GGCCCTGGAG TGCCACCCCG AGj\GAGGTTG 500 
CGTCCCAGAG CCTGGAGCCG CCGTGGCCGC CAGCAAGGGG CTGCCGCAGC AGCTGCCAGC 1560 
ACCTCCGGAC GAGGATGACT CAGCTGCCCC ATCCACGTTG TCCCTGCTGG GCCCCACm !620 
CCCCGGCTTA AGCAGCTGCT CCGCTG ACCT TAAAGACATC CTGAGCGAGG CCAGCACCAT 1 680 
GCAACTCCTT CAGCAACAGC AGCAGGAAGC AGTATCCGAA GGCAGCAGCA GCGGGAGAGC 1740 
GAGGGAGGCC TCGGGGGCTC CCACTTCCTC CAAGGACAAT T ACTTAG GGG GCACTTCG AC 1800 
CATTTCTGAC AACGCCAAGG AGTTGTGTAA GGCAGTGTCG G TGTCX1ATG G G CCTGGGTGT 8 60 
GGAGGCGTTG GAGCATCTGA GTCCAGGGGA ACAGCTTCGG G ^ GG ^^j2^^^f^^9 . n J?? 
ACmTGGGA GTTCCACCCG CTGTGCGTCC CACTCCTTGT ^^I^2r^?^7^^ Z n 
AGGTTCTCTG CTAGACGACA GCGCAGGCAA GAGCACTGAA GATACH^CTG ACTATTCOCC ^2040 
TTTCAAGGGA GGTTACACCA AAGGGCTAGA AGGCG AG AGC CTAGGCTCCT CTG GCAGCGC 2 100 
TGCAGCAGGG AGCTCCGGGA CACTTGAACT GCCGTCTACC CTGTCTCTCT ACAAGTCCGG 2160 
AGCACTGGAC GAGGCAGCTG CGTACCAGAG TCGCGACTAC TACAACTTTC CACTGGCTCT 2220
GGCCGGACCG CCGCCCCCTC CGCCGCCTCC CCATCCCCAC GCTCGCATCA AGCTGGAGAA 2280 
CCCGCTGGAC TACGGCAGCG CCTGGGCGGC TGCGGCGGCG CAGTGCCGCT ATGGGG ACCT 2340 
GGCGAGCCTG CATGGCGCGG GTGCAGCGGG ACCCGGTTCT GGGTCACCCT CAGCCGCCGC 2400 
5 TTCCTCATCC TGGCACACTC TCTTCACAGC CG AAGAAGGC CAGTTGTATG GACCGTGTGG 2460 

TGGTGGTGGG GGTGGTGGCG GCGGCGGCGG CGGCGGCGGC GGCGGCGGCG GCGGCGGCGO 2520 
CGGCGGCGGC G AGGCGGGAG CTGTAGCCCC CTACGGCTAC ACTCGGCCCC CTCAGGGGCT 2580 
GGCGGGCCAG GAAAGCGACT TCACCGCACC TGATGTGTGG TACCCTGGCG GCATGGTGAG 2640 
cagagtgccc tatcgcagtc ccacttgtot CAAAAGCGAA ATGGGCCCCT GQAT GGATAO 2700 

1 0 CTACTCCGGA CCTTACGGGG ACATGCGTTT GG AGACTGCC AGGGACCATG TTTTGCCCAT 2760 
TGACTATTAC ITTCCACCCC AG AAGACCTG CCTGATCTGT GGAOATGAAG CTTCTGGGTG 2820 
TCACTATGGA GCTCTCACAT GTGGAAGCTG CAAGGTCTTC TTCAAAAGAG CCGCTGAAGG 2880 
OAAACAGAAG TACCTGTGCG CCAGCAG AAA TGATTGCACT ATTGATAAAT TCCGAAGGAA 2940 
AAATTGTCCA TCTTGTCX3TC TTCGGAAATG TTATGAAGCA GGGATGACTC TGGGAGCCCG 3000 

1 5 GAAGCTGAAG AAACTTGGTA ATCTGAAACT ACAGGAGGAA GGAGAGGCTT CCAGCACCAC 3060 
CAGCCCCACT GAGGAGACAA CCCAGAAGCT GACAGTGTCA CACATTGAAG GCTATGAATG 3120 
TCAGCCCATC TTTCTGAATG TCCTGGAAGC CATTGAGCCA GGTGTAGTGT GTGCTGGACA 3180 
CG ACAACAAC CAGCCCGACT CCTTTGCAGC CTTGCTCTCT AGCCTCAATG AACTGGOAGA 3240 
GAGACAGCTT GTACACGTGG TCAAGTGGGC CAAGGCCTTG (XTGGCTTCC GCAACTTACA 3300 

20 CGTGGACGAC CAGATGGCTG TCATTCAGTA CTCCTGGATG GGGCTCATGG TGTTTGCCAT 3360 
GGGCTGGCG A TCCTTCACCA ATGTC AACTC CAGGATGCTC TACTTCGCCC CTOATCTGGT 3420 
TTTCAATGAG TACCGCATGC ACAAGTCCCQ OATGTACAGC CAGTGTGTCC GAATGAGGCA 3480 
CCTCTCTCAA GAGTTTGGAT GGCTCCAAAT CACCCCCCAG GAATTCCTGT GCATGAAAGC 3540 
ACTGCTACTC TTCAGCATTA TTCCAGTGGA TGGGCTGAAA AATCAAAAAT TCTTTGATGA 3600 

25 ACTTCGAATG AACTACATCA AGGAACTCG A TCGTATCATT GCATGCAAAA GAAAAAATCC 3660 
CACATCCTGC TCAAGACGCT TCTACCAGCT CACCAAGCTC CTGG ACTCCG TGCAGCCTAT 3720 
TGCG AG AG AG CTGCATCAGT TCACTTTTGA CCTGCTAATC AAGTCACACA TGGTGAGCGT 3780 
GGACTTTCCG GAAATGATGG CAGAGATCAT CTCTGTGCAA GTGCCCAAGA TCCTTTCTGG 3840 
GAAAGTCAAG CCCATCTATT TCCACACCCA GTGAAGCATT GG AAACCCTA TTTCCCCACC 3900 

30 CCAGCTCATG CCCCCTTTCA GATGTCTTCT GCCTGTTATA ACTCTGCACT ACTCCTCTGC 3960 
AGTGCCTTGG GGAATTTCCT CTATTGATGT ACAGTCTGTC ATGAACATGT TCCTGAATTC 4020 
TATTTGCTGG GCTTTTTTTT TCTCTTTCTC TCCTTTCTTT TTCTTCTTCC CTCCCTATCT 4080 
AACCCTCCCA TGGCACCTTC AG ACTTTGCT TCCCATTGTG GCTCCTATCT GTGTTTTGAA 4140 
TGGTGTTGTA TGCCTTTAAA TCTGTGATGA TCCTCATATG GCCCAGTGTC AAGTTGTGCT 4200 

3 5 TGTTTACAGC ACTACTCTGT GCCAGCCACA C A AACGTTTA CTTATCTTAT GCCACGGGAA 4260 

GTTTAGAGAG CTAAGATTAT CTGGGG AAAT CAAAACAAAA AACAAGCAAA CAAAAAAAAA 4320 
A 






Seq ID NO: 94 Protein sequence:

Protein Accession #: NP_000035.1

1 11 21 31 41 51

MEVQLGLGRV YPRPPSKTYR GAFQNLFQSV REVIQNPGPR HPEAASAAPP GASLLLLQQQ 60
QQQQQQQQQQ QQQQQQQQET SPRQQQQQQG EDGSPQAHRR GPTGYLVLDE EQQPSQPQSA 120
LECHPERGCV PEPGAAVAAS KGLPQQLPAP PDEDDSAAPS TLSLLGPTFP GLSSCSADLK 180
DILSEASTMQ LLQQQQQEAV SEGSSSGRAR EASGAPTSSK DNYLGGTSTI SDNAKELCKA 240
VSVSMGLGVE ALEHLSPGEQ LRGDCMYAPL LGVPPAVRPT PCAPLAECKG SLLDDSAGKS 300
TEDTAEYSPF KGGYTKGLEG ESLGCSGSAA AGSSGTLELP STLSLYKSGA LDEAAAYQSR 360
DYYNFPLALA GPPPPPPPPH PHARIKLENP LDYGSAWAAA AAQCRYGDLA SLHGAGAAGP 420
GSGSPSAAAS SSWHTLFTAE EGQLYGPCGG GGGGGGGGGG GGGGGGGGGG GGEAGAVAPY 480
GYTRPPQGLA GQESDFTAPD VWYPGGMVSR VPYPSPTCVK SEMGPWMDSY SGPYGDMRLE 540
TARDHVLPID YYFPPQKTCL ICGDEASGCH YGALTCGSCK VFFKRAAEGK QKYLCASRND 600
CTIDKFRRKN CPSCRLRKCY EAGMTLGARK LKKLGNLKLQ EEGEASSTTS PTEETTQKLT 660
VSHIEGYECQ PIFLNVLEAI EPGVVCAGHD NNQPDSFAAL LSSLNELGER QLVHVVKWAK 720
ALPGFRNLHV DDQMAVIQYS WMGLMVFAMG WRSFTNVNSR MLYFAPDLVF NEYRMHKSRM 780
YSQCVRMRHL SQEFGWLQIT PQEFLCMKAL LLFSIIPVDG LKNQKFFDEL RMNYIKELDR 840
IIACKRKNPT SCSRRFYQLT KLLDSVQPIA RELHQFTFDL LIKSHMVSVD FPEMMAEIIS 900
VQVPKILSGK VKPIYFHTQ



Seq ID NO: 95 DNA sequence 

Nucleic Acid Accession #: NM_002497 

Coding sequence: 135-1472 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51

GGCACGAGTA GGGGTCGCGG GTCAGTGCTG CTCGGGGGCT TCTCCATCCA GGTCCCTGGA 60 
GTTCCTGGTC CCTGGAGCTC CGCACTTGGC GCGCAACCTG CGTGAGQCAO CGCGACTCTG 120 
GCGACTGGCC GGCCATGCCT TCCCGGGCTO AGGACTATGA AGTGTTGTAC ACCATTGGCA 180 

70 CAGGCTCCTA CGGCCGCTGC CAOAAG ATCC GGAGGAAGAG TGATGGCAAG ATATTAGTTT 240 
GGAAAGAACTTGACTATGGC TCCATGACAG AAGCTGAGAA ACAGATGCTT GTTTCTGAAG 300 
TG AATTTGCT TCGTGAACTG AAACATCCAA ACATCGTTCG TTACTATG AT CGGATTATTG 360 
ACCGGACCAA TACAACACTG TACATTGTAA TGG AATATTG TGAAGGAGGG OATCTGGCTA 420 
GTGTAATTAC A AAGGG AACC AAGG AAAGGC AATACTTAG A TGAAG AGTTT GTTCTTCGAG 480 

75 TGATG ACTCA GTTG ACTCTG GCCCTGAAGG AATGCCACAG ACGAAGTGAT GGTGGTCATA 540 
CCGTATTGCA TCGGGATCTT AAACCAGCCA ATGTTTTCCT GGATGGCAAG CAAAACGTCA 600 
AGCTTGGAGA CTTTGGGCTA GCTAGAATAT TAAACCATGA CACGAGTTTT GCAAAAACAT 660 
TTGTTGGCAC ACCTTATTAC ATGTCTCCTG AACAAATGAA TCGCATGTCC TACAATGAGA 720
AATCAGATAT CTGGTCATTG GGCTGCTTGC TGTATG AGTT ATGTGCATTA ATGCCTCC AT 780 
TTACAGCTTT TAGCCAG AAA GAACTCGCTG GG AAAATCAG AGAAGGCAAA TTCAGGCGAA 840 
TTCCATACCG TTACTCTG AT GAATTGAATG AAATTATTAC G AGGATGTTA AACTTAAAGG 900 
5 ATT ACCATCG ACCTTCTGTT G AAGAAATTC TTG AG AACCC TTTA ATAGCA G ATTTGGTTG 960 

CAGACGAGCA AAGAAGAAAT CTTGAGAGAA GAGGGCGACA ATTAGGAGAG CCAGAAAAAT 1020 
CGCAGGATTC CAGCCCTGTA TTGAGTGAGC TGAAACTGAA GGAAATTCAG TTACAGGAGC 1080 
GAGAGCGAGC TCTCAAAGCA AGAGAAGAAA GATTGGAGCA GAAAGAACAG GAGCTTTGTG 1 140 
TTCGTGAGAG ACTAGCAGAG GACAAACTGG CTAGAGCAGA AAATCTGTTG AAGAACTACA 1200 

10 GCTTGCTAAA GG AACGGAAG TTCCTGTCTC TGGCAAGTAA TCCAG AACTT CTTAATCTTC 1260 
CATCCTCAGT AATTAAGAAG AAAGTTCATT TCAGTGGGGA AAGTAAAGAG AACATCATGA 1320 
GGAGTG AGAA TTCTGAG AGT CAGCTCACAT CTAAGTCCAA GTGCAAGGAC CTG AAGAAAA 1380 
GGCTTCACGC TGCCCAGCTG CGGGCTCAAG CCCTGTCAG A TATTG AG AAA AATTACCAAC 1440 
TGAAAAGCAG ACAGATCCTG GGCATGCGCTAGCCAGGTAG AGAGACACAG AGCTGTGTAC 1500 

1 5 AGGATGTAAT ATTACCAACC TTTAAAG ACT GATATTCAAA TGCTGTAGTG TTG AATACTT 1560 
GGCCCCATGA GCCATGCCTT TCTGTATAGT ACACATGATA TTTCGGAATT GGTTTTACTG 1620 
TTCTTCAGCA ACTATTGTAC AAAATGTTCA CATTTAATTT TTCTTTCTTC TTTTAAGAAC 1680 
ATATTATAAA A AG AATACTT TCTTGGTTGG GCTTTTAATC CTGTGTGTGA TTACTAGTAG 1740 
GAACATGAG A TGTGACATTC TAAATCTTGG GAGAAAAAAT AATATTAGGA AAAAAATATT 1800 

20 TATGCAGGAA GAGTAGCACT CACTGAATAG TTTTAAATGA CTGAGTGGTA TGCTTACAAT 1860 
TGTCATGTCT AGATTTAAAT TTTAAGTCTG AGATTTTAAA TGTTTTTGAG CTTAGAAAAC 1920 
CCAGTTAGAT GCAATTTGGT CATTAATACC ATGACATCTT GCTTATAAAT ATTC CATTGC 1980 
TCTGTAGTTC AAATCTGTTA GCTTTGTG AA AATTCATCAC TGTGATGTTT GTATTCTTTT 2040 
XTTTTTTCTG TTTAACAGAA TATGAGCTGT CTGTCATTTA CCTACTTCTT TCCCACTAAA 2100 

25 TAAAAG AATT CTTCAGTTA 

Seq ID NO: 96 Protein sequence:
Protein Accession #: NP_002488

1 11 21 31 41 51

GIREFNPNIM ANEVERINMI TOSISGENEA RELATEDKIN ASEHOMOSAP IENSMPSRAE 60
DYEVLYTIGT GSYGRCQKIR RKSDGKILVW KELDYGSMTE AEKQMLVSEV NLLRELKHPN 120
IVRYYDRIID RTNTTLYIVM EYCEGGDLAS VITKGTKERQ YLDEEFVLRV MTQLTLALKE 180
CHRRSDGGHT VLHRDLKPAN VFLDGKQNVK LGDFGLARIL NHDTSFAKTF VGTPYYMSPE 240
QMNRMSYNEK SDIWSLGCLL YELCALMPPF TAFSQKELAG KIREGKFRRI PYRYSDELNE 300
IITRMLNLKD YHRPSVEEIL ENPLIADLVA DEQRRKLERR GRQLGEPEKS QDSSPVLSEL 360
KLKEIQLQER ERALKAREER LEQKEQELCV RERLAEDKLA RAENLLKNYS LLKERKFLSL 420
ASNPELLNLP SSVIKKKVHF SGESKENIMR SENSESQLTS KSKCKDLKKR LHAAQLRAQA 480
LSDIEKNYQL KSRQILGMR






Seq ID NO: 97 DNA sequence 

Nucleic Acid Accession #: NM_007050.2

Coding sequence: 185-4576 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51 

CCTCCCGCCT CAGTTCGCGC CGCGCCTCGG CTTGGAACGC AGGAGCGCCG GCTCCGGGAG 60 
CCCG AGCGG A GCCAGCCGCG CGCACAGCCA GCGGCCGCGC CGGCG ATGCG GGGCCACCCC 120 
50 GCGCCCGCCC CAGTCCCGGC CCCGGCCCCC GCGGGAAGGG GCTGAGCTGC CCGCCGCCGC 180 
CCGGATGGCG AGCCTCGCCG CGCTCGCCCT CAGCCTGCTC CTGAGGCTGC AGCTGCCGCC 240 
ACTGCCCGGC GCCCGGGCTC AGAGCGCCCC AGGTGGCTGT TCCTTTGATG AGCACTACAG 300 
CAACTGTGGT TATAGTGTGG CTCTAGGGAC CAATGGGTTC ACCTGGGAGC AGATTAACAC 360 
AACGGAGAAA CCAATGCTGG ACCAGGCAGT GCCCACAGGA TCTTTCATGA TGGTGAACAG 420 
5 5 CTCTGGG AG A GCCTCTGGCC AGAAGGCCCA CCTTCTCCTG CCAACCCTG A AGGAG AATGA 480 
CACCC ACTGC ATCG ACTTCC ATTACTACTT CTCCAGCCGT GACAGGTCCA GCCCAGGGGC 540 
CTTGAACGTC TACGTG AAGG TGAATGGTGG CCCCCAAGGG AACCCTGTGT GG AATGTGTC 600 
CGGGGTCGTC ACTGAGGGCT GGGTGAAGGC AGAGCTCGCC ATCAGCACTT TCTGGCCACA 660 
TTTCTATCAG GTGATATTTG AATCCGTCTC ATTGAAGGGT CATCCTGGCT ACATCGCCGT 720 
60 GGACGAGGTC CGGGTCCTTG CTCATCCATG CAGAAAAGCA CCTCATTTTC TGCGACTCCA 780 
AAACGTGGAG GTGAATGTGG GGCAGAATGC CACATTTCAG TGCATTGCTG GTGGGAAGTG 840 
GTCTCAGCAT GACAAGCTTT GGCTCCAGCA ATGGAATGGC AGGGACACGG CCCTGATGGT 900 
CACCCGTGTG GTCAACCACA GGCGCTTCTC AGCCACAGTC AGTGTGGCAG ACACTGCCCA 960 
GCGGAGCGTC AGCAAGTACC GCTGTGTGAT CCGCTCTG AT GGTGGGTCTG GTGTGTCCAA 1020 
65 CTACGCGGAG CTGATCGTGA AAGAGCCTCC CACGCCCATT GCTCCCCCAG AGCTGCTGGC 1080 
TGTGGGGGCC ACATACCTGT GG ATCAAGCC AAATGCCAAC TCCATCATCG GGG ATGGCCC 1 140 
CATCATCCTG AAGGAAGTGG AATATCGCAC CACCACAGGC ACGTGGGCAG AGACCCACAT 1200 
AGTCG ACTCT CCCAACTATA AGCTGTGGCA TCTGGACCCC GATGTTGAGT ATGAG ATCCG 1260 
AGTGCTCCTC ACACGACCAG GTGAGGGGGG TACGGGACCG CCAGGGGCTC CCCTCACCAC 1320 
70 CAGGACCAAG TGTGCAGATC CGGTACATGG CCCACAGAAC GTGGAAATCG TAGACATCAG 1380 
AGCCCGGCAG CTGACCCTGC AGTGGGAGCC CTTCGGCTAC GCGGTGACCC GCTGCCATAG 1440 
CTACAACCTC ACCGTGCAGT ACCAGTATGT GTTCAACCAG CAGCAGTACG AGGCCGAGGA 1500 
GGTCATCCAG ACCTCCTCCC ACTACACCCT GCGAGGCCTG OCCCCCTTCA TG ACCATCCG 1560 
GCTGCGACTCTTGCTGTCTA ACCCCGAGGG CCGAATGGAG AGCGAGG AGC TGGTGGTGCA 1620 
75 GACTGAGGAA GACGTTCCAG G AGCTGTTCC TCTAGAATCC ATCCAAGGGG GGOX^TTGA 1680 
GGAGAAGATC TACATCCAGT GGAAACCTCC CAATGAGACC AATGGGGT^ TCA^CTCTA 1740 
CG AG ATCAAC TACAAGGCTG TCGGCTCGCT GGACCCAAGT GCTGACCTCT CGAGCCAGAG 1 800 
GGCGAAAGTG TTCAAGCTCC GGAATGAAAC CCACCACCTC TTTGTGGGTC TOTACCCAGG 1860
GACCACCTAT TCCTTCACCA TCAAGGCCAG CACAGCAAAG GGCTTTGGGC CCCCTGTCAC 1920 
CACTCGGATT GCCACCAAAA TTTCAGCTCC ATCCATGCCT OAGTACG ACA CAGACACCCC 1980 
ATTGAATGAG ACAGACACGA CCATCACAGT G ATGCTGAAA CCCGCTCAGT CCCGGGGAGC 2040 
5 TCCTGTCAGT GTTTATCAGC TGGTTGTCAA GGAGGAGCGA CTTCAGAAGT CACGG AGGGC 2 1 00 
AGCTGACATT ATTGAGTGCT TTTCGGTGCC CGTGAGCTAT CGOAATGCCT CCAGCCTCG A 2160 
TTCTCTACAC TACTTTGCTG CTGAGTTGAA GCCTGCCAAC CTGCCTGTCA CCCAGCCATT 2220 
TACAGTGGGT GACAATAAG A CATACAATGG CTACTGGAAC CCTCCTCTCT CTCCCCTGAA 2280 
AAGCTACAGC ATCTACTTCC AGGCACTCAG CA AAGCCAAT GG AGAGACCA AAATCAACTG 2340 

1 0 TGTTCGTCTG GCTACAAAAG CACCAATGGG CAGCGCCCAG GTGACCCCGG GG ACTCCACT 2400 
CTGCCTCCTC ACCACAGGTG CCTCCACCCA GAATTCTAAC ACTGTGGAGC CAG AGAAGCA 2460 
GGTGGACAAC ACCGTGAAGA TGGCTGGCGT GATCGCTGGC CTCCTCATGT TCATCATCAT 2520 
TCTCCTGGGC GTGATGCTCA CCATCAAAAG GAG AAGAAAT GCTTATTCCT ACTCCTATTA 2580 
CTTGTCCCAA AGOAAGCTGG CCAAGAAGCA GAAGGAGACC CAGAGTGGAG CCCAGAGGGA 2640 

1 5 GATGGGGCCT GTGGCCTCTG CCGACAAACC CACCACCAAG CTCAGCGCCA GCCGCAATGA 2700 
TGAAGGCTTC TCTTCTAGTT CTCAGG ACGT CAACGGATTC ACAGATGGCA GCCGCGGOGA 2760 
GCTTTCCCAG CCCACCCTCA CGATCCAGAC TCATCCCTAC CGCACCTGTG ACCCTGTGGA 2820 
G ATGAGCTAC CCCCGGGACC AGTTCCAACT CGCCATCCGG GTGGCTGACT TGCTGCAGCA 2880 
CATCACGCAG ATGAAGAGAG GCCAGGGCTA CGGGTTCAAG GAGGAATACX) AGGCCTTACC 2940 

20 AGAGGGGCAG ACAGCTTCXiT GGGACACAGC CAAGGAGGAT GAAAACCGCA ATAAGAATCG 3000 
ATATGGGAAC ATCATATCCT ACGACCATTC CCGGGTGAGG CTGCTGGTGC TGGATGGAGA 3060 
CCCGCACTCT GACTACATCA ATGCCAACTA CATTG ACGGA TACCATCGAC CTCGGCACTA 3 120 
CATTGCG ACT CAAGGTCCG A TGCAGG AG AC TGTAAAGG AC TTTTGG AG AA TG ATCTGGCA 3 1 80 
GGAGAACTCC GCCAGCATCG TCATGGTCAC AAACCTGGTG G AAGTGGGCA GGGTGA AATG 3240 

2 5 TGTGCG ATAC TGGCCAGATG ACACGG AGGT CTACGGAG AC ATTAAAGTCA CCCTGATTGA 3300 
AACAGAGCCC CTGGCAGAAT ACGTCATACG CACCTTCACA GTCCAGAAGA AAGGCTACCA 3360 
TG AGATCCGG GAGCTCCGCC TCTTCCACTT CACCAGCTGG CCTGACCACG GCGTTCCCTG 3420 
CTATGCCACT GGCCTTCTGG GCTTCGTCCG CCAGGTCAAG TTCCTCAACC CCCCGGAAGC 3480 
TGGGCCCATA GTGGTCCACT GCAGTGCTGG GGCTGGGCGG ACTGGCTGCT TCATTGCCAT 3540 

3 0 TGACACCATG CTTGACATGG CCGAGAATG A AGGGGTGGTG GACATCTTCA A CTOCG TGCG 3600 
TGAGCTCCGG GCCCAAAGGG TCAACCTGGT ACAGACAGAG GAGCAATATG TGTTTGTGCA 3660 
CGATGCCATC CTGG AAGCGT GCCTCTGTGG CAACACTGCC ATCCCTGTGT GTGAGTTCCG 3720 
TTCTCTCTAC TACAATATCA GCAGGCTGGA CCCCCAGACA AACTCCAGCC AAATCAAAGA 3780 
TG AATTTCAG ACCCTCAACA TTGTG ACACC CCGTGTGCGG CCCGAGGACT GCAGCATTGG 3840 

3 5 GCTCCTGCCC CGGAACCATG ATAAGAATCG AAGTATGGAC GTGCTGCCTC TGGACCGCTG 3900 
CCTGCCCTTC CTTATCTCAG TGG ACGG AG A ATCCAGCAAT TACATCAACG CAGCACTGAT 3960 
GGATAGCCAC AAGCAGCCTG CCGCCTTCGT GGTCACCCAG CACCCTCTAC CCAACACCGT 4020 
GGCAGACTTC TGGAGGCTGG TGTTCGATTA CAACTGCTCC TCTGTGGTGA TGCTGAATGA 4080 
GATGGACACT GCCCAGTTCT GTATGCAGTA CTGGCCTGAG AAGACCTCCG GGTGCTATGG 4140 

40 GCCCATCCAG GTGGAGTTCG TCTCCGCAGA CATCOACG AO GACATCATCC AC AG A AT ATT 4200 
CCGCATCTGT AACATGGCCC GGCCACAGGA TGGTTATCGT ATAGTCCAGC ACCTCCAGTA 4260 
CATTG GCTGG CCTGCCTACC GGGACACGCC CCCCTCCAAG CGCTCTCTGC TCAAAGTGGT 4320 
CCGACGACTG GAOAAGTGGC AGGAGCAGTA TGACGGGAGG GAGGGACGTA CTGTGGTCCA 4380 
CTGCCTAAAT GGGGG AGGCC GTAGTGGAAC CTTCTGTGCC ATCTGCAGTG TGTGTGAGAT 4440 

45 G ATCCAGCAG CAAAACATCA TTGACGTGTT CCACATCGTG AAAACACTGC GTAACAACAA 4500 
ATCCAACATG GTGGAGACCC TGGAACAGTA TAAATTTGTA TACG AOGTGG CACTGGAATA 4560 
TTTAAGCTCC TTTTAGCTCA ATGGGATGGG GAACTGCCGG AGTCCAGAGG CTGCTGTGAC 4620 
CAAGCCCOCT TTTGTGTGAA TGGCAGTAAC TGGGCTCAGG AGCTCTGAGG TGGCACCCTG 4680 
CCTGACTCCA AGGAGAAGAC TGGTGGCCCT GTGTTCCACG GGGGGCTCTG CACC TTCTGA 4740 

50 GGGGTCTCCT GTTGCCGTGG GAGATGCTGC TCCAAAAGGC CCAGGCTTCC TTTTCAACCT 4800 
AACCAGCCAC AGCCAAGGGC CCAAGC AGAA GTACACCCAC AAGCAAGGCC TTGGATTTCT 4860 
GGCTCCCAGA CCACCTGCTT TTGTTCTGAG TTTGTGGATC TCTTGGCAAG CCAACTGTGC 4920 
AGGTGCTGGG GAGTGGGAGG CTCCCCTGCC CTCCTTCTCC TTAGGAGTGG AGGAGATGTG 4980 
TGTTCTGCTC CTCTACGTCA TGGAAAAGAT TGAGGCTCTT GGGGGTCACT GCTCTGCTGC 5040 

55 CCCCTGCAAC CTCCTTCAGG GGCCTCTGGC ACCAG ACATT TGCAGTCTGG ACCAGTGTGA 5100 
CCTTACGATG TTCCCTAGGC CACAAGAGAG GCCCCCCATC CTCACACCTA ACCTGCATGG 5160 
GGCTTCGCCC ACAACCATTC TGTACCCCTT CCCCAGCCTG GGCCTTG ACC GT CCAGCA TT 5220 
CACTGGCCGG CCAGCTGTGT CCACAGCAGT TTTTG ATAAA GGTGTTCTTT GCTTTTTTGT 5280 
GTGGTCAGTG GGAGGGGGTG GAACTGCAGG GAACTTCTCT GCTCCTCCTT GTCTTTGTAA 5340 

60 AAAGGGACCA CCTCCCTGGG GCAGGGCTTG GGCTGACCTG TAGGATGTAA CCCCTGTGTT 5400 
TCTTTGGTGG TAGCTTTCTT TGGAAGAGAC AAACAAGATA AGATTTGATT ATTTTCCAAA 5460 
GTGTATGTGA AAAG AAACTT TCTTTTGG AG GQTGTAAAAT CTTAGTCTCT TATGTCAAAA 5520 
AGAAGGGGGC GGGGGAGTTT GAGTATGTAC CTCTAAGACA AATCTCTCGG GCCTTTTATT 5580 
TTTTCCTGGC AATGTCCTTA AAAGCTCCCA CCCTGGGACA GCATGCCACT GAGCAAGGAG 5640 

6 5 AG ATGGGTG A GCCTG AAG AT GGTCCCTTTG GTTTCTGGGG CAAATAG AGC ACCAGCTTTG 5700 
TGCATAATTT GGATGTCCAA ATTTGAACTC CTTCCTAAAG AAACCCAGCA GCCACCTTGA 5760 
AAAAGGCCAT TGTGG AGCCC ATTATACTTT GATTTAAAAT AGGCCAAGAG AATCAGGCCT 5820 
GGAGATCTAG GGTCTTGTCC AAAGTGTGAG TGAGTCAATG AGAGGGAACC AACATTTGCT 5880 
AAGTCTCTAC TGTATGCCAG GG ATCATGCT TGGCACTTTC CATAGGACAT TTCACACAGT 5940 

CCTTAGAACC CCCAGGAGAG AGCTACTGAC TTGTTATCAT CTCCATTTGA TCATCTCCTC 6000
CAATGAGGAA ACCCACGCAC CTTCCTTAGT AATGAAATCC TGGGTTCCAA AGGGGCAGGT 6060
AATGGCAATG AGACTTCTCC GTGCTGTTTT CTTCATCTTC TCTAAGCCAA GCAATTATTT 6120
TATGGAGGGA AAATAAGGCC AGAAACTTCT GAGCAGATAA CTCCACAAAT GOAAATTTAG 6180
TACTTTCTTC CTGATGCCAG TTCTTCTGGG AAGCGCAGAA TTTCAGATAT ATTTTAGTAA 6240

CACATTCCCA GCTCCCCAGG AAAGCCAGTC TCATCTAATT TCTTAGTCAG TAAAAACAAT 6300
TCCCTGTTCC TTCAGGCTAT GAATGGACCA GCCAGGGAAA CTCTCGACCT TGATCTCTAG 6360
CCAGTGCTTA GGCCCAATAT CTGACAGCCT CAGGTGGGCT GGGACCTAGG AAGCTCCATC 6420
TTGAAGGCTG GTCTAGCCCC AGACAGGGCA TGAGGGGC^G AG^TTCAAG i^GGTACAGC 6480
TTTGGCCCTC AAGAGOCCAC TGTATGCTGG GGAAATGGAA CCATGGTGCA GTAGTOTGGA 6540 
GTCGATGAGT GTTCCATGAG CCTAGGAGCA AGAAAGTCTC TTOBWCTCa '^^n^^J 6 ^,. 
AGAAGGGGAC GTCCATTCCT GCTGGGTCTT AACAAGCATA AAAAGGAAAA AAAGGAAACT 6660
CAGGCAAAGG GATCCATATG TGCAATGGCA AAGAAATGTG AAAAGGCATT GGGAGAAGCA 6720 
CTOTG^A GGCCAGCCCA GTGCGGGCAC AGCACAACAC AAGAGATGAG 
CCAGGGTCCA GGAGACAGAT GCCCATCGCG AGTACAGACT TTGTCCTATT GGCAACAAGG 6840 
AGTCCATGGA GCTTTAGAGA G ATGCACTCA GCTTCGTGTT GGCCAAGACT OCITCTOQaC 6900 
CAATGGGGCT GCCTCTTTTC CTTTCATCAG ACACTGTGAA AACATTOCCT TAAGCGTCCA 6960 
CTTTTTAATA TCACATCTAT TTGTCTGTCT GCTCATTGTT TTGTTGCTGG AACTAAATAT 7020 
SEKrc ^TCAG^TCA GATTCTATGAGAAACCCAGG GTCICTGCTTTAOCACGGAG 7080 
CAGGGTCACC AACCCAGATC TCCAGGCCCA TGAGGATGGA ACATGAAAGG AGC03ACAAA TWO 
AGTTGCTTCC ATTGGCATGG GCTCTGGAGC TGTCCAGAAG TCCAGGGACA CCAGACTTGA 7200 
TCAAGGAAGG GCTGTCACTT TAGAGGTTCA AAAGGAAGTG CCTCAAAGCA AAGGCAAGCA 7260
AAGGAACCCC ACGATGAACT TGCTCTTTTC CTTTGATGAG CCTCTCCCCA GGTGTATTTC 7320 
AGCAGACCCC GGGGACCCAC CCCCACTGGG CCTGCTGGCC TCCCTCGGCT CCAGCCCAAT 7380 
GCCCCAGCTG GCCTTCCCCA GCCTGCAAGG AGCCTGTAGC ATGGCAAATC TQOCTOCTOT 7440 
ATGCTATTTT CTTAGATCTT GGTACATCCA GACAGGATGA GGGTGGAGGG AGAGCTATTT 7500
AACACAAATX^CTAAGATTTT TTTCTCCTCA GGAAGGOGTG ^^^i^^jSS^P^J^^^ TWi 

afiacagtggc ttttatcatt ttaaatggta ggaatttaag gtgtgacttc agggagaaac 7620 
aa^ctVg^aI^aaWtctcag 

GATTTCCCCC AGCTCATCCC CTTATTTTCC CACAACCCAA CCATTCTCTA AAOOJOOACA ^7740 
GTGAATAGGT CTTAGGCCAG TGCACACAGG AAGAAATTGA GGCTTATGGA TGGGQATGAC 7800
TTCCCTAACA TCCCATGGGA CAAGGATGTG GCAAGGCTTG GATGAGATGG GGCACCAGTG 7860 
CCC AGC^VATT TG AAC ATTTT CCTTTACCCA GGAAATCTCC GGAGCCAACA CCACCACCCC 7920 
SgGGGGTCT CCCCAcScA CCCCATTTAC AGGGTGAGCT CAGOCTGTCA TGAGCAGAGG 7980 
AAAATATTAT TAATGCTCTC TGAGTCTTTA CAACAGGAGC TCTTACCTCA TAGATGTGGG 8040 
CTCTGmGG GGAAGATGCA AGGAAGTAAT GAGAAGCCCA GGAAATTTCTCCACCTGTGT 8100 
TTATGGCCTA AATAGCTTCA GGATGTATCT TAGCTGCACT CCAACATTGC ATCCTTTCTG 8160
GGCTGA^VcTGGGCCAA CCAGGGGTCC TTGGGCCTCT AGAAGGCCAC AGTAGGCCTC 8220 
T^GTCGG AATGGAAGGG GACAGTTTGC TTTTAGTGCT GGCCCTCTCT GTGGGTGTGG 8280 
^CAaXgG ^AACAGA CCCTATGCTG GGGACTCTAA CATGTGAGCT CATTAAATTC 8340 
TTCCAGCATT CTAAAGGAGG GTTTGTGATT GTCACCATTT ACTGATCAGG AAACTAAGGC 8400 
TCCTAGGGGA GAAATCACTT GCCCACAGTT CCACAGCTAG TGAGTGAATG AACCAGGATT 8460
TAAAKGGTT TTTTCTCACT ACAGAG ACAA TATTTTTCCA CCATTGTATC TCACATTnT 8520 
^GGAGGT TACCCATAAC AO AAG AGACT AGAGTGGAAC^WATACGTCA GTGGATAAAG 8580 
CTCAAAGCAA ACAACAGTAA GCTTAAAATT CCTTCATAGT CTCATGTTTT ACGTTCACAA 8640 
TTCATGCAAA ATTTCCATTC CACTTTCTGA TTTAGCCTTG TTGGTTTTAA TATGACTCTA 8700 
TGAATATTTC AAAAAAAAAT GTGCTCTGTT CCTCATGTTG TTCTGTTCTG TTCACCCCGC 8760
TATCACGGAC CCTAGGTCAG CTGGTCTTCA GCTTGACCCT AGAATTGACT CTAGGAQCAO 8820 
TG ACCCTGCT GCCTCCCAGA GCCAGTTATA GGCTCAAGAT CAAGACCAAC TGAOCTTCTC 8880 
SaOTCAGCTCCTTTGGTGT GTGGGTGCTC TGACCTCACT GTTCATGAGG GGACCTCAAC 8940 
TAAGGCA?CT TCCAGTTCGG TGCTGGAAGG AACCCATTAA CTCACACTAG AATGATGAGG 9000 
ImGCTCATCTGGCGTGGA GAAGGATGAG CCCACAAAAC CCTAAAGGG A AAAGAGAAGC 9060 
TGGAC\CAGC^GTACTCAGC AGATTCCTGA ATGCTAGGCT GGAAAGTGGT GCCTGTTGTC 9120 
CAACTGGAGT CACATGGTTG CTAATGTGGG CAAGTCTGAGGAC^CACITCATCAGCAGCT 9180 
GGGGTCTGGA AGGCTCCTCA CTTTACCCTA GCCACACATA ATTACTGGGTGOCTACAGCA 9240 
^TAGCACCT TGGAGGGGGC ACTATTAGGA AATCGAGATT ACTATOGCAC AATTAATTCC 9300 
TMOTMGTC ATGGGGTTGT GGTGGACAGA GCTCAGTCTT TAGTTTGAAC GA^CATAC 9360 
ATACATCAAA AACATACATG AAAAAAOGAC CCTCATCAAC ATTAGAAGGG GTAGATJTGG 9420 
AGCACTTTAG GCAGGAAAAC AGGAACGCAA GGCCAGGAAA CTGGAACCCA GTCAATACTC 9480
AG AACCGAGG ATGCAGATGA CTTATTTAGC AAAATGGTCA CTTCTCiTGi^CATAGCrDSOAG 'f*0 

AAAGGATGGG TAACAGCTTG CCAGAGCCAC ^^^^^^^^tt^^SJ^SI^^^^fgia^ 
GCAAAAGATG ATGCATTTCC CTCTGACCCA TCATOTTTAT TCATCCTCCA CTCCOCATTO 9660 
CCACACTAGC TCTTGCTOTA AGTCCTCACC AGGATCTACA TTTCCTCGTC GCTGGTGGOA 9720 
ACCCCTTAGA GTACATAGAG GTATCAGTCC AGTAAGACTG CTCTACACAA CAGAAGTGAG 9780 
gSSgGOAG TAGCAGCCAG GCCCTTATCC TGTTACCTCT G^GGAGTGA CTCCCCAACC 9840 
CAGATCCAGA GACATTGAAG GAAATGATAA ™^^^™£TLf°*?£A 9900 

AGCATGAACA G^ACGC^TT CCTAAGTCAT ^^^^^^J^^^^^^^^^^^^^^qi^ 
AGGGAAAAAA ATGGTCAACC CAACTGGAGG GTAGGTTAGA AGAAAAAAAA CATAAACGTG 10140 
GATAGTCATG TCATCTCAAA TCCCTGACTT GGCTTCCCCA TTACTTG ACA GTCTGAGCrc IO20O 
CTTCnAGCC TGTGACCAGC TTCAAATCAC AGCCAAGTAA AACAAGGAAA TAGGA^GT 10260 
AAATCCAACT AGAAGAGACA AGCTGAGATT CAGATTTGTT TACTCCTCGC ATGCAAAGTT 10320 
TCCCTGTTGG AGGTTTTCCA TGTATACATG TCTAGAAGTG ATAGAATGCA AGGCCTTGGC 10380 
TTTGTCTTGC AGGGATCTGC CTTTGAGGTC ATAGACTGAA CAGCAGGGAG AGAGGTTAGT 10440
GOTGGAGTGT 3GGGGGAGCT GTTCTAGCTC CAGTTTCTTC TGACACATTT TTCAGGATCA 10500 
toStCTOATCCTCCGAAGC ACAGCAGAGA TAtCTAAGCC ATATTTGTGC ACATGAGCAG 10560 
A^TCTTCTAG ^StTTAGTA ACCAGGGATG GGCTTTTGCA TGGCACTOAC TATAGAGATG 10620 
TCTTGTAGAG ATCAAGCCAG TCTTTTGCAT CCCACCTGCC CACCTCCAGA AGAGATGGGA 10680
AAMG^TC AAAGGGCATT CACCAACTGA AATCCACTCA TGAATGTTAG GTCTCTAAAA 10740 
GGAGGCATCA ACACTCACAA TGGTAGCCTC CAAACCTAGC ATCCCACCTA TCTAAGAGCT 10800

cagSgtcc^ctggggcagatacaagggaagtgcaaggg^ 

CTATTGGGAA GAGTTTTAGG GGCTTGATCA TTATGGGGCT TCCTTCTATA TCTGAGAACT 10920 
GCTCTGGGTG GTG AG ATGTG GACTCTGATC CTTAATTGGA ATG^AG A^ACTGTC 40980 
TGGTGGCCTT GAAGTGTTGG ACAGAAAAGT ATCAGTATAA AAGCCTGGAG CTCAGGGTAA 11040
TTAATGTAGT TCATGGTTCC TTAGTGAGCA GOACTCTTGG ATGTGGAGGA GAAAGGGTCA 11100
TAGGAAGTAA ACCACCAAAA TTACAAAATT GAGTCTCTGT ACAATTACTT CAGTGCCTTT 11160
GGGCTTATGA ATACAAATCA GTGGGCCTTC TCTATGATGG TCCAACAAAC TCTCAGTGTC 11220
CACCCTGTCC CTGTATCTCC CATGGAAGAT GAATAATGTC AGGTGTTCTT TGGGTCAAAG 11280
GCCCCAGGGC AGTCTGGAGG CTTAGAGGGC AGAGTGGTGT CATTCCATGT AAAGTTAGGC 11340
TTCTGAGGGG TCAGGCAGAA TATGGTGTCC ATATCTTCCA TAGCTCTGCA GATTCTTGGA 11400
TGAAGTCAAG CACAGTTTGC TAGACCCAGG TCACTCCTCT GAGTATAACT AGGACCCATG 11460
AGTGAAACTT AATAGCTGTA AGGAAGAACC TGCTGTCTGC CAGAGAGGAT AAGCTGCCCA 11520
TCTCAGCAGC TGTCTAAAAG AAGGCAGGTG TCTCTTTAAA GGGAAGAGAA GCATTGGTGA 11580
AATGGATTTC AGGTCACTTC CATTCCAGAT GGGTGAGATC TTGTGGAGCT GGGATCATGT 11640
TTGAACTCAT TCATACCTGT AGAGCACGAA TCCAAGTAGA TTGTGTTTGG TCTGTACAGG 11700
CTGAAGCCCC CTGCTCTCCC ACCCAAGTGC CCCCACTGAG CAGGCCAACA TGCTGTTGTG 11760
GCCACATATA CTGGGCTGAT CCAGGCTGGT TATCACCAAA CAGCAAACCA TAGGGAACAG 11820
CTGCTTTGCC ATAGACCCAA TACCCATGTA GATCTCTCAT GAGAGCAGCC ATAACTCAGA 11880
CCCACTGACC AACAGGGCCA TGAGTGACAG CCAGAACCAG TGAAGGTCCA AGTAGGACAC 11940 
AGAGCAGGGC TTTTCTTACC ATACACATTA TCTCCAGAGG TTATTTCTAC CCCACTCCCT 12000 
ATTCAAGGCC TGTTGGAGCA CACTGCAAAA GCAAAAGCAC AGTAACTCAA TTTACACATG 12060 
ATTATAATCA TTTCCAGTGC ACACATTTCA TCACCAGGTG GATCCTGAGC TAGCCCATGT 12120 
AAATCCGGGT TAACCCATAT TGGTAATCAT ACTCAAAAGC ACTTTTCACC CTACATTCTA 12180 
CTAGCCAATC AAAGACAAAG AGTTGTGGCC TCTACCATTG CCTTGGCTTC TGGACACCCT 12240 
CACAAGCTAT CCCAAGGTTC CCGCTCAACT CCAGGGAGGC TGACATCTTC ACATCCACTG 12300 
GGCATATAAT ATTGCATGAG ACCAAAGTCT CCACACTCTT TGCAGCCTCC TCCATGAATC 12360 
CCAATGGCCT GCACTTGTAC AGTTTGGGTG TTTGATAGAT AAAGCACGTA TGAGAAGAGA 12420
AAACAAAATA AATCAACTTT TTAAAAAAGC CAGCACTGTG CTGTCAATGT TTTTTTTTTC 12480
TTTTCAATTC TAGCTCAGAA AAGCAGAAGG TAAATAATGT CAGGTCAATG AATATCAGAT 12540
ATATTTTTTG ACTGTACATT ACAGTGAAGT GTAATCTTTT TACACCTGCA AGTCCATCTT 12600 
ATTTATTCTT GTAAATGTTC CCTGACAATG TTTGTAATAT GGCTGTGTTA AAAAATCTAT 12660 
ACAATAAAGC TGTGACCCTG 

Seq ID NO: 98 Protein sequence: 
Protein Accession #: NP_00898U 

1 11 21 31 41 51

GIREFNPPRO TEINTYROSI NEPHOSPHAT ASERECEPTO RTYPETHOMO SAPIENSMAS 60 
LAALALSLLL RLQLPPLPGA RAQSAPGGCS FDEHYSNCGY SVALGTNGFT WEQINTTEKP 120 
MLDQAVPTGS FMMVNSSGRA SGQKAHLLLP TLKENDTHCI DFHYYFSSRD RSSPGALNVY 180 
VKVNGGPQGN PVWNVSGWT EGWVKAELAI STFWPHFYQV IFESVSLKGH PGYIAVDEVR 240 
VLAHPCRKAP HFLRLQNVEV NVGQNATFQC 1AGGKWSQHD KLWLQQWNGR DTALMVTRW 300 
NHRRFSATVS VADTAQRSVS KYRCVIRSDG GSGVSNYAEL IVKEPPTPIA PPELLAVGAT 360 
YLWIKPNANS IIGDGPIILK EVEYRTTTGT WAETHIVDSP NYKLWHLDPD VEYEIRVLLT 420 
RPGEGGTGPP GAPLTTRTKC ADPVHGPQNV E1VDIRARQL TLQWEPFGYA VTRCHSYNLT 480 
VGYQYVFNQQ QYEAEEVIQT SSHYTLRGLR PFMT1RLRLL LSNPEGRMES EELVVQTEED 540 
VPGAVPLESl QGGPFEEKIY IQWKPPNETN GVITLYEINY KAVGSLDPSA DLSSQRGKVF 600 
KLRNETHHLF VGLYPGTTYS FTHCASTAKG FGPPVTTRIA TKISAPSMPE YDTDTPLNET 660 
DTTITVMLKP AQSRGAPVSV YQLWKEERL QKSRRAADII ECFSVPVSYR NASSLDSLHY 720 
F AAELKPANL PVTQPFTVGD NKTYNGYWNP PLSPLKSYSI YFQALSKANG ETKINCVRLA 780 
TKAPMGSAQV TPGTPLCLLT TGASTQNSNT VEPEKQVDNT VKMAG VIAGL LMFIOUGV 840 
MLTIKRRRNA YSYSYYLSQR KLAKKQKETQ SGAQREMGPV ASADKPTTKL SASRNDEGFS 900 
SSSQDVNGFT DGSRGELSQP TLTIQTHPYR TCDPVEMSYP RDQFQLAIRV ADLLQHTTQM 960 
KRGOG YGFKE EYEALPEGQT ASWDTAKEDE NRNKNRYGNI ISYDHSRVRL LVLDGDPHSD 1020 
YINANYIDGY HRPRHYIATQ GPMQETVKDF WRMIWQENSA SIVMVTNLVE VGRVKCVRYW 1080 
PDDTEVYGDI KVTLIETEPL AEYV1RTFTV QKKGYHEIRE LRLFHFTSWP DHGVPCYATG 1140
LLGFVRQVKF LNPPEAGPIV VHCSAGAGRT GCFIAIDTML DMAENEGWD IFNCVRELRA 1200 
ORVNLVQTEE QYVFVHDAIL EACLCGNTAI PVCEFRSLYY NISRLDPQTN SSQIKDEFQT 1260 
LN1VTPRVRP EDCSIGLLPR NHDKNRSMDV LPLDRCLPFL 1SVDGESSNY INAALMDSHK 1320 
OPAAFVVTQH PLPKTV ADFW RLVFDYMCSS WMLNEMDTA QFCMQYWPEK TSGCYGPIQV 1380 
EFVSADIDED UHRIFRICN MARPQDGYRI VQHLQYIGWP AYRDTPPSKR SLLKVVRRLE 1440
KWQEQYDGRE GRTVVHCLNG GGRSGTFCAI CSVCEMIQQQ NIIDVFHIVK TLRNNKSNMV 1500
ETLEQYKFVY EVALEYLSSF 

Seq ID NO: 99 DNA sequence 

Nucleic Acid Accession #: NM_002988.1

Coding sequence: 71-340 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51 

GOGGCACGAG AGGAGTTGTG AGTTTCCAAG CCCCAGCTCA CTCTGACCAC TTCTCTGCCT 60
GCCCAGCATC ATGAAGGGCC TTGCAGCTGC CCTCCTTGTC CTCGTCTGCA CCATGGCCCT 120
CTGCTCCTGT GCACAAGTTG GTACCAACAA AGAGCTCTGC TGCCTCGTCT ATACCTCCTG 180
GCAGATTCCA CAAAAGTTCA TAGTTGACTA TTCTGAAACC AGCCCCCAGT GCCCCAAGCC 240
AGGTGTCATC CTCCTAACCA AGAGAGGCCG GCAGATCTGT GCTGACCCCA ATAAGAAGTG 300
GGTCCAGAAA TACATCAGCG ACCTGAAGCT GAATGCCTGA GGGGOTGGA AGCTGCGAGG 360
GCCCAGTGAA CTTGGTGGGC CCAGGAGGGA ACAGGAGCCT GAGCCAGGGC AATGGCCCTG 420
CCACCCTGGA GGCCACCTCT TCTAAGAGTC CCATCTGCTA TGCCCAGCCA CATTAACTAA 480
CTTTAATCTT AGTTTATGCA TCATATTTCA TTTTGAAATT GATTTCTATT GTTGAGCTGC 540
ATTATGAAAT TAGTATTTTC TCTGACATCT CATGACATTG TCTTTATCAT CCTTTCCCCT 600
TTCCCTTCAA CTCTTCGTAC ATTCAATGCA TGGATCAATC AGTGTGATTA GCTTTCTCAG 660
CAGACATTGT GCCATATGTA TCAAATGACA AATCTTTATT GAATGGTTTT GCTCAGCACC 720 
ACCTTTTAAT ATATTGGCAG TACTTATTAT ATAAAAGGTA AACCAGCATT CTCACTGTGA 780 
AAAAAAAAAA AAAAAAAAAA AAA 

Seq ID NO: 100 Protein sequence:
Protein Accession #: NP_002979.1

1 11 21 31 41 51

MKGLAAALLV LVCTMALCSC AQVGTNKELC CLVYTSWQIP QKFIVDYSET SPQCPKPGVI 60
LLTKRGRQIC ADPNKKWVQK YISDLKLNA



Seq ID NO: 101 DNA sequence

Nucleic Acid Accession #: NM_015507.2

Coding sequence: 241-1902 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

CCGCAGAGGA GCCTCGGCCA GGCTAGCCAG GGCGCCCCCA GCCCCTCCCC AGGCCGCGAG 60 
CGCCCCTGCC GCGGTGCCTG GCCTCCCCTC CCAGACTGCA GGGACAGCAC CCGGTAACTG 120 
CGAGTGGAGC GGAGGACCCG AGCGGCTGAG GAGAGAGGAG GCGGCGGCTT AGCTGCTACG 180
GGGTCCGGCC GGCGCCCTCC CGAGGGGGGC TCAGGAGGAG GAAGGAGGAC CCGTGCGAGA 240
ATGCCTCTGC CCTGGAGCCT TGCGCTCCCG CTGCTGCTCT CCTGGGTGGC AGGTGGTTTC 300 
GGGAACGCGG CCAGTGCAAG GCATCACGGG TTGTTAGCAT CGGCACGTCA GCCTGGGGTC 360 
TGTCACTATG GAACTAAACT GGCCTGCTGC TACGGCTGGA GAAGAAACAG CAAGGGAGTC 420 
TGTGAAGCTA CATGCGAACC TGGATGTAAG TTTGGTGAGT GCGTGGGACC AAACAAATGC 480 
AGATGCTTTC CAGGATACAC CGGGAAAACC TGCAGTCAAG ATGTGAATGA GTGTGGAATG 540 
AAACCCCGGC CATGCCAACA CAGATGTGTG AATACACACG GAAGCTACAA GTCCTTTTGC 600 
CTCAGTGGCC ACATGCTCAT GCCAGATGCT ACGTGTGTGA ACTCTAGGAC ATGTGCCATG 660
ATAAACTGTC AGTACAGCTG TGAAGACACA GAAGAAGGGC CACAGTGCCT OTGTCCATCC 720
TCAGGACTCC GCCTGGCCCC AAATGGAAGA GACTGTCTAG ATATTGATGA ATGTGCCTCT 780
GGTAAAGTCA TCTGTCCCTA CAATCGAAGA TGTGTGAACA CATTTGGAAG CTACTACTGC 840
AAATGTCACA TTGGTTTCGA ACTGCAATAT ATCAGTGGAC GATATGACTG TATAGATATA 900
AATGAATGTA CTATGGATAG CCATACGTGC AGCCACCATG CCAATTGCTT CAATACCCAA 960 
GGGTCCTTCA AGTGTAAATG CAAGCAGGGA TATAAAGGCA ATGGACTTCG GTGTTCTGCT 1020 
ATCCCTGAAA ATTCTGTGAA GGAAGTCCTC AGAGCACCTG GTACCATCAA AGACAGAATC 1080 
AAGAAGTTGC TTGCTCACAA AAACAGCATG AAAAAGAAGG CAAAAATTAA AAATGTTACC 1140
CCAGAACCCA CCAGGACTCC TACCCCTAAG GTGAACTTGC AGCCCTTCAA CTATGAAGAG 1200
ATAGTTTCCA GAGGCGGGAA CTCTCATGGA GGTAAAAAAG GGAATGAAGA GAAAATGAAA 1260
GAGGGGCTTG AGGATGAGAA AAGAGAAGAG AAAGCCCTGA AGAATGACAT AGAGGAGCGA 1320
AGCCTGCGAG GAGATGTGTT TTTCCCTAAG GTGAATGAAG CAGGTGAATT CGGCCTGATT 1380 
CTGGTCCAAA GGAAAGCGCT AACTTCCAAA CTGGAACATA AAGATTTAAA TATCTCGGTT 1440 
GACTGCAGCT TCAATCATGG GATCTGTGAC TGGAAACAGG ATAGAGAAGA TGATTTTGAC 1500
TGGAATCCTG CTGATCGAGA TAATGCTATT GGCTTCTATA TGGCAGTTCC GGCCTTGGCA 1560
GGTCACAAGA AAGACATTGG CCGATTGAAA CTTCTCCTAC CTGACCTGCA ACCCCAAAGC 1620
AACTTCTGTT TGCTCTTTGA TTACCGGCTG GCCGGAGACA AAGTCGGGAA ACTTCGAGTG 1680
TTTGTGAAAA ACAGTAACAA TGCCCTGGCA TGGGAGAAGA CCACGAGTGA GGATGAAAAG 1740
TGGAAGACAG GGAAAATTCA GTTGTATCAA GGAACTGATG CTACCAAAAG C^^TTTTT 1800 
GAAGCAGAAC GTGGCAAGGG CAAAACCGGC GAAATCGCAG TGGATGGCGT CTTGCTTGTT 1860 
TCAGGCTTAT GTCCAGATAG CCTTTTATCT GTGGATGACT GAATGTTACT ATCTTTATAT 1920
TTGACTTTGT ATGTCAGTTC CCTGGTTTTT TTGATATTGC ATCATAGGAC CTCTGGCATT 1980
TTAGAATTAC TAGCTGAAAA ATTGTAATGT ACCAACAGAA ATATTATTGT AAGATGCCTT 2040
TCTTGTATAA GATATGCCAA TATTTGCTTT AAATATCATA TCACTGTATC TTCTCAGTCA 2100
TTTCTGAATC TTTCCACATT ATATTATAAA ATATGGAAAT GTCAGTTTAT CTCCOTCCT 2160
CAGTATATCT GATTTGTATA AGTAAGTTGA TGAGCTTCTC TCTACAACAT TTCTAGAAAA 2220
TAGAAAAAAA AGCACAGAGA AATGTTTAAC TGTTTGACTC TTATGATACT TCTTGGAAAC 2280
TATGACATCA AAGATAGACT TTTGCCTAAG TGGCTTAGCT GGGTCTTTCA TAGCCAAACT 2340
TGTATATTTA AATTCTTTGT AATAATAATA TCCAAATCAT CAAAAAAAAA AAAAAAAA

Seq ID NO: 102 Protein sequence: 
Protein Accession #: NP_056322.2

1 11 21 31 41 51 

MPLPWSLALP LLLSWVAGGF GNAASARHHG LLASARQPGV CHYGTKLACC YGWRRNSKGV 60
CEATCEPGCK FGECVGPNKC RCFPGYTGKT CSQDVNECGM KPRPCQHRCV NTHGSYKCFC 120
LSGHMLMPDA TCVNSRTCAM INCQYSCEDT EEGPQCLCPS SGLRLAPNGR DCLDIDECAS 180
GKVICPYNRR CVNTFGSYYC KCHIGFELQY ISGRYDCTDI NECTMDSHTC SHHANCFNTQ 240 
GSFKCKCKQG YKGNGLRCSA IPENSVKEVL RAPGTIKDRI KKLLAHKNSM KKKAKIKNVT 300 
PEPTRTPTPK VNLQPFNYEE IVSRGGNSHG GKKGNEEKMK EGLEDEKREE KALKNDIEER 360
SLRGDVFFPK VNEAGEFGLI LVQRKALTSK LEHKDLNISV DCSFNHGICD WKQDREDDFD 420
WNPADRDNAI GFYMAVPALA GHKKDIGRLK LLLPDLQPQS NFCLLFDYRL AGDKVGKLRV 480 
FVKNSNNALA WEKTTSEDEK WKTGKIQLYQ GTDATKSIIF EAERGKGKTG EIAVDGVLLV 540 
SGLCPDSLLS VDD 



Seq ID NO: 103 DNA sequence
Nucleic Acid Accession #: NM_001565.1

Coding sequence: 67-363 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

GAGACATTCC TCAATTGCTT AGACATATTC TGAGCCTACA GCAGAGGAAC CTCCAGTCTC 60 
AGCACCATGA ATCAAACTGC GATTCTGATT TGCTGCCTTA TCTTTCTGAC TCTAAGTGGC 120 
ATTCAAGGAG TACCTCTCTC TAGAACCGTA CGCTGTACCT GCATCAGCAT TAGTAATCAA 180 
CCTGTTAATC CAAGGTCTTT AGAAAAACTT GAAATTATTC CTGCAAGCCA ATTTTGTCCA 240
CGTGTTGAGA TCATTGCTAC AATGAAAAAG AAGGGTGAGA AGAGATGTCT GAATCCAGAA 300
TCGAAGGCCA TCAAGAATTT ACTGAAAGCA GTTAGCAAGG AAATGTCTAA AAGATCTCCT 360
TAAAACCAOA GGGGAGCAAA ATCGATGCAG TGCTTCCAAG GATGGAOCAC ACAGAGGCTG 420
CCTCTCCCAT CACTTCCCTA CATGGAGTAT ATGTCAAGCC ATAATTGTTC TTAGTTTGCA 480 
GTTACACTAA AAGGTGACCA ATGATGGTCA CCAAATCAGC TGCTACTACT CCTGTAGGAA 540 
GGTTAATGTT CATCATCCTA AGCTATTCAG TAATAACTCT ACCCTGGCAC TATAATGTAA 600 
GCTCTACTGA GGTGCTATGT TCTTAGTGGA TGTTCTGACC CTGCTTCAAA TATTTCCCTC 660
ACCTTTCCCA TCTTCCAAGG GTACTAAGGA ATCTTTCTGC TTTGGGGTTT ATCAGAATTC 720
TCAGAATCTC AAATAACTAA AAGGTATGCA ATCAAATCTG CTTTTTAAAG AATGCTCTTT 780 
ACTTCATGGA CTTCCACTGC CATCCTCCCA AGGGGCCCAA ATTCTTTCAG TGGCTACCTA 840 
CATACAATTC CAAACACATA CAGGAAGGTA GAAATATCTG AAAATGTATG TGTAAGTATT 900 
CTTATTTAAT GAAAGACTGT ACAAAGTATA AGTCTTAGAT GTATATATTT CCTATATTGT 960
TTTCAGTGTA CATGGAATAA CATGTAATTA AGTACTATGT ATCAATGAGT AACAGGAAAA 1020 
TTTTAAAAAT ACAGATAGAT ATATGCTCTG CATGTTACAT AAGATAAATG TGCTGAATGQ 1080 
TTTTCAAATA AAAATGAGGT ACTCTCCTGG AAATATTAAG

Seq ID NO: 104 Protein sequence:
Protein Accession #: NP_001556.1

1 11 21 31 41 51
| | | | | |

MNQTAILICC LIFLTLSGIQ GVPLSRTVRC TCISISNQPV NPRSLEKLEI IPASQFCPRV 60
EIIATMKKKG EKRCLNPESK AIKNLLKAVS KEMSKRSP 

Seq ID NO: 105 DNA sequence 

Nucleic Acid Accession #: NM_015068.1

Coding sequence: 1170-2243 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

GTAACAACCG TCACCCTGGG TCCCGACTGC CCACCTCCTC CTCCTCCCCC TCCCCCCAAC 60 
AACAACAACA ACAACAACTC CAAGCACACC GGCCATAAGA GTGCGTGTGT CCCCAACATG 120 
ACCGAACGAA GAAGGGACGA GCTCTCTGAA GAGATCAACA ACTTAAGAGA GAAGGTCATG 180
AAGCAGTCGG AGGAGAACAA CAACCTGCAG AGCCAGGTGC AGAAGCTCAC AGAGGAGAAC 240
ACCACCCTTC GAGAGCAAGT GGAACCCACC CCTGAGGATG AGGATGATGA CATCGAGCTC 300
CGCGGTGCTG CAGCAGCTGC TGCCCCACCC CCTCCAATAG AGGAAGAGTG CCCAGAAGAC 360 
CTCCCAGAGA AGTTCGATGG CAACCCAGAC ATGCTGGCTC CTTTCATGGC CCAGTGCCAG 420 
ATCTTCATGG AAAAGAGCAC CAGGGATTTC TCAGTTGATC GTGTCCGTGT CTGCTTCGTG 480 
ACAAGCATGA TGACCGGCCG TGCTGCCCGT TGGGCCTCAG CAAAGCTGGA GCGCTCCCAC 540 
TACCTGATGC ACAACTACCC AGCTTTCATG ATGGAAATGA AGCATGTCTT TGAAGACCCT 600
CAGAGGCGAG AGGTTGCCAA ACGCAAGATC AGACGCCTGC GCCAAGGCAT GGGGTCTGTC 660
ATCGACTACT CCAATGCTTT CCAGATGATT GCCCAGGACC TGGATTGGAA CGAGCCTGCG 720
CTGATTGACC AGTACCACGA GGGCCTCAGC GACCACATTC AGGAGGAGCT CTCCCACCTC 780
GAGGTCGCCA AGTCGCTGTC TGCTCTGATT GGGCAGTGCA TTCACATTGA GAGAAGGCTG 840
GCCAGGGCTG CTGCAGCTCG CAAGCCACGC TCGCCACCCC GGGCGCTGGT GTTGCCTCAC 900 
ATTGCAAGCC ACCACCAGGT AGATCCAACC GAGCCGGTGG GAGGTGCCCG CATGCGCCTG 960 
ACGCAGGAAG AAAAAGAAAG ACGCAGAAAG CTGAACCTGT GCCTCTACTG TGGAACAGGA 1020 
GGTCACTACG CTGACAATTG TCCTGCCAAG GCCTCAAAGT CTTCGCCGGC GGGAAACTCC 1080
CCGGCCCCGC TGTAGAGGGA CCTTCAGCGA CCGGGCCAGA AATAATAAGG TCCCCACAAG 1140
ATGATGCCTC ATCTCCACAC TTGCAAGTGA TGCTCCAGAT TCATCTTCCG GGCAGACACA 1200
CCCTGTTCGT CCGAGCCATG ATCGATTCTG GTGCTTCTGG CAACTTCATT GATCACGAAT 1260 
ATGTTGCTCA AAATGGAATT CCTCTAAGAA TCAAGGACTG GCCAATACTT GTGGAAGCAA 1320 
TTGATGGGCG CCCCATAGCA TCGGGCCCAG TTGTCCACGA AACTCACGAC CTGATAGTTG 1380 
ACCTGGGAGA TCACCGAGAG GTGCTGTCAT TTGATGTGAC TCAGTCTCCA TTCTTCCCTG 1440 
TCGTCCTAGG GGTTCGCTGG CTGAGCACAC ATGATCCCAA TATCACATGG AGCACTCGAT 1500 
CTATCGTCTT TGATTCTGAA TACTGCCGCT ACCACTGCCG GATGTATTCT CCAATACCAC 1560
CATCGCTCCC ACCACCAGCA CCACAACCGC CACTCTATTA TCCAGTAGAT GGATACAGAG 1620
TTTACCAACC AGTGAGGTAT TACTATGTCC AGAATGTGTA CACTCCAGTA GATGAGCACG 1680
TCTACCCAGA TCACCGCCTG GTTGACCCTC ACATAGAAAT GATACCTGGA GCACACAGTA 1740
TTCCCAGTCG ACATGTGTAT TCACTGTCCG AACCTGAAAT GGCAGCTCTT CGAGATTTTG 1800
TGGCAAGAAA TGTAAAAGAT GGGCTAATTA CTCCAACGAT TGCACCTAAT GGAGCCCAAG 1860
TTCTCCAGGT GAAGAGGGGG TGGAAACTGC AAGTTTCTTA TGATTGCCOA GCTCCAAACA 1920
ATTTTACTAT CCAGAATCAG TATCCTCGCC TATCTATTCC AAATTTAGAA GACCAAGCAC 1980
ACCTGGCAAC GTACACTGAA TTCGTACCTC AAATACCTGG ATACCAAACA TACCCCACAT 2040
ATGCCGCGTA CCCGACCTAC CCAGTAGGAT TCGCCTGGTA CCCAGTGGGA CGAGACGGAC 2100
AAGGAAGATC ACTATATGTA CCTGTGATGA TCACTTGGAA TCCACACTGG TACCGCCAGC 2160
CTCCGGTACC ACAGTACCCG CCGCCACAGC CGCCGCCTCC ACCACCACCA CCGCCGCCGC 2220
CTCCATCTTA CAGTACCCTG TAAATACCTG TCATGTCCTT CAGGATCTCT GCCCTCAAAA 2280
TTTATTCCTO TTCAGCTTCT CAATCAGTGA CTGTaTGCTA^^MGCTA^aUTa[2M0
TCAGGCCACC TGAGGCACAT CCTCTCTGAA ACGGCTATGG AAGGTTAGGG CCACTCTGGA MOO 
CTGGCACACA TCCTAAAGCA CCAAAAG ACC "^CAACATTT TCTGAGAGCA ACAGAOTATT 2460 
TGCCAATAAA TCATCTCTCA TTTTTCCACC TTGACTGCCA ATCTAACTAAAA^IAATTAAT 2520 
AAGTTTACTT TCCAGOCAGT CCTGG AAGTC TGGGTTTTAC CTGCCAAAAC ^?*AJC*^5. ^640 
ATCTAAATTA TAGGCTGCCA AATTTGCTGT TTAACATTTA 

CAGGAAATGC TGATTTCTTT ATGGAGGGGG AGACGAGGAG ^92^fi^P^^^f^J^^2160^ 
TGCGGTTTCG GTACCCTCTT TTTAAATCAC TGG AGGACTG AGGCOTA^ ^GG^GCCA 2760 
AAATTATCGG TGCAGTGTGG AAAGGCTTCC GTG ATCCTCT CGCTGCACCC TrAO^AACTT 2320 
CAoStCTTC AA ACTCC ATT TCCATGGTTC TGTTAATTCT CAAGGAGCAG CAACTCGACT 2880 
GGTTCTOCCA G^ AGiCAGGAA^AAACCCTTGT ^^^^^^^J^^^9S^^^^?jQ^f^ 
GTGCTCTCTC AGATGGACTC TTGCATGTTA AGACTATGTC TTCACATCAT GGTGCAAATC 3000 
aStCT^^TOACTCCG GCTTTGACAC AACACCTTAC CATCATCATG CCATGATGGC 3060 
TTOACAAAG CATTAAACCT GGTAACCAGA GATTACTGGT GGCTCCAGCG TTGTTAGATG 3120 
TtStca1a?GTCA^CCT CrCAATCACC TTTG AGGGCT AAAGAGTv^C^^ATCAAAAG ™ 
GACTCCAAAA TCCCATACCC AACTCTTAAC AGATTTGT^CCTGG^<^GAA^AATTT 3240 
TCATGAGTGT TCTTAATTOG CTGGAAAAGC ACCAGCTGAC GTTTTGGAAG AAJCT ATCC A 3300 
TCTGTCTGCC TCCATATGCA TCTGGGCATT TCATCTTCAG TCCCCTCATT AGACTGTAGC 3360 
A^AGGA^OT C?TgS(^<3AG GAGAAATGAT ^AGCACC^GATircACACT ^^U^^^J 3420 
GAAGGGQGAC ATCTTTGAAG AAGAGGAATT ^^l^^^^fj^ 0 ,^' 
ACTTCCTTAG TO AGCTCCAC ATTACTTG AT GGTAACCACT TCAAAAGGAT CAG A ATCCAC 3540 
GTA^SJCkA^GG^^CrCT AGAGGATGG A G ^^^^^^ r ^PS^9^^^i?^^Q)^^^ 35g0 
rrrrAr.AAAfi f!AACTCAAAG GACTCAAAGC AACGGACAAC ACAAGAGTTG TCTTCA&CIA. joou 

a^gIca^otSccctg^ 

TAOrrTGGTC CCTTGCTACT ACCTTGAACT GTTTTATCTA ACCTCTCTTT TTCTGTTTAA 3780 
T^C^TaSa^GA CcScTGCA GGATTTGTGT CATT^G CCTGGTTGCT 3840 
GAGACTCCAT TTTCCTGCCA CACACAGAGA TGTAAGAGGC AGGCTTTAAT TGCCAAAGCA 39W 
SKSSS A^TAGAAAAC AACATGGTGT ATATCT^TTGCCTCACA^^ 
TCTAACGGTG AAGTTTCACT TTTCATCAGC ATCATCTTTC ACATGTTCAT TATCATOCGC 4020 
TrT^TTX^TT GCATCTTTAA ACACTTAAAA TTTTTAGTAT AATTTTTAGT GTGTTTTGAA 4080 

TTAAACTAAG TAAAAATGAT AAGTAACATA GTGTAAAATA TTCCTTTAC1 °™AACTra 4200 
TACAATGCTG TGAATGAGAG GCTCCTCAGA ACTGGAGCATTTG1WT^TA^(^TCCTC i 4260 
TTCATCTTCA ATTTTAACAT CATATATAAT TTCAATTCTA TCAATTGGGC CTTTAAAAAT 4320 
SK££g ^ItIt^at TTG AAAAGAG AAACCTAATT <^ATTTAAraA^^43 8 0 
CTlliniT T TCCTTCAATG G A ATC AG AAA GCTTGTCAAT CACTCATGTG TTTTAGAGTA 4440 
ATTACTTTTaXStGGTGCA TTTGTGCTTC TGAACTATTTTGAAGAGTCAOTCTGTTTA A 00 

aS^t^Sa^S^ 

rr^Tfn^CA^ACGGACATT T^CCACCACT GCAAGCAAAA GTCTGGAGAA GTTCACCAAC 4680 
GAC^^CG ATTAGGGA^XtGCTGCT GTGGGTTAAC AACTCAGAAA GT^ATC 4740 

Sa^tSc^ 

TTA™£I ^G^A^^Am^MTTtGMAT CTAAAGAAGT TAGAAAGGGA 5040 
S3S£Ca ^c^t^g ^gatagatg AATTAGTAAG CA^OTACTCGAGTTTT5.00 

TCAAACTGGC AAAATTAATT AATTGACTTT TAGCCCAAAT TTACATTGTT AAJTAAATCA 5160 
Ar/lrr. A a£ aTgatctaAGA GCTCCCATTG ATAGGCAAGC CTAGAGAGAA CTAGCTAAAT 5220 

ZSm A AACA CAcL ^GTITA(>TACATnATGAAGGGTCMTTTA 5280 
r^rrrpr Af* Afi^TiA^CT^rrT GTCTTAGTGG AAAAAAGGAG AATTAGTCTG ATCAAATCGT 5340 

gZgSS 

CTG^aIagSg^A GAAATGGAGAGCCCTGCGGG AACCATGCTA ■ CATTTTOVTC 55«> 
TAAAGAGAGA ATGCACATCT GATGAGACTG AAAGTTCTrrGTrGrT^AG A^G^GAAT 5640 
GGTATTGAAT TGGTCTGTGG AAAATTGCAT TGCTTTTATT TCTrTGTGTA J*™^^™ 
a7~t1 mfim n AT AT ATA AT CATAAGCATT TTAGGGTGGG AGGGACTATT AAGTAATTTT 5760 

a^gctgSI™ga^^^ 

SgSS? £rTACTTGT AACCCTTrAC^TAATTG ^JF^^^^^o 

a ArrmnA rtfiOAACTGCA GGGAGACCAA CTTATTTAGA GCGAATTGGA CATGUA1 AAA 
II^SStG GGAG a! AGTT CAAAGGTCAT TAGATTAATA ATTTAATAGA GO ATGAGTGA 6000 
^^rVrA A a^TACT^CTA G AATC AACTT GTCAATG ATC G ATGGTAAAT TTTCATGGAA 6060 
SaaA^GTG^^^ 

CACTGAACCC CATTGCCCCT ACCCCTCC1T CTAACnTAT TGCTGTATTC TCTTCACTCT 6180 
A^ATTTCTCT CTAT^GCTA ATATTGCATT GCTGTTACAA TAAAAATTCA ATAAAGATTT 6240 
AGTGGTTAAG TGC 

Seq ID NO: 106 Protein sequence:
Protein Accession #: NP_055883.1

1 11 21 31 41 51

MTERRRDELS EEINNLREKV MKQSEENNNL QSQVQKLTEE NTTLREQVEP TPEDEDDDIE 60

LRGAAAAAAP PPPIEEECPE DLPEKFDGNP DMLAPFMAQC QIFMEKSTRD FSVDRVRVCF 120
LARAAAARKP RSPPRALVLP HLASHHQVDP TEPVGGARMR LTQEEKERRR KLNLCLYCGT 300
GGHYADNCPA KASKSSPAGN SPAPL 

Seq ID NO: 107 DNA sequence 

Nucleic Acid Accession #: NM_003679.1

Coding sequence: 47-1507 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

GGCACGAGCA GAAGCAACAA TAATTGTGAA AAATACTTCA GCAGTTATGG ACTCATCTGT 60
CATTCAAAGG AAAAAAGTAG CTGTCATTGG TGGTGGCTTG GTTGGCTCAT TACAAGCATG 120
CTTTCTTGCA AAGAGGAATT TCCAGATTGA TGTATATGAA GCTAGGGAAG ATACTCGAGT 180
GGCTACCTTC ACACGTGGAA GAAGCATTAA CTTAGCCCTT TCTCATAGAG GACGACAAGC 240
CTTGAAAGCT GTTGGCCTGG AAGATCAGAT TGTATCCCAA GGTATTCCCA TGAGAGCAAG 300
AATGATCCAC TCTCTTTCAG GAAAAAAGTC TGCAATTCCC TATGGGACAA AGTCTCAGTA 360
TATTCTTTCT GTAAGCAGAG AAAATCTAAA CAAGGATCTA TTGACTGCTG CTGAGAAATA 420
CCCCAATGTG AAAATGCACT TTAACCACAG GCTGTTGAAA TGTAATCCAG AGGAAGGAAT 480
GATCACAGTG CTTGGATCTG ACAAAGTTCC CAAAGATGTC ACTTGTGACC TCATTGTAGG 540
ATGTGATGGA GCCTATTCAA CTGTCAGATC TCACCTGATG AAGAAACCTC GCTTTGATTA 600
CAGTCAGCAG TACATTCCTC ATGGGTACAT GGAGTTGACT ATTCCACCTA AGAACGGAGA 660
TTATGCCATG GAACCTAATT ATCTGCATAT TTGGCCTAGA AATACCTTTA TGATGATTGC 720
ACTTCCTAAC ATGAACAAAT CATTCACATG TACTTTGTTC ATGCCCTTTG AAGAGTTTGA 780
AAAACTTCTA ACCAGTAATG ATGTGGTAGA TTTCTTCCAG AAATACTTTC CGGATGCCAT 840 
CCCTCTAATT GGAGAGAAAC TCCTAGTGCA AGATTTCTTC CTGTTGCCTG CCCAGCCCAT 900
GATATCTGTA AAGTGCTCTT CATTTCACTT TAAATCTCAC TGTGTACTGC TGGGAGATGC 960
AGCTCATGCT ATAGTGCCGT TTTTTGGGCA AGGAATGAAT GCGGGCTTTG AAGACTGCTT 1020 
GGTATTTGAT GAGTTAATGG ATAAATTCAG TAACGACCTT AGTTTGTGTC TTCCTGTGTT 1080 
CTCAAGATTG AGAATCCCAG ATGATCACGC GATTTCAGAC CTATCCATGT ACAATTACAT 1140
AGAGATGCGA GCACATGTCA ACTCAAGCTG GTTCATTTTT CAGAAGAACA TGGAGAGATT 1200
TCTTCATGCG ATTATGCCAT CGACCTTTAT CCCTCTCTAT ACAATGGTCA CTTTTTCCAG 1260 
AATAAGATAC CATGAGGCTG TGCAGCGTTG GCATTGGCAA AAAAAGGTGA TAAACAAAGG 1320 
ACTCTTTTTC TTGGGATCAC TGATAGCCAT CAGCAGTACC TACCTACTTA TACACTACAT 1380 
GTCACCACGA TCTTTCCTCT GCTTGAGAAG ACCATGGAAC TGGATAGCTC ACTTCCGGAA 1440
TACAACATGT TTCCCCGCAA AGGCCGTGGA CTCCCTAGAA CAAATTTCCA ATCTCATTAG 1500
CAGGTGATAG AAAGGTTTTG TGGTAGCAAA TGCATGATTT CTCTGTGACC AAAATTAAGC 1560
ATGAAAAAAA TGTTTCCATT GCCATATTTG ATTCACTAGT GGAAGATAGT GTTCTGCTTA 1620
TAATTAAACT GAATGTAGAG TATCTCTGTA TGTTAATTGC AATTACTGGT TGGGGGGTGC 1680
ATTTTAAAAG ATGAAACATG CAGCTTCCCT ACATTACACA CACTCAGGTT GAGTCATTCT 1740
AACTATAAAA GTGCAATGAC TAAGATCCTT CACTTCTCTG AAAGTAAGGC CCTAGATGCC 1800
TCAGGGAAGA CAGTAATCAT GCCTTTTCTT TAAAAGACAC AATAGGACTC GCAACAGCAT 1860
TGACTCAACA CCTAGGACTA AAAATCACAA CTTAACTAGC ATGTTAACTG CACTTTTCAT 1920
TACGTGAATG GAACTTACCT AACCACAGGG CTCAGACTTA CTAGATAAAA CCAGAAATGG 1980
AAATAAGGAA TTCAGGGGAG TTCCAGAGAC TTACAAAATG AACTCATTTT ATTTTCCCAC 2040 
CTTCAAATAT AAGTATTATC ATCTATCTGT TTATCGTCTA TCTATCTATC ATCTATCTAT 2100 
CTATCTATCA TCTATCTATC TATCTATCTA TCTATCTATC TATCTATCTA TCTATCTATC 2160 
TCTATTTATT TATGTATTTA GAGATCAGGT CTCACTCTGT TGACCAGGCT GGAGTGCAGT 2220
GGTGAGATCT GGGTTCACTG CAACCTCTGC CTCCTGGGCT CAAGCAATCC TCCCACTTCA 2280
GCCTCCCAAA TAGCTGGGGC TACCATGGTA TTTTTCAGTA GAGACCGGGT CTTGCCATGC 2340
TGCCCAGGCC AGTCTCAAAC TCCTGGCCTC ATGTGATCTG CCCACCTCAG CCTCCCAAAG 2400
TACAGGGATT AGAGTTGTGA GCCACCGCTG CCAGCCCAGA GTTACCCTCT AAAGATAAGA 2460
AAAAGGCTAT TAATATCATA CTAAGTGAAG GACAGGAAAG GGTTTTATTC ATAAATTAAA 2520
TGTCTACATG TGCCAGAATG GAAAGGAAAC AAGGGGAGAC AACTTTTATA GAAATACAAA 2580
GCCATTACTT TATTCAATTT CAGACCCTCA GAAGCAATTT ACTAATTTAT TCTTCGACTA 2640
CATACTGCAG CAGAACCAGC AATACACTTG ATTTTTAAAA GCACATTTAG TGAAATGTTT 2700 
TCTTTGGTTC ATCCTTCTTT AACAGGCTGC TGAGTCACTC AGAAATCCTT CAAACATGAT 2760 
TAATTATGAA GATGAAACAC TAGAGTCATA TAAOAAATAA AAATTGGGCA ATAAAATAAA 2820 
ATGATTCAGT GTTTCTTTTC TATATTGTCA ATGAAAACCT TGAGTTCTAA TAATCCATGT 2880
TCAGTTTGTA GGGAAAGAAA AAATAATTTT TCCTTCTACC CACTTTAGGT TCCTTGGCTG 2940
GGGCCCCTAT AACAAAAGAC AGATTGACAA GAGAAAAACA AACATAAATT TATTAGCGGG 3000
TATATGTAAT ATATATGTGG GAAATACAGG GGAATGAGCA AATCTCAAAG AGCTGGCGTC 3060
TTAGAACTCC CTGGCTTATA TAGCATCGAC AAAGAACAGT AAATTTTTAG AGAAACAACA 3120
AAACAAAGAA AAAGAGCTTT GAGTCTGTAG GGGCAGCAAT TTGGGGGAAG CAAATATATG 3180
GGAGTTTGCC TTGTAGATTC CTCTGGTGGT GGTCTCCAGG CTGACAAGGA TTCAAAGTTG 3240
TCTCTGAAAC TCCTCTTTGT CATACTGCAC ATATAAAACG TCTTTTGTTT CCAACAAGAG 3300
GATTTCTTTT TCATTCTAGA ATTATCTCCT TGATAACTTG ATCAGATATA GGACATGACA 3360
CTGAATAGAG TCCAACAGTA CAAAAAAAAT TCAGTATGTT CTAGCTACTT CACACATGTG 3420
TACGCGACAG TTATTTTTAC AGTAAGGTAT TTTCGAGAAA AATGCATTAC GTGTTTTGGA 3480
AAATAGAGTA ATTTAAAAAA TATATTTGAA ATGAAAATCT CCAACACATT AGAAGATGAT 3540
GATGTTAGAT GCCCATCGTG TGCCACAAGT GGTTTTTTCA TTATGTAAAG CACCCGTTGA 3600
ATTAAAAGAA TTTGTTTTTG TTCAACCTCT TCCTGAGGCC CAAGAGCATA TGGGCAATTC 3660
GGATTTCCTG CTGGACCACA AGGTTCTGTT GATATTACAT AGAAACGGGT ATTCCAGACA 3720
CTTCTTATOA TGAAAGTCCA AAAGTGGCAT CCAATTTAAG GCCCCATCTT TCGTTGCCAT 3780
TCTTCATTCC TACAAAGGAC GAACTTGGAT TACATCAACT TTGGACCCAT TGGTTTTGTC 3840
GCTGTCGTCA ACTGACAGTG ATTCACCACT GGTGATGATA AAAATGATGG AAGAAGAGTT 3900
GAAAGTCACT TTTTTCTTTG GCCTGTCCCC ATCTTTCTGT GACATCACAA TGGGTCTGAT 3960 
CTGCATTTCA CTTCCAGCTG CTGGTAGGTC TTTAGCAGGC CTCTGGCACC TCAGCAGTCG 4020 
GAGGCACAOA AGCTGCAAAA GGGATCTTCG AAACTGGGCA GAGAAAAAAT AAAGTGGAAT 4080
ATTAAGTAAA AGTTGGGCAC TAATCTGGAT TAACATTCGA GGAAATCAGT TGAGCTGATT 4140
TAAGTTGTTT TTTGTTTGTT AGCAGGTGTG GATGTGGGGT TATGTGGTCA TGCTCAGATC 4200
TACCTAAATC ACCCCAGAGC TTTATGTCTT TTATTCATTC TAAATCTTAT TAACCGGAAT 4260
ATGTAGGACC ATTTCAATAC CTTGTAATCC TCCAAGCTTC AATCTGCACA CACTTTCTAT 4320
GAGGGCAGGT ACAACTATTA AGAGATTTTG AACATTAAGT TAGTCCACAA ATATTCAGTG 4380
GGCATCTACT AGGTGACAGC CACTGTGCTA TAATTAGAGA CTTTTTACTA TAAGCATCAA 4440
AAACAGATAA GGCTCTTCCT GGCAGAGTTT ACAGCCTGGT GTACTTGCTA ATGTCTCTTT 4500 
AATTAGGTGA AGAATTTTTT TTTTCTATCG AAATTACTAA TCAGTTGGGG AAAAAAATAC 4560
TATAGCAGAC AGCACTAATG TCATCAACAA ACATTGTTCT TCTCCGTGTC CTGGGTACAA 4620
CATCGAATAA TATTTCTfGG CCTCCTTTCC GCTTCTCCTC TCTGCTGTTC CTCTCTACAA 4680
GAACCTGGGA GGCCAACGCC TAAAGATCAT AATATCACAA TGGAAGGAAC CTAGATTCCT 4740
AAATGACTGC ATAGGACAGA TCCCATCTCC TCCACCCAAT ACATTATTAG ACTGAACTGT 4800
GACCTGAAAT GAGCAATAAA CTCTGTATTA ATTCACTGAA ATGTTGGGGT TGCTTGTTAT 4860
AGTAGTCGGT CCATCATGAC CAGTAAAACA TAAATCAAAA GTTAATGTAA TTGTTATCCC 4920
ATTATTTAGA GCGAAATAAA TGTTGAATAT ATGGACTTTC TCAGATTAGG AAATACCAAT 4980
TAAAAATATA ATAAATAGCT 

Seq ID NO: 108 Protein sequence: 
Protein Accession #: NP_003670.1

1 11 21 31 41 51 

MDSSVIQRKK VAVIGGGLVG SLQACFLAKR NFQIDVYEAR EDTRVATFTR GRSINLALSH 60
RGRQALKAVG LEDQIVSQGI PMRARMIHSL SGKKSAIPYG TKSQYILSVS RENLNKDLLT 120
AAEKYPNVKM HFNHRLLKCN PEEGMITVLG SDKVPKDVTC DLIVGCDGAY STVRSHLMKK 180
PRFDYSQQYI PHGYMELTIP PKNGDYAMEP NYLHIWPRNT FMMIALPNMN KSFTCTLFMP 240
FEEFEKLLTS NDVVDFFQKY FPDAIPLIGE KLLVQDFFLL PAQPMISVKC SSFHFKSHCV 300
LLGDAAHAIV PFFGQGMNAG FEDCLVFDEL MDKFSNDLSL CLPVFSRLRI PDDHAISDLS 360
MYNYIEMRAH VNSSWFIFQK NMERFLHAIM PSTFIPLYTM VTFSRIRYHE AVQRWHWQKK 420
VINKGLFFLG SLIAISSTYL LIHYMSPRSF LCLRRPWNWI AHFRNTTCFP AKAVDSLEQI 480
SNLISR

Seq ID NO: 109 DNA sequence 

Nucleic Acid Accession #: NM_006115.1

Coding sequence: 236-1765 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51 

GCTTCAGGGT ACAGCTCCCC CGCAGCCAGA AGCCGGGCCT GCAGCCCCTC AGCACCGCTC 60 
CGGGACACCC CACCCGCTTC CCAGGCGTGA CCTGTCAACA GCAACTTCGC GGTGTGGTGA 120 
ACTCTCTGAG GAAAAACCAT TTTGATTATT ACTCTCAGAC GTGCGTGGCA ACAAGTGACT 180 
GAGACCTAGA AATCCAAGCG TTGGAGGTCC TGAGGCCAGC CTAAGTCGCT TCAAAATGGA 240
ACGAAGGCGT TTGTGGGGTT CCATTCAGAG CCGATACATC AGCATGAGTG TGTGGACAAG 300 
CCCACGGAGA CTTGTGGAGC TGGCAGGGCA GAGCCTGCTG AAGGATGAGG CCCTGGCCAT 360 
TGCCGCCCTG GAGTTGCTGC CCAGGGAGCT CTTCCCGCCA CTCTTCATGG CAGCCTTTGA 420
CGGGAGACAC AGCCAGACCC TGAAGGCAAT GGTGCAGGCC TGGCCCTTCA CCTGCCTCCC 480
TCTGGGAGTG CTGATGAAGG GACAACATCT TCACCTGGAG ACCTTCAAAG CTGTGCTTGA 540
TGGACTTGAT GTGCTCCTTG CCCAGGAGGT TCGCCCCAGG AGGTGGAAAC TTCAAGTGCT 600
GGATTTACGG AAGAACTCTC ATCAGGACTT CTGGACTGTA TGGTCTGGAA ACAGGGCCAG 660 
TCTGTACTCA TTTCCAGAGC CAGAAGCAGC TCAGCCCATG ACAAAGAAGC GAAAAGTAGA 720 
TGGTTTGAGC ACAGAGGCAG AGCAGCCCTT CATTCCAGTA GAGGTGCTCG TAGACCTGTT 780
CCTCAAGGAA GGTGCCTGTG ATGAATTGTT CTCCTACCTC ATTGAGAAAG TGAAGCGAAA 840 
GAAAAATGTA CTACGCCTGT GCTGTAAGAA GCTGAAGATT TTTGCAATGC CCATGCAGGA 900 
TATCAAGATG ATCCTGAAAA TGGTGCAGCT GGACTCTATT GAAGATTTGG AAGTGACTTG 960
TACCTGGAAG CTACCCACCT TGGCGAAATT TTCTCCTTAC CTGGGCCAGA TGATTAATCT 1020
GCGTAGACTC CTCCTCTCCC ACATCCATGC ATCTTCCTAC ATTTCCCCGG AGAAGGAAGA 1080 
GCAGTATATC GCCCAGTTCA CCTCTCAGTT CCTCAGTCTG CAGTGCCTGC AGGCTCTCTA 1140 
TGTGGACTCT TTATTTTTCC TTAGAGGCCG CCTGGATCAG TTGCTCAGGC ACGTGATGAA 1200 
CCCCTTGGAA ACCCTCTCAA TAACTAACTG CCGGCTTTCG GAAGGGGATG TGATGCATCT 1260 
GTCCCAGAGT CCCAGCGTCA GTCAGCTAAG TGTCCTGAGT CTAAGTGGGG TCATGCTGAC 1320 
CGATGTAAGT CCCGAGCCCC TCCAAGCTCT GCTGGAGAGA GCCTCTGCCA CCCTCCAGGA 1380 
CCTGGTCTTT GATGAGTGTG GGATCACGGA TGATCAGCTC CTTGCCCTCC TGCCTTCCCT 1440
GAGCCACTGC TCCCAGCTTA CAACCTTAAG CTTCTACGGG AATTCCATCT CCATATCTGC 1500 
CTTGCAGAGT CTCCTGCAGC ACCTCATCGG GCTGAGCAAT CTGACCCACG TGCTGTATCC 1560
TGTCCCCCTG GAGAGTTATG AGGACATCCA TGGTACCCTC CACCTGGAGA GGCTTGCCTA 1620 
TCTGCATGCC AGGCTCAGGG AGTTGCTGTG TGAGTTGGGG CGGCCCAGCA TGGTCTGGCT 1680 
TAGTGCCAAC CCCTGTCCTC ACTGTGGGGA CAGAACCTTC TATGACCCGG AGCCCATCCT 1740 
GTGCCCCTGT TTCATGCCTA ACTAGCTGGG TGCACATATC AAATGCTTCA TTCTGCATAC 1800 
TTGGACACTA AAGCCAGGAT GTGCATGCAT CTTGAAGCAA CAAAGCAGCC ACAGTTTCAG 1860
ACAAATGTTC AGTGTGAGTG AGGAAAACAT GTTCAGTGAG GAAAAAACAT TCAGACAAAT 1920
GTTCAGTGAG GAAAAAAAGG GGAAGTTGGG GATAGGCAGA TGTTGACTTG AGGAGTTAAT 1980
GTGATCTTTG GGGAGATACA TCTTATAGAG TTAGAAATAG AATCTGAATT TCTAAAGGGA 2040
GATTCTGGCT TGGGAAGTAC ATGTAGGAGT TAATCCCTGT GTAGACTGTT GTAAAGAAAC 2100
TGTTGAAAAT AAAGAGAAGC AATGTGAAGC AAAAAAAAAA AAAAAAAA

Seq ID NO: 110 Protein sequence:
Protein Accession #: NP_006106.1

1 11 21 31 41 51
| | | | | |

MERRRLWGSI QSRYISMSVW TSPRRLVELA GQSLLKDEAL AIAALELLPR ELFPPLFMAA 60
FDGRHSQTLK AMVQAWPFTC LPLGVLMKGQ HLHLETFKAV LDGLDVLLAQ EVRPRRWKLQ 120
VLDLRKNSHQ DFWTVWSGNR ASLYSFPEPE AAQPMTKKRK VDGLSTEAEQ PFIPVEVLVD 180
LFLKEGACDE LFSYLIEKVK RKKNVLRLCC KKLKIFAMPM QDIKMILKMV QLDSIEDLEV 240
TCTWKLPTLA KFSPYLGQMI NLRRLLLSHI HASSYISPEK EEQYIAQFTS QFLSLQCLQA 300
LYVDSLFFLR GRLDQLLRHV MNPLETLSIT NCRLSEGDVM HLSQSPSVSQ LSVLSLSGVM 360 
LTDVSPEPLQ ALLERASATL QDLVFDECGI TDDQLLALLP SLSHCSQLTT LSFYGNSISI 420 
SALQSLLQHL IGLSNLTHVL YPVPLESYED IHGTLHLERL AYLHARLREL LCELGRPSMV 480 
WLSANPCPHC GDRTFYDPEP ILCPCFMPN 



Seq ID NO: 111 DNA sequence

Nucleic Acid Accession #: NM_003815

Coding sequence: 8-2452 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

CGCTGCCATG CGGCTGGCGC TGCTCTGGGC CCTGGGOCTC CTGGGCGCGG GCAGCCCTCT 60 
GCCTTCCTGG CCGCTCCCAA ATATAGGTGG CACTGAGGAG CAGCAGGCAG AGTCAGAGAA 120 
GGCCCCGAGG GAGCCCTTGG AGCCCCAGGT CCTTCAGGAC GATCTCCCAA TTAGCCTCAA 180 
AAAGGTGCTT CAGACCAGTC TGCCTGAGCC CCTGAGGATC AAGTTGGAGC TGGACGGTGA 240
CAGTCATATC CTGGAGCTGC TACAGAATAG GGAGTTGGTC CCAGGCCGCC CAACCCTGGT 300 
GTGGTACCAG CCCGATGGCA CTCGGGTGGT CAGTGAGGGA CACACTTTGG AGAACTGCTG 360
CTACCAGGGA AGAGTGCGGG GATATGCAGG CTCCTGGGTG TCCATCTGCA CCTGCTCTGG 420 
GCTCAGAGGC TTGGTGGTCC TGACCCCAGA GAGAAGCTAT ACCCTGGAGC AGGGGCCTGG 480
GGACCTTCAG GGTCCTCCCA TTATTTCGCG AATCCAAGAT CTCCACCTGC CAGGCCACAC 540 
CTGTGCCCTG AGCTGGCGGG AATCTGTACA CACTCAGACG CCACCAGAGC ACCCCCTGGG 600 
ACAGCGCCAC ATTCGCCGGA GGCGGGATGT GGTAACAGAG ACCAAGACTG TGGAGTTGGT 660
GATTGTGGCT GATCACTCGG AGGCCCAGAA ATACCGGGAC TTCCAGCACC TGCTAAACCG 720 
CACACTGGAA GTGGCCCTCT TGCTGGACAC ATTCTTCCGG CCCCTGAATG TACGAGTGGC 780
ACTAGTGGGC CTGGAGGCCT GGACCCAGCG TGACCTGGTG GAGATCAGCC CAAACCCAGC 840
TGTCACCCTC GAAAACTTCC TCCACTGGCG CAGGGCACAT TTGCTGCCTC GATTGCCCCA 900 
TGACAGTGCC CAGCTGGTGA CTGGTACTTC ATTCTCTGGG CCTACGGTGG GCATGGCCAT 960 
TCAGAACTCC ATCTGTTCTC CTGACTTCTC AGGAGGTGTG AACATGGACC ACTCCACCAG 1020 
CATCCTGGGA GTCGCCTCCT CCATAGCCCA TGAGTTGGGC CACAGCCTGG GCCTGGACCA 1080 
TGATTTGCCT GGGAATAGCT GCCCCTGTCC AGGTCCAGCC CCAGCCAAGA CCTGCATCAT 1140
GGAGGCCTCC ACAGACTTCC TACCAGGCCT GAACTTCAGC AACTGCAGCC GACGGGCCCT 1200
GGAGAAAGCC CTCCTGGATG GAATGGGCAG CTGCCTCTTC GAACGGCTGC CTAGCCTACC 1260 
CCCTATGGCT GCTTTCTGCG GAAATATGTT TGTGGAGCCG GGCGAGCAGT GTGACTGTGG 1320
CTTCCTGGAT GACTGCGTCG ATCCCTGCTG TGATTCTTTG ACCTGCCAGC TGAGGCCAGG 1380 
TGCACAGTGT GCATCTGACG GACCCTGTTG TCAAAATTGC CAGCTGCGCC CGTCTGGCTG 1440 
GCAGTGTCGT CCTACCAGAG GGGATTGTGA CTTGCCTGAA TTCTGCCCAG GAGACAGCTC 1500 
CCAGTGTCCC CCTGATGTCA GCCTAGGGGA TGGCGAGCCC TGCGCTGGCG GGCAAGCTGT 1560 
GTGCATGCAC GGGCGTTGTG CCTCCTATGC CCAGCAGTGC CAGTCACTTT GGGOACCTGG 1620 
AGCCCAGCCC GCTGCGCCAC TTTGCCTCCA GACAGCTAAT ACTCGGGGAA ATGCTTTTGG 1680 
GAGCTGTGGG CGCAACCCCA GTGGCAGTTA TGTGTCCTGC ACCCCTAGAG ATGCCATTTG 1740
TGGGCAGCTC CAGTGCCAGA CAGGTAGGAC CCAGCCTCTG CTGGGCTCCA TCCGGGATCT 1800 
ACTCTGGGAG ACAATAGATG TGAATGGGAC TGAGCTGAAC TGCAGCTGGG TGCACCTGGA 1860
CCTGGGCAGT GATGTGGCCC AGCCCCTCCT GACTCTGCCT GGCACAGCCT GTGGCCCTGG 1920 
CCTGGTGTGT ATAGACCATC GATGCCAGCG TGTGGATCTC CTGGGGGCAC AGGAATGTCG 1980
AAGCAAATGC CATGGACATG GGGTCTGTGA CAGCAACAGG CACTGCTACT GTGAGGAGGG 2040 
CTGGGCACec CCTGACTGCA CCACTCAGCT CAAAGCAACC AGCTCCCTQA CCACAGGGCT 2100 
GCTCCTCAGC CTCCTGGTCT TATTGGTCCT GGTGATGCTT GGTGCCGGCT ACTGGTACCG 2160 
TGCCCGCCTG CACCAGCGAC TCTGCCAGCT CAAGGGACCC ACCTGCCAGT ACAGGGCAGC 2220 
CCAATCTGGT CCCTCTGAAC GGCCAGGACC TCCGCAGAGG GCCCTGCTGG CACGAGGCAC 2280
TAAGTCTCAG GGGCCAGCCA AGCCCCCACC CCCAAGGAAG CCACTGCCTG CCGACCCCCA 2340 
GGGCCGGTGC CCA7CGGGTG ACCTGCCCGG CCCAGGQGCT OGAATCCCGC CCCTAGTGGT 2400 
ACCCTCCAGA CCAGCGCCAC CGCCTCCGAC AGTGTCCTCG CrCTACCTCTGACCTCTCCG 2460 
GAGGTTCCGC TGCCTCCAAG CCGG ACTTAG GGCTTCAAGA GGCGGGCGTG CCCTCTGGAG 2520 
TCCCCTACCA TGACTGAAGG CGCCAGAGAC TGGCGGTGTC TTAAGACTCC GGGCACCGCC 2580 
ACGCGCTGTC AAGCAACACT CTGCGGACCT GCCGGCGTAG TTGCAGCGGG GGCTTGGGGA 2640 
GGGGCTGGGG GTTGGACGGG ATTGAGGAAG GTCCGCACAG CCTGTCTCTG CTCAGTTGCA 2700 
ATAAACGTGA CATCTTGGGA GCGTTAAAAA AAAAAAAAAA 

Seq ID NO: 112 Protein sequence:
Protein Accession #: NP_003806.2

1 11 21 31 41 51

MRLALLWALG LLGAGSPLPS WPLPNIGGTE EQQAESEKAP REPLEPQVLQ DDLPISLKKV 60
LQTSLPEPLR IKLELDGDSH ILELLQNREL VPGRPTLVWY QPDGTRVVSE GHTLENCCYQ 120
GRVRGYAGSW VSICTCSGLR GLVVLTPERS YTLEQGPGDL QGPPIISRIQ DLHLPGHTCA 180
LSWRESVHTQ TPPEHPLGQR HIRRRRDVVT ETKTVELVIV ADHSEAQKYR DFQHLLNRTL 240
EVALLLDTFF RPLNVRVALV GLEAWTQRDL VEISPNPAVT LENFLHWRRA HLLPRLPHDS 300 
AQLVTGTSFS GPTVGMAIQN SICSPDFSGG VNMDHSTSIL GVASSIAHEL GHSLGLDHDL 360
PGNSCPCPGP APAKTCIMEA STDFLPGLNF SNCSRRALEK ALLDGMGSCL FERLPSLPPM 420
AAFCGNMFVE PGEQCDCGFL DDCVDPCCDS LTCQLRPGAQ CASDGPCCQN CQLRPSGWQC 480 
RPTRGDCDLP EFCPGDSSQC PPDVSLGDGE PCAGGQAVCM HGRCASYAQQ CQSLWGPGAQ 540 
PAAPLCLQTA NTRGNAFGSC GRNPSGSYVS CTPRDAICGQ LQCQTGRTQP LLGSIRDLLW 600
ETIDVNGTEL NCSWVHLDLG SDVAQPLLTL PGTACGPGLV CIDHRCQRVD LLGAQECRSK 660
CHGHGVCDSN RHCYCEEGWA PPDCTTQLKA TSSLTTGLLL SLLVLLVLVM LGAGYWYRAR 720
LHQRLCQLKG PTCQYRAAQS GPSERPGPPQ RALLARGTKS QGPAKPPPPR KPLPADPQGR 780
CPSGDLPGPG AGIPPLVVPS RPAPPPPTVS SLYL 

Seq ID NO: 113 DNA sequence

Nucleic Acid Accession #: NM_002416

Coding sequence: 40-417 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

ATCCAATACA GGAGTGACTT GGAACTCCAT TCTATCACTA TGAAGAAAAG TGGTGTTCTT 60
TTCCTCTTGG GCATCATCTT GCTGGTTCTG ATTGGAGTGC AAGGAACCCC AGTAGTGAGA 120 
AAGGGTCGCT GTTCCTGCAT CAGCACCAAC CAAGGGACTA TCCACCTACA ATCCTTGAAA 180
GACCTTAAAC AATTTGCCCC AAGCCCTTCC TGCGAGAAAA TTCAAATCAT TGCTACACTG 240
AAGAATGGAG TTCAAACATG TCTAAACCCA GATTCAGCAG ATGTGAAGGA ACTGATOAA 300 
AAGTGGGAGA AACAGGTCAG CCAAAAGAAA AAGCAAAAGA ATGGGAAAAA ACATCAAAAA 360
aI^AAAGTTC TCaStCG AAAATCTCAA CGTTCTCGTC AAAAGAAGACTACA3MGAG 420 
ACCACTTCAC CAATAAGTAT TCTGTGTTAA AAATGTTCTA TTTTAATTAT ACCGCTATCA 480 
rrCCAAAGG A GG ATGGCATA TAATACA AAG GCTTATTAAT TTGACTAGAA AATTTAAAAC 540 
ATTACTCTGA AATTGTAACT AAAGTTAGAA AGTTGATTTT AAGAATCCAA ACGTTAAGAA 600
TTGTTAAAGG CTATGATTGT CTTTGTTCTT CTACCACCCA CCAGTTGAAT TTCATCATGC 660 
TTAAGGCCAT GATTTTAGCA ATACCCATGT CTACACAGAT GTTCACCCAA CCACATCCCA 720
CTCACAACAG CTGCCTGGAA GAGCAGCCCT AGGCTTCCAC GTACTGCAGC CTCCAGAGAG 780 
TATCTGAGGC ACATGTCAGC AAGTCCTAAG CCTGTTAGCA TGCTGGTGAG CCAAGCAGTT 840
TGAAATTGAG CTGGACCTCA CCAAGCTGCT GTGGCCATCA ACCTCTGTAT ™^TCAGC 900 
CTACAGGCCT CACACACAAT GTGTCTGAGA GATTCATGCT GATTGTTATT GGGTATCACC 960
ACTGGAGATC ACCAGTGTGT GGCTTTCAGA GCCTCCTTTC TGGCTTTGG A AGCCA^GA 1020 
TTCCATCTTG CCCGCTCAGG CTGACCACTT TATTTCTTTT TGTTCCCCTT TGCTTCATTC 1080 
AAGTCAGCTC TTCTCCATCC TACCACAATG CAGTGCCTTT CTTCTCTCCA GTGCACCTGT 1140
CATATGCTCT GATTTATCTG AGTCAACTCC TTTCTCATCT TGTCCCCAAC ACCCCACAGA 1200 
AGTGCTTTCT TCTCCCAATT CATCCTCACT CAGTCCAGCT TAGTTCAAGT CCTGCCTCTT 1260
AAATAAACCT TTTTGGACAC ACAAATTATC TTAAAACTCC TGTTTCACTT GGTTCAGTAC 1320
CACATGGGTG AACACTCAAT GGTTAACTAA TTCTTGGGTG TTTATCCTAT CTCTCCAACC 1380 
AGATTGTCAG CTCCTTGAGG GCAAGAGCCA CAGTATATTT CCCTGTTTCT TCCACAGTGC 1440 
CTAATAATAC TGTGGAACTA GGTTTTAATA ATTTTTTAAT TGATGTTGTT ATGGGCAGGA 1500
TGGCAACCAG ACCATTGTCT CAGAGCAGGT GCTGGCTCTT TCCTGGCTAC TCCATGTTGG 1560
CTAGCCTCTG GTAACCTCTT ACTTATTATC TTCAGGACAC TCACTACAGG GACCAGGGAT 1620
GATGCAACAT CCTTGTCTTT TTATG ACAGG ATGTTTGCTC AGCTTCTCCA ACAAT^GAA 1680 
GCACGTGGTA AAACACTTGC GGATATTCTG GACTGTTTTT AAAAAATATA CAGTTTACCG 1740
AAAATCATAT AATCTTACAA TGAAAAGGAC TTTATAGATC AGCCAGTGAC CAACCTTTTC 1800
CCAACCATAC AAAAATTCCT TTTCCCGAAG GAAAAGGGCT TTCTCAATAA GCCTCAGCTT 1860
TCTAAGATCT AACAAGATAG CCACCG AGAT CCTTATCG A A ACTCATTTTA GGCA^TATG 1920 
AGTTTTATTG TCCGTTTACT TGTTTCAGAG TTTGTATTGT GATTATCAAT TACCACACCA 1980 
TCTCCCATGA AGAAAGGGAA CGGTGAAGTA CTAAGCGCTA GAGGAAGCAG CC^GTCGGT 2040 
TAGTGGAAGC ATGATTGGTG CCCAGTTAGC CTCTGCAGG A TGTGGAAACC ^CTTCCAGG 2 00 
GGAGGTTCAG TGAATTGTGT AGGAGAGGTT GTCTGTGGCC AG AATTT AAA AT^ACTCA 2 1 60 
CTTTCCCAAA TTG AATCACT GCTCACACTG CTGATGATTT AGAGTGCTGT CCMGTGGAGA 2220 
TCCCACCCGA ACGTCTTATC TAATCATGAA ACTCCCTAGT TCCTTCATGT^AACTTW^G 2280 
AAAAATCTAA GTGTTTCATA AATTTGAG AG TCTGTG ACCC *™C^?™^™* 2 ™ 
GTAGACAGTA TATAACTAAC AACCAAAGAC TACATATTGT CACTG AC AC ACACGTTAT AA 240° 
TCATTTATCA TATATATACA TACATGCATA CACTCTCAAA ^^^^^^^^"^^^T^^^TJH^ *£ 
AAACAGTATT GACTTGTATA CCTTGTAATT TGAAATATTT TCTTTGTTAA AATAGAATGG 2520 
TATCAATAAA TAGACCATTA ATCAG

Seq ID NO: 114 Protein sequence:
Protein Accession #: NP_002407

1 11 21 31 41 51 

MKKSGVLFLL GIILLVLIGV QGTPVVRKGR CSCISTNQGT IHLQSLKDLK QFAPSPSCEK 60

^^GWQICUWDSA DVKELKKWE KQVSQKXKQK NGKKHQKKKV LKVRKSQRSR 120 . 
QKKTT 

Seq ID NO: 115 DNA sequence

Nucleic Acid Accession #: NM_003238.1

Coding sequence: 182-1426 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51

r"AAGCAGGAT ACGTTTTTCT GTTGGGCATT G ACTAGATTG TTTGCAAAAG TTTCGCATCA 60 
A^AA^AAAC^ ACA^Sa AAAACCAAAC A ACTCTCCTT GATCTATACT TTGAGAATTG 120 
TTGATTTCTT TTTTTTTATT CTGACTTTTA AAAACAACTT TTTTTTCCAC TTTTTTAAAA 180
AATGCACTAC TGTGTGCTGA GCGCTTTTCT GATCCTGCAT CTGGTCACGG TCGCGCTCAG 240 
CCTGTCTACC TGCAGCACAC TCGATATGGA CCAGTTCATG CGCAAGAGGA TCGAGGCGAT 300
CCGCGGGCAG ATCCTGAGCA AGCTGAAGCT CACCAGTCCC CCAGAAGACT ATCCTGAGCC 360
CGAGGAAGTC CCCCCGGAGG TGATTTCCAT CTACAACAGC ACCAGGGACT TGCTCCAGGA 420
GAAGGCGAGC CGGAGGGCGG CCGCCTGCGA GCGCGAGAGG AGCGACGAAG AGTACTACGC 480
CAAGGAGGTT TACAAAATAG ACATGCCGCC CTTCTTCCCC TCCGAAAATG CCATCCCGCC 540 
CACITTCTAC AGACCCTACT TCAGAATTGT TCGATTTGAC GTCTCAGCAA TGGAGAAGAA 600
TGCTTCCAAT TTGGTGAAAG CAGAGTTCAG AGTCTTTCGT TTGCAGAACC CAAAAGCCAG 660
AGTGCCTGAA CAACGGATTG AGCTATATCA GATTCTCAAG TCCAAAGATT TAACATCTCC 720
AACCCAGCGC TACATCGACA GCAAAGTTGT GAAAACAAGA GCAGAAGGCG AATGGCTCTC 780 
CTTCGATGTA ACTGATGCTG TTCATGAATG GCTTCACCAT AAAGACAGGA ACCTGGGATT 840
TAAAATAAGC TTACACTGTC CCTGCTGCAC TTTTGTACCA TCTAATAATT ACATCATCCC 900 
AAATAAAAGT GAAGAACTAG AAGCAAGATT TGCAGGTATT GATGGCACCT CCACATATAC 960
CAGTGGTGAT CAGAAAACTA TAAAGTCCAC TAGGAAAAAA AACAGTGGGA AGACCCCACA 1020
TCTCCTGCTA ATGTTATTGC CCTCCTACAG ACTTGAGTCA CAACAGACCA ACCGGCGGAA 1080 
GAAGCGTGCT TTGGATGCGG CCTATTGCTT TAGAAATGTG CAGGATAATT GCTGCCTACG 1140
TCCACTTTAC ATTGATTTCA AGAGGGATCT AGGGTGGAAA TGGATACACG AACCCAAAGG 1200
GTACAATGCC AACTTCTGTG CTGGAGCATG CCCGTATTTA TGGAGTTCAG ACACTCAGCA 1260 
CAGCAGGGTC CTGAGCTTAT ATAATACCAT AAATCCAGAA GCATCTGCTT CTCCTTGCTG 1320
CGTGTCCCAA GATTTAGAAC CTCTAACCAT TCTCTACTAC ATTGGCAAAA CACCCAAGAT 1380
TGAACAGCTT TCTAATATGA TTGTAAAGTC TTGCAAATGC AGCTAAAATT CTTGGAAAAG 1440
TGGCAAGACC AAAATGACAA TGATGATGAT AATGATGATG ACGACGACAA CGATGATGCT 1500 
TGTAACAAGA AAACATAAGA GAGCCTTGGT TCATCAGTGT TAAAAAATTT TTGAAAAGGC 1560
GGTACTAGTT CAGACACTTT GGAAGTTTGT GTTCTGTTTG TTAAAACTGG CATCTGACAC 1620
AAAAAAAGTT GAAGGCCTTA TTCTACATTT CACCTACTTT GTAAGTGAGA GAGACAAGAA 1680 
GCAAATTTTT TTAAA 

Seq ID NO: 116 Protein sequence:
Protein Accession #: NP_003229.1

1 11 21 31 41 51

MHYCVLSAFL ILHLVTVALS LSTCSTLDMD QFMRKRIEAI RGQILSKLKL TSPPEDYPEP 60 
EEVPPEVISI YNSTRDLLQE KASRRAAACE RERSDEEYYA KEVYKIDMPP FFPSENAIPP 120
TFYRPYFRIV RFDVSAMEKN ASNLVKAEFR VFRLQNPKAR VPEQRIELYQ ILKSKDLTSP 180
TQRYIDSKVV KTRAEGEWLS FDVTDAVHEW LHHKDRNLGF KISLHCPCCT FVPSNNYIIP 240
NKSEELEARF AGIDGTSTYT SGDQKTIKST RKKNSGKTPH LLLMLLPSYR LESQQTNRRK 300
KRALDAAYCF RNVQDNCCLR PLYIDFKRDL GWKWIHEPKG YNANFCAGAC PYLWSSDTQH 360 
SRVLSLYNTI NPEASASPCC VSQDLEPLTI LYYIGKTPKI EQLSNMIVKS CKCS

Seq ID NO: 117 DNA sequence

Nucleic Acid Accession #: NM_000095.1

Coding sequence: 26-2299 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

CAGCACCCAG CTCCCCGCCA CCGCCATGGT CCCCGACACC GCCTGCGTTC TTCTGCTCAC 60
CCTGGCTGCC CTCGGCGCGT CCGGACAGGG CCAGAGCCCG TTGGGCTCAG ACCTGGGCCC 120
GCAGATGCTT CGGGAACTGC AGGAAACCAA CGCGGCGCTG CAGGACGTGC GGGACTGGCT 180
GCGGCAGCAG GTCAGGGAGA TCACGTTCCT GAAAAACACG GTGATGGAGT GTGACGCGTG 240
CGGGATGCAG CAGTCAGTAC GCACCGGCCT ACCCAGCGTG CGGCCCCTGC TCCACTGCGC 300 
GCCCGGCTTC TGCTTCCCCG GCGTGGCCTG CATCCAGACG GAGAGCGGCG GCCGCTGCGG 360 
CCCCTGCCCC GCGGGCTTCA CGGGCAACGG CTCGCACTGC ACCGACGTCA ACGAGTGCAA 420 
CGCCCACCCC TGCTTCCCCC GAGTCCGCTG TATCAACACC AGCCCGGGGT TCCGCTGCGA 480
GGCTTGCCCG CCGGGGTACA GCGGCCCCAC CCACCAGGGC GTGGGGCTGG CTTTCGCCAA 540 
GGCCAACAAG CAGGTTTGCA CGGACATCAA CGAGTGTGAG ACCGGGCAAC ATAACTGCGT 600 
CCCCAACTCC GTGTGCATCA ACACCCGGGG CTCCTTCCAO TGCGGCCCGT GCCAGCCCGO 660 
CTTCGTGGGC GACCAGGCGT CCGGCTGCCA GCGCGGCGCA CAGCGCTTCT GCCCCGACGG 720 
CTCGCCCAGC GAGTGCCACG AGCATGCAGA CTGCGTCCTA GAGCGCGATG GCTCGCGGTC 780
GTGCGTGTGT CGCGTTGGCT GGGCCGGCAA CGGGATCCTC TGTGGTCGCG ACACTGACCT 840
AOACGGCTTC CCGGACGAGA AGCTGCGCTG CCCGGAGCCG CAGTGCCGTA AGGACAACTG 900 
CGTGACTGTG CCCAACTCAG GGCAGGAGGA TGTGGACCGC GATGGCATCG GAGACGCCTG 960
CGATCCGGAT GCCGACGGGG ACGGGGTCCC CAATGAAAAG GACAACTGCC CGCTGGTGCG 1020
GAACCCAGAC CAGCGCAACA CGGACGAGGA CAAGTGGGGC GATGCGTGCG ACAACTGCCG 1080
GTCCCAGAAG AACGACGACC AAAAGGACAC AGACCAGGAC GGCCGGGGCG ATGCGTGCGA 1140 
CGACGACATC GACGGCGACC GGATCCGCAA CCAGGCCGAC AACTGCCCTA GGGTACCCAA 1200
CTCAGACCAG AAGGACAGTG ATGGCGATGG TATAGGGGAT GCCTGTGACA ACTGTCCCCA 1260
GAAGAGCAAC CCGGATCAGG CGGATGTGGA CCACGACTTT GTGGGAGATG CTTGTGACAG 1320 
CGATCAAGAC CAGGATGGAG ACGGACATCA GGACTCTCGQ GACAACTGTC CCACGGTGCC 1380
TAACAGTGCC CAGGAGGACT CAGACCACGA TGGCCAGGGT GATGCCTGCG ACGACGACGA 1440
CGACAATGAC GGAGTCCCTG ACAGTCGGGA CAACTGCCGC CTGGTGCCTA ACCCCGGCCA 1500
GGAGGACGCG GACAGGGACG GCGTGGGCGA CGTGTGCCAG GACGACTTTG ATGCAGACAA 1560
GGTGGTAGAC AAGATCGACG TGTGTCCGGA GAACGCTGAA GTCACGCTCA CCGACTTCAG 1620
GGCCTTCCAG ACAGTCGTGC TGGACCCGGA GGGTGACGCG CAGATTGACC CCAACTGGGT 1680
GGTGCTCAAC CAGGGAAGGG AGATCGTGCA GACAATGAAC AGCGACCCAG GCCTGGCTGT 1740 
GGGTTACACT GCCTTCAATG GCGTGGACTT CGAGGGCACG TTCCATGTGA ACACGGTCAC 1800 
GGATGACGAC TATGCGGGCT TCATCTTTGG CTACCAGGAC AGCTCCAGCT TCTACGTGGT 1860
CATGTGGAAG CAGATGGAGC AAACGTATTG GCAGGCGAAC CCCTTCCGTG CTGTGGCCGA 1920 
GCCTGGCATC CAACTCAAGG CTGTGAAGTC TTCCACAGGC CCCGGGGAAC AGCTGCGGAA 1980 
CGCTCTGTGG CATACAGGAG ACACAGAGTC CCAGGTGCGG CTGCTGTGGA AGGACCCGCG 2040 
AAACGTGGGT TGGAAGGACA AGAAGTCCTA TCGTTGGTTC CTGCAGCACC GGCCCCAAGT 2100 
GGGCTACATC AGGGTGCGAT TCTATGAGGG CCCTGAGCTG GTGGCCGACA GCAACGTGGT 2160 
CTTGGACACA ACCATGCGGG GTGGCCGCCT GGGGGTCTTC TGCTTCTCCC AGGAGAACAT 2220 
CATCTGGGCC AACCTGCGTT ACCGCTGCAA TGACACCATC CCAGAGGACT ATGAGACCCA 2280 
TCAGCTGCGG CAAGCCTAGG GACCAGGGTG AGGACCCGCC GGATGACAGC CACCCTCACC 2340
GCGGCTGGAT GGGGGCTCTG CACCCAGCCC AAGGGGTGGC CGTCCTGAGG GGGAAGTGAG 2400 
AAGGGCTCAG AGAGGACAAA ATAAAGTGTG TGTGCAGGG 



Seq ID NO: 118 Protein sequence:
Protein Accession #: NP_000086.1



1 11 21 31 41 51

MVPDTACVLL LTLAALGASG QGQSPLGSDL GPQMLRELQE TNAALQDVRD WLRQQVREIT 60
FLKNTVMECD ACGMQQSVRT GLPSVRPLLH CAPGFCFPGV ACIQTESGGR CGPCPAGFTG 120
NGSHCTDVNE CNAHPCFPRV RCINTSPGFR CEACPPGYSG PTHQGVGLAF AKANKQVCTD 180
INECETGQHN CVPNSVCINT RGSFQCGPCQ PGFVGDQASG CQRGAQRFCP DGSPSECHEH 240
ADCVLERDGS RSCVCRVGWA GNGILCGRDT DLDGFPDEKL RCPEPQCRKD NCVTVPNSGQ 300 
EDVDRDGIGD ACDPDADGDG VPNEKDNCPL VRNPDQRNTD EDKWGDACDN CRSQKNDDQK 360
DTDQDGRGDA CDDDIDGDRI RNQADNCPRV PNSDQKDSDG DGIGDACDNC PQKSNPDQAD 420 
VDHDFVGDAC DSDQDQDGDG HQDSRDNCPT VPNSAQEDSD HDGQGDACDD DDDNDGVPDS 480 
RDNCRLVPNP GQEDADRDGV GDVCQDDFDA DKVVDKIDVC PENAEVTLTD FRAFQTVVLD 540 
PEGDAQIDPN WVVLNQGREI VQTMNSDPGL AVGYTAFNGV DFEGTFHVNT VTDDDYAGFI 600 
FGYQDSSSFY VVMWKQMEQT YWQANPFRAV AEPGIQLKAV KSSTGPGEQL RNALWHTGDT 660 
ESQVRLLWKD PRNVGWKDKK SYRWFLQHRP QVGYIRVRFY EGPELVADSN VVLDTTMRGG 720
RLGVFCFSQE NIIWANLRYR CNDTIPEDYE THQLRQA

Seq ID NO: 119 DNA sequence

Nucleic Acid Accession #: NM_014211

Coding sequence: 157-1479 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51

GGGACAGGGC TGAGGATGAG GAGAACCCTG GGGACCCAGA AGACCGTGCC TTGCCCGGAA 60 
GTCCTGCCTG TAGGCCTGAA GGACTTGCCC TAACAGAGCC TCAACAACTA CCTGGTGATT 120
CCTACTTCAG CCCCTTGGTG TGAGCAGCTT CTCAACATGA ACTACAGCCT CCACTTGGCC 180
TTCGTGTGTC TGAGTCTCTT CACTGAGAGG ATGTGCATCC AGGGGAGTCA GTTCAACGTC 240 
GAGGTCGGCA GAAGTGACAA GCTTTCCCTG CCTGGCTTTG AGAACCTCAC AGCAGGATAT 300 
AACAAATTTC TCAGGCCCAA TTTTGGTGGA GAACCCGTAC AGATAGCGCT GACTCTGGAC 360 
ATTGCAAGTA TCTCTAGCAT TTCAGAGAGT AACATGGACT ACACAGCCAC CATATACCTC 420
CGACAGCGCT GGATGGACCA GCGGCTGGTG TTTGAAGGCA ACAAGAGCTT CACTCTGGAT 480
GCCCGCCTCG TGGAGTTCCT CTGGGTGCCA GATACTTACA TTGTGGAGTC CAAGAAGTCC 540 
TTCCTCCATG AAGTCACTGT GGGAAACAGG CTCATCCGCC TCTTCTCCAA TGGCACGGTC 600
CTGTATGCCC TCAGAATCAC GACAACTGTT GCATGTAACA TGGATCTGTC TAAATACCCC 660
ATGGACACAC AGACATGCAA GTTGCAGCTG GAAAGCTGGG GCTATGATGG AAATGATGTG 720
GAGTTCACCT GGCTGAGAGG GAACGACTCT GTGCGTGGAC TGGAACACCT GCGGCTTGCT 780
CAGTACACCA TAGAGCGGTA TTTCACCTTA GTCACCAGAT CGCAGCAGGA GACAGGAAAT 840
TACACTAGAT TGGTCTTACA GTTTGAGCTT CGGAGGAATG TTCTGTATTT CATTTTGGAA 900
ACCTACGTTC CTTCCACTTT CCTGGTGGTG TTGTCCTGGG TTTCATTTTG GATCTCTCTC 960 
GATTCAGTCC CTGCAAGAAC CTGCATTGGA GTGACGACCG TGTTATCAAT GACCACACTG 1020 
ATGATCGGGT CCCGCACTTC TCTTCCCAAC ACCAACTGCT TCATCAAGGC CATCGATGTG 1080
TACCTGGGGA TCTGCTTTAG CTTTGTGTTT GGGGCCTTGC TAGAATATGC AGTTGCTCAC 1140
TACAGTTCCT TACAGCAGAT GGCAGCCAAA GATAGGGGGA CAACAAAGGA AGTAGAAGAA 1200
GTCAGTATTA CTAATATCAT CAACAGCTCC ATCTCCAGCT TTAAACGGAA GATCAGCTTT 1260 
GCCAGCATTG AAATTTCCAG CGACAACGTT GACTACAGTG ACTTGACAAT GAAAACCAGC 1320 
GACAAGTTCA AGTTTGTCTT CCGAGAAAAG ATGGGCAGGA TTGTTGATTA TTTCACAATT 1380 
CAAAACCCCA GTAATGTTGA TCACTATTCC AAACTACTGT TTCCTTTGAT TTTTATGCTA 1440
GCCAATGTAT TTTACTGGGC ATACTACATG TATTTTTGAG TCAATGTTAA ATTTCTTGCA 1500 
TGCCATAGGT CTTCAACAGG ACAAGATAAT GATGTAAATG GTATTTTAGG CCAAGTGTGC 1560 
ACCCACATCC AATGGTGCTA CAAGTGACTG AAATAATATT TGAGTCTTTC TGCTCAAAGA 1620
ATGAAGCTCC AACCATTGTT CTAAGCTGTG TAGAAGTCCT AGCATTATAG GATCTTGTAA 1680 
TAGAAACATC AGTCCATTCC TCTTTCATCT TAATCAAGGA CATTCCCATG GAGCCCAAGA 1740
TTACAAATGT ACTCAGGGCT GTTTATTCGG TGGCTCCCTG GTTTGCATTT ACCTCATATA 1800 
AAG AATGGGA AGGAGACCAT TGGGTAACCC TCAAGTGTCA GAAGTTCTTT ^^GTAAC 1860 
TATACATGTT TTTTACTAAA TCTCTGCAGT GCTTATAAAA TACATTGTTO CCTATTTAGG 1920 
GACTAACATT TTCTAGTTTT TGTTTCTGGT TAAAATGAAA TATGGGCTTA TGTCAATTCA 1980

t^a^ctS IIgcact^ 2 «0 

ATACCACAAC AGAATTATCC CCAATTTCCA ATAAGTCCTA TCATTGAAAA TTCAAATATA 2100 
ACTG^G^/S^GTAG ^T^CMTC TAAACAAATC (XTC^GTTCT AAG ATACAAT 2160 
G^T^C^T^ A7GG ACVCTGAGGC TTTATTCCCC ^CTATCCAT ^TCTTATCAT 2220 
^at-tTtta TACArACATC CATCCTAAAC TATACTAAAG CCCTTTTCCC ATGCATGGAT 2280 
G^T^A^GA^T^ 2340 

aIg^ca^^ 2400 
AAAGTACTGG GTTGACTCAG AGAGTCGCTG TCATTCTGTC ATTGCTGCTA CTCTAACACT 2460
GAGCAACACT CTCCCAGTGG CAOATCCCCT GTATCATTCC AAGAGGAGCA TTCATCCCTT 2520
TGCTCTAATG ATCAGGAATG ATGCTTATTA GAAAACAAAC TGCTTGACCC AGGAACAAGT 2580 
GGCTTAGCTT AAGTAAACTT GGCTTTGCTC AGATCCCTGA TCCTTCCAGC TGGTCTGCTC 2640 
TGAGTGGCTT ATCCCGCATG AGCAGGAGCG TGCTGGCCCT GAGTACTGAA CTTTCTGAGT 2700
AACAATGAGA CACGTTACAG AACCTATGTT CAGGTTGCGG GTGAGCTGCC CTCTCCAAAT 2760
CCAGCCAGAG ATGCACATTC CTCGGCCAGT CTCAGCCAAC AGTACCAAAA GTGATTTTTG 2820 
AGTGTGCCAG GGTAAAGGCT TCCAGTTCAG CCTCAGTTAT TlTAGACAAT CrCGCCATC I 2880 
TTAATTTCTT AGCTTCCTGT TCTAATAAAT GCACGGCTTT ACCTTTCCTG TCAGAAATAA 2940
ACCAAGGCTC TAAAAGATGA TTTCCCTTCT GTAACTCCCT AGAGCCACAG OTTCTCATTC 3000
CTTTTCCCAT TATACT7CTC ACAATTCAGT TTCTATGAGT TTGATCACCT GAIMIllTA 3060 
ACAAAATATT TCTAACGGGA ATGGGTGGGA GTGCTGGTGA AAAGAGATGA AATGTGGTTG 3120
TATGAGCCAA TCATATTTGT GATTTTTTAA AAAAAGTTTA AAAGGAAATA TCTGTTCTGA 3180 
AACCCCACTT AAGCATTGTT TTTATATAAA AACAATGATA AAGATGTGAA CTGTGAAATA 3240 
AATATACCAT ATTAGCTACC CACC

Seq ID NO: 120 Protein sequence: 
Protein Accession #: NP_055026.1

1 11 21 31 41 51

MNYSLHLAFV CLSLFTERMC IQGSQFNVEV GRSDKLSLPG FENLTAGYNK FLRPNFGGEP 60
VQIALTLDIA SISSISESNM DYTATIYLRQ RWMDQRLVFE GNKSFTLDAR LVEFLWVPDT 120
YIVESKKSFL HEVTVGNRLI RLFSNGTVLY ALRITTTVAC NMDLSKYPMD TQTCKLQLES 180
WGYDGNDVEF TWLRGNDSVR GLEHLRLAQY TIERYFTLVT RSQQETGNYT RLVLQFELRR 240
NVLYFILETY VPSTFLVVLS WVSFWISLDS VPARTCIGVT TVLSMTTLMI GSRTSLPNTN 300
CFIKAIDVYL GICFSFVFGA LLEYAVAHYS SLQQMAAKDR GTTKEVEEVS ITNIINSSIS 360
SFKRKISFAS IEISSDNVDY SDLTMKTSDK FKFVFREKMG RIVDYFTIQN PSNVDHYSKL 420
LFPLIFMLAN VFYWAYYMYF 


Seq ID NO: 121 DNA sequence

Nucleic Acid Accession #: NM_001854

Coding sequence: 163-5582 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51

AACCATCAAA TTTAGAAGAA AAAGCCCTTT GACTTTTTCC CCCTCTCCCT CCCCAATGGC 60
TGTGTAGCAA ACATCCCTGG CGATACCTTG GAAAGGACGA AGTTGGTCTG CAGTCGCAAT 120 
TTCGTGGGTT GAGTTCACAG TTGTGAGTGC GGGGCTCGGA GATGGAGCCG TGGTCCTCTA 180 
GGTGGAAAAC GAAACGGTGG CTCTGGGATT TCACOGTAAC AACCCTCGCA TTGACCTTCC 240
TCTTCCAAGC TAGAGAGGTC AGAGGAGCTG CTCCAGTTGA TGTACTAAAA GCACTAGATT 300
TTCACAATTC TCCAGAGGGA ATATCAAAAA CAACGGGATT TTGCACAAAC AGAAAGAATT 360 
CTAAAGGCTC AGATACTGCT TACAGAGTTT CAAAGCAAGC ACAACTCAGT GCCCCAACAA 420
AACAGTTATT TCCAGGTGGA ACTTTCCCAG AAGACTTTTC AATACTATTT ACAGTAAAAC 480 
CAAAAAAAGG AATTCAGTCT TTCCTTTTAT CTATATATAA TGAGCATGGT ATTCAGCAAA 540
TTGGTGTTGA GGTTGGGAGA TCACCTGTTT TTCTGTTTGA AGACCACACT GGAAAACCTG 600
CCCCAGAAGA CTATCCCCTC TTCAGAACTG TTAACATCGC TGACGGGAAG TGGCATCGGG 660
TAGCAATCAG CGTGGAGAAG AAAACTGTGA CAATGATTGT TGATTGTAAG AAGAAAACCA 720
CGAAACCACT TGATAGAAGT GAGAGAGCAA TTGTTGATAC CAATGGAATC ACGGTTTTTG 780
GAACAAGGAT TTTGGATGAA GAAGTTTTTG AGGGGGACAT TCAGCAGTTT TTGATCACAG 840
GTGATCCCAA GGCAGCATAT GACTACTGTG AGCATTATAG TCCAGACTGT GACTCTTCAG 900
CACCCAAGGC TGCTCAAGCT CAGGAACCTC AGATAGATGA GTATGCACCA GAGGATATAA 960
TCGAATATGA CTATGAGTAT GGGGAAGCAG AGTATAAAGA GGCTGAAAGT GTAACAGAGG 1020 
GACCCACTGT AACTGAGGAG ACAATAGCAC AGACGGAGGC AAACATCGTT GATGATTTTC 1080 
AAGAATACAA CTATGGAACA ATGGAAAGTT ACCAGACAGA AGCTCCTAGG CATGTTTCTG 1140
GGACAAATGA GCCAAATCCA GTTGAAGAAA TATTTACTGA AGAATATCTA ACGGGAGAGG 1200
ATTATGATTC CCAGAGGAAA AATTCTGAGG ATACACTATA TGAAAACAAA GAAATAGACG 1260 
GCAGGGATTC TGATCTTCTG GTAGATGGAG ATTTAGGCGA ATATGATTTT TATGAATATA 1320 
AAGAATATGA AGATAAACCA ACAAGCCCCC CTAATGAAGA ATTTGGTCCA GGTGTACCAG 1380
CAGAAACTGA TATTACAGAA ACAAGCATAA ATGGCCATGG TGCATATGGA GAGAAAGGAC 1440
AGAAAGGAGA ACCAGCAGTG GTTGAGCCTG GTATGCTTGT CGAAGGACCA CCAGGACCAG 1500
CAGGACCTGC AGGTATTATG GGTCCTCCAG GTCTACAAGG CCCCACTGGA CCCCCTGGTG 1560
ACCCTGGCGA TAGGGGCCCC CCAGGACGTC CTGGCTTACC AGGGGCTGAT GGTCTACCTG 1620 
GTCCTCCTGG TACTATGTTG ATGTTACCGT TCCGTTATGO TGGTGATGGT TCCAAAGGAC 1680 
CAACCATCTC TGCTCAGGAA GCTCAGGCTC AAGCTATTCT TCAGCAGGCT CGGATTGCTC 1740
TGAGAGGCCC ACCTGGCCCA ATGGGTCTAA CTGGAAGACC AGGTCCTGTG GGGGGGCCTG 1800 
GTTCATCTGG GGCCAAAGGT GAGAGTGGTG ATCCAGGTCC TCAGGGCCCT CGAGGCGTCC 1860 
AGGGTCCCCC TGGTCCAACG GGAAAACCTG GAAAAAGGGG TCGTCCAGGT GCAGATGGAG 1920
GAAGAGGAAT GCCAGGAGAA CCTGGGGCAA AGGGAGATCG AGGGTTTGAT GGACTTCCGG 1980
GTCTGCCAGG TGACAAAGGT CACAGGGGTG AACGAGGTCC TCAAGGTCCT CCAGGTCCTC 2040
CTGGTGATGA TGGAATGAGG GGAGAAGATG GAGAAATTGG ACCAAGAGGT CTTCCAGGTG 2100
AAGCTGGCCC ACGAGGTTTG CTGGGTCCAA GGGGAACTCC AGGAGCTCCA GGGCAGCCTG 2160 
GTATGGCAGG TGTAGATGGC CCCCCAGGAC CAAAAGGGAA CATGGGTCCC CAAGGGGAGC 2220
CTGGGCCTCC AGGTCAACAA GGGAATCCAG GACCTCAGGG TCTTCCTGGT CCACAAGGTC 2280
CAATTGGTCC TCCTGGTGAA AAAGGACCAC AAGGAAAACC AGGACTTGCT GGACTTCCTG 2340
GTGCTGATGG GCCTCCTGGT CATCCTGGGA AAGAAGGCCA GTCTGGAGAA AAGGGGGCTC 2400
TGGGTCCCCC TGGTCCACAA GGTCCTATTG GATNNCCGGG CCCCCGGGGA GTAAAGGGAG 2460 
CAGATGGTGT CAGAGGTCTC AAGGGATCTA AAGGTGAAAA GGGTGAAGAT GGTTTTCCAG 2520
GATTCAAAGG TGACATGGGT CTAAAAGGTG ACAGAGGAGA ^™™ A A ™CK^ *f *° n 
GAGGGNAAGA TGGCCCTGAA GGACCCAAAG GTCGAGCAGG <XX^<£*™^^ }™ 
CTTCAGGTCA AGCAGGAGAA AAGGGAAAAC TTGGAGTTCC AGGATTACCA GGATATCCAG 2700
GAAGACAAGG TCCAAAGGGT TCCACTGGAT TCCCTGGGTT TCCAGGTGCC AATGG/^AGA 2760 
AAGGTGCACG GGGAGTAGCT GGCAAACCAG GCCCTCGGGG TCAGCGTGGT CCAACGGGTC 2820
CTCGAGGTTC AAGAGGTGCA AGAGGTCCCA CTGGGAAACC TGGGCCAAAG GGCACTTCAG 2880
GTGGCGATGG CCCTCCTGGC CCTCCAGGTG AAAGAGGTCC TCAAGGACCT CAGGGTCCAG 2940 
TTGGATTCCC TGGACCAAAA GGCCCTCCTG GACCACCAGG AAGGATGGGC TGCCCAGGAC 3000 
ACCCTGGGCA ACGTGGGG AG ACTGGATTTC AAGGCAAGAC CGGCCCTCCT GGGCC^GGGG 3060 
GAGTGGTTGG ACCACAGGG A CCAACCGGTG AGACTGGTCC AATAGGGGAA CX3TGGGTATC 3120 
CTGGTCCTCC TGGCCCTCCT GGTGAGCAAG GTCTTCCTGG TGCTGCAGGA AAAGAAGGTG 3180
CAAAGGGTGA TCCAGGTCCT CAAGGTATCT CAGGGAAAGA TGGACCAGCA GGATTACGTG 3240 
GTTTCCCAGG GGAAAGAGGT CTTCCTGGAG CTCAGGGTGC ACCTGGACTG AAAGGAGGGG 3300 
AAGGTCCCCA GGGCCCACCA GGTCCAGTTG GCTCACCAGG AGAACGTGGG TCAGCAGGTA 3360
CAGCTGGCCC AATTGGTTTA CGAGGGCGCC CGGGACCTCA GGGTCCTCCT GGTCCAGCTC 3420 
GAGAGAAAGG TGCTCCTGGA GAAAAAGGTC CCCAAGGGCC TGCAGGGAGA GATGGAGTTC 3480 
AAGGTCCTGT TGGTCTCCCA GGGCCAGCTG GTCCTGCCGG CTCCCCTGGG GAAGACGGAG 3540 
ACAAGGGTGA AATTGGTGAG CCGGGACAAA AAGGCAGCAA GGGTGGCAAG GGAGAAAATG 3600 
GCCCTCCCGG TCCCCCAGGT CTTCAAGGAC CAGTTGGTGC CCCTGGAATT GCTGGAGGTa 3660 
ATGGTG AACC AGGTCCTAGA GGACAGCAGG GGATGTTTGG GCAAAAAGGT GATGAGGGTG 3720 
CCAGAGGCTT CCCTGGACCT CCTGGTCCAA TAGGTCTTCA GGGTCTGCC^ GGCCCACCTG 3780 
GTGAAAAAGG TGAAAATGGG GATGTTGGTC CATGGGGGCC ACCTGGTCCT CCAGGCCCAA 3840 
GAGGCCCTCA AGGTCCCAAT GGAGCTGATG GACCACAAGG AOXXXAGGT TCTGTOOT 3900 
CAGTTGGTGC TGTTGGAGAA AAGGGTG AAC CTGG AG AAGC AGGAAACCCA GGGOTCCTG 3960 
GGGAAGCAGG TGTAGGCGGT CCCAAAGGAG AAAGAGGAGA GAAAGGGGAA GCTGGTCCAC 4020
CTGGAGCTGC TGGACCTCCA GGTGCCAAGG GGCCGCCAGG TGATGATGGC CCTAAGGGTA 4080 
ACCCGGGTCC TGTTGGTTTT CCTGGAGATC CTGGTCCTCC TGGGGAACTT GG(XCTGCAG4 140 
GTCAAGATGG TGTTGGTGGT GACAAGGGTG AAGATGG AGA TCCTG GTC AA ODGGGTCCTC 4200 
CTGGCCCATC TGGTGAGGCT GGCCCACCAG GTCCTCCTGG AAAACGAGGT CCTCCTGGAG 4260 
CTGCAGGTGC AGAGGGAAGA CAAGGTGAAA AAGGTGCTAA GGGGGAAGCA GGTGCAGAAG 4320
GTCCTCCTGG AAAAACCGGC CCAGTCGGTC CTCAGGGACC TGCAGGAAAG CCTGGTCCAG 4380 
AAGGTCTTCG GGGCATCCCT GGTCCTGTGG GAGAACAAGG TCTCCCTGGA GCTGCAGGCC 4440 
AAGATGGACC ACCTGGTCCT ATGGG ACCTC CTGGCTTACC TGGTCTCAAA G^TGACCCTG 4500 
GCTCCAAGGG TGAAAAGGGA CATCCTGGTT TAATTGGCCT GATTGGTCCT CCAGGAGAAC 4560 

ATGGGGGAAT TCCTGGTCCT GCTGGTCCCT TAGGTCCACC TGGTCCTCCA GGCTTACCAG 4680
GTGCTCAAGG CCCAAAGGGT AACAAAGGCT CTACTGG ACC CGCTGGCCAG AAAG^TGACA 4740 
GTGGTCTTCC AGGGCCTCCT GGGCCTCCAG GTCCACCTGG TGAAGTCATT fAGC^AC 4800 
CAATCTTGTC CTCCAAAAAA ACG AGAAG AC ATACTG AAGG CATGCAAGCA GAT^GATG 4860 
ATAATATTCT TGATTACTCG GATGGAATGG AAGAAATATT TGGTTCCCTC AATTCCCTGA 4920
A^C^AG^SaVcGAGCATATG AAATTTCCAA TGGGTACTCA GACCAATCCA G^A^ 4980 
GTAAAGACCT GCAACTCAGC CATCCTGACT TCCCAG ATGG ™ 
ACCAAGGTTG CTCAGGAGAT TCCTTCAAAG TTTACTGTAA TTTCACATCT GGTGGTGAGA 5100
CTTGCATTTA TCCAGACAAA AAATCTGAGG GAGTAAGAAT TTCATCATGG CCAAAGGAGA 5160
AACCAGGAAG TTGGTTTAGT GAATTTAAGA GGGGAAAACT GCTTTCATAC TTAGATGTTG 5220
AAGG AAATTC CATC AATATG GTGCAAATGA CATTCCTGAA 5280 
GGCAAAATTT CACCTACCAC TGTCATCAGT CAGCAGCCTG GTA^^^I.^ 0 ^ 
GTTATGACAA AGCACTTCGC TTCCTGGGAT CAAATGATGA GGAGATGTCC TATGACAATA 5400
ATCCTTTTAT CAAAACACTG TATGATGGTT GTACGTCCAG A^GGCTAT GA^^CTG 5460 
TCATTGAAAT CAATACACCA AAAATTGATC AAGTACCTAT TCTTGATGTC^^AGTG 5520 
ACTTTGGTG A TCAGAATCAG AAGTTCGG AT TTGAAGTTGG TCCTOrTTGT T^™^™ 
AAGATTAAGA CAAAGAACAT ATCAAATCAA CAGAAAATGT ' AC^GTGC L^^CCA 5640 
TTTTGTGCCA CATGCAAGTT TTGAATAAGG ATGTATGGAA ^ A ^^ A C ^^^^ 5 ™ 
TACCATTTAG GAAATACCGA TGCCTTTGTG GGGGCAGAAT CA^ACAAA AGCTTJGAAA 5760 
ATCATAAAGA TATAAGTTGG TGTGGCTAAG ^GGAAACAG ^^^^JT^^^^^^^^q 
TTCTCAACTC TCCTTTTCCT ATTTGAATTT CTTTGGTGCT GTAGA^CA AAA^AGAAA 5880 
AATATATATT CATAA AAA AT ATGGTGCTCA TTCTCATCCA ^ C ^GATCT ACTAAAACAG 5940 
TGTGTTTAAT AAATTGTAAT TATTTTGTGT ACAGTTCTAT ^JJ^^^^r^^ 
CCAAAACTTG CACGTGTCCC TGAATTCCGC TG ACTCTAAT TTATGAGG ^ <3^A^CT 6060 
GATGGCAATA ATATATGTAT TATGAAAATG AAGTTATGAT TTCCGATGAC CCTAAGTCCC 6120
TTTCTTTGGT TAATGATGAA ATTCCTTTGT GTGTGTTT



Seq ID NO: 122 Protein sequence:
Protein Accession #: NP_001845



1 11 21 31 41 51

MEPWSSRWKT KRWLWDFTVT TLALTFLFQA REVRGAAPVD VLKALDFHNS PEGISKTTGF 60
CTNRKNSKGS DTAYRVSKQA QLSAPTKQLF PGGTFPEDFS ILFTVKPKKG IQSFLLSIYN 120 
EHGIQQIGVE VGRSPVFLFE DHTGKPAPED YPLFRTVNIA DGKWHRVAIS VEKKTVTMIV 180
DCKKKTTKPL DRSERAIVDT NGITVFGTRI LDEEVFEGDI QQFLITGDPK AAYDYCEHYS 240
PDCDSSAPKA AQAQEPQIDE YAPEDIIEYD YEYGEAEYKE AESVTEGPTV TEETIAQTEA 300
NIVDDFQEYN YGTMESYQTE APRHVSGTNE PNPVEEIFTE EYLTGEDYDS QRKNSEDTLY 360
ENKEIDGRDS DLLVDGDLGE YDFYEYKEYE DKPTSPPNEE FGPGVPAETD ITETSINGHG 420
AYGEKGQKGE PAVVEPGMLV EGPPGPAGPA GIMGPPGLQG PTGPPGDPGD RGPPGRPGLP 480
GADGLPGPPG TMLMLPFRYG GDGSKGPTIS AQEAQAQAIL QQARIALRGP PGPMGLTGRP 540
GPVGGPGSSG AKGESGDPGP QGPRGVQGPP GPTGKPGKRG RPGADGGRGM PGEPGAKGDR 600 
GFDGLPGLPG DKGHRGERGP QGPPGPPGDD GMRGEDGEIG PRGLPGEAGP RGLLGPRGTP 660 
GAPGQPGMAG VDGPPGPKGN MGPQGEPGPP GQQGNPGPQG LPGPQGPIGP PGEKGPQGKP 720
GLAGLPGADG PPGHPGKEGQ SGEKGALGPP GPQGPIGXPG PRGVKGADGV RGLKGSKGEK 780 
GEDGFPGFKG DMGLKGDRGE VGQIGPRGXD GPEGPKGRAG PTGDPGPSGQ AGEKGKLGVP 840
GLPGYPGRQG PKGSTGFPGF PGANGEKGAR GVAGKPGPRG QRGPTGPRGS RGARGPTGKP 900 
GPKGTSGGDG PPGPPGERGP QGPQGPVGFP GPKGPPGPPG RMGCPGHPGQ RGETGFQGKT 960 
GPPGPGGVVG PQGPTGETGP IGERGYPGPP GPPGEQGLPG AAGKEGAKGD PGPQGISGKD 1020
GPAGLRGFPG ERGLPGAQGA PGLKGGEGPQ GPPGPVGSPG ERGSAGTAGP IGLRGRPGPQ 1080
GPPGPAGEKG APGEKGPQGP AGRDGVQGPV GLPGPAGPAG SPGEDGDKGE IGEPGQKGSK 1140
GGKGENGPPG PPGLQGPVGA PGIAGGDGEP GPRGQQGMFG QKGDEGARGF PGPPGPIGLQ 1200 
GLPGPPGEKG ENGDVGPWGP PGPPGPRGPQ GPNGADGPQG PPGSVGSVGG VGEKGEPGEA 1260 
GNPGPPGEAG VGGPKGERGE KGEAGPPGAA GPPGAKGPPG DDGPKGNPGP VGFPGDPGPP 1320 
GELGPAGQDG VGGDKGEDGD PGQPGPPGPS GEAGPPGPPG KRGPPGAAGA EGRQGEKGAK 1380 
GEAGAEGPPG KTGPVGPQGP AGKPGPEGLR GIPGPVGEQG LPGAAGQDGP PGPMGPPGLP 1440
GLKGDPGSKG EKGHPGLIGL IGPPGEQGEK GDRGLPGTQG SPGAKGDGGI PGPAGPLGPP 1500
GPPGLPGPQG PKGNKGSTGP AGQKGDSGLP GPPGPPGPPG EVIQPLPILS SKKTRRHTEG 1560 
MQADADDNIL DYSDGMEEIF GSLNSLKQDI EHMKFPMGTQ TNPARTCKDL QLSHPDFPDG 1620 
EYWIDPNQGC SGDSFKVYCN FTSGGETCIY PDKKSEGVRI SSWPKEKPGS WFSEFKRGKL 1680
LSYLDVEGNS INMVQMTFLK LLTASARQNF TYHCHQSAAW YDVSSGSYDK ALRFLGSNDE 1740
EMSYDNNPFI KTLYDGCTSR KGYEKTVIEI NTPKIDQVPI VDVMISDFGD QNQKFGFEVG 1800
PVCFLG 

Seq ID NO: 123 DNA sequence 

Nucleic Acid Accession #: NM_015886

Coding sequence: 485-1261 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

GAATTCCCCC CCCCCCCCCC TCACTTGGTG TGTCTATATG TCTGGCAGAC ATTATCAGCA 60 
CATTCTCTGT TGTTACCTGT GATTCATTTT TTCTTCACTC TCCAGGTGAA TTTCAATTGC 120 
TGAAAATTTC CCACTGAAAA TATGCAGTAA TATATTTTGT GGTTCAGACA TTTGGGGCAA 180 
ATGGTTCACA TTCATTTTAG GGTTAGTGGT CATGCTGTTT ATTTTTCTCT GCTATACAAA 240 
GTTCCTCTTA GGGGTCTGCC TCATGACACT AAAAAATGAA TAGAGATTCT ACTGTAGGTT 300 
ATCTCCTAGG CTTGAGTTCA ACATTTGTTT GGATTTTTGA AGAAAGTCAA ATCAAGCAAT 360 
GCTCCCAAAT GATGTCTTTG TAAATTCATA CCCTCTGGCC CTATTTTTTT TCATAGACCC 420 
TAACTCTACC TTTCTGCTTT AAAGCAAAGT AAACTCGGTG GCCTCTTCTT CTCCACCCCT 480 
CAAAATGATA GCAATCTCTG CCGTCAGCAG TGCACTCCTG TTCTCCCTTC TCTGTGAAGC 540 
AAGTACCGTC GTCCTACTCA ATTCCACTGA CTCATCCCCG CCAACCAATA ATTTCACTGA 600 
TATTGAAGCA GCTCTGAAAG CACAATTAGA TTCAGCGGAT ATCCCCAAAG CCAGGCGGAA 660 
GCGCTACATT TCGCAGAATG ACATGATCGC CATTCTTGAT TATCATAATC AAGTTCGGGG 720 
CAAAGTGTTC CCACCGGCAG CAAATATGGA ATATATGGTT TGGGATGAAA ATCTTGCAAA 780 
ATCGGCAGAG GCTTGGGCGG CTACTTGCAT TTGGGACCAT GGACCTTCTT ACTTACTGAG 840 
ATTTTTGGGC CAAAATCTAT CTGTACGCAC TGGAAGATAT CGCTCTATTC TCCAGTTGGT 900 
CAAGCCATGG TATGATGAAG TGAAAGATTA TGCTTTTCCA TATCCCCAGG ATTGCAACCC 960 
CAGATGTCCT ATGAGATGTT TTGGTCCCAT GTGCACACAT TATACGCAGA TGGTTTGGGC 1020 
CACTTCCAAT CGGATAGGAT GCGCAATTCA TGCTTGCCAA AACATGAATG TTTGGGGATC 1080 
TGTGTGGCGA CGTGCAGTTT ACTTGGTATG CAACTATGCC CCAAAGGGCA ATTGGATTGG 1140 
AGAAGCACCA TATAAAGTAG GGGTACCATG TTCATCTTGT CCTCCAAGTT ATGGGGGATC 1200 
TTGTACTGAC AATCTGTGTT TTCCAGGAGT TACGTCAAAC TACCTGTACT GGTTTAAATA 1260 
AGTTTACCTT TTCCTCCAGG AAATATAATG ATTTCTGGGA ACATGGGCAT GTATATATAT 1320 
ATATGGAGAG AGAATTTTGC ACATATTATA CATATTTTGT GCTAATCTTG TTTTCCTCTT 1380 
AGTATTCCTT TGTATAAATT AGTGTTTGTC TAGCATGTTT GTTTAATCCT TTGGGAATTC 

Seq ID NO: 124 Protein sequence: 
Protein Accession #: NP_056970.1 

1 11 21 31 41 51 

MIAISAVSSA LLFSLLCEAS TVVLLNSTDS SPPTNNFTDI EAALKAQLDS ADIPKARRKR 60 
YISQNDMIAI LDYHNQVRGK VFPPAANMEY MVWDENLAKS AEAWAATCIW DHGPSYLLRF 120 
LGQNLSVRTG RYRSILQLVK PWYDEVKDYA FPYPQDCNPR CPMRCFGPMC THYTQMVWAT 180 
SNRIGCAIHA CQNMNVWGSV WRRAVYLVCN YAPKGNWIGE APYKVGVPCS SCPPSYGGSC 240 
TDNLCFPGVT SNYLYWFK 



Seq ID NO: 125 DNA sequence 

Nucleic Acid Accession #: NM_001793 

Coding sequence: 54-2543 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

GCGGAACACC GGCCCGCCGT CGCGGCAGCT GCTTCACCCC TCTCTCTGCA GCCATGGGGC 60 
TCCCTCGTGG ACCTCTCGCG TCTCTCCTCC TTCTCCAGGT TTGCTGGCTG CAGTGCGCGG 120 
CCTCCGAGCC GTGCCGGGCG GTCTTCAGGG AGGCTGAAGT GACCTTGGAG GCGGGAGGCG 180 
CGGAGCAGGA GCCCGGCCAG GCGCTGGGGA AAGTATTCAT GGGCTGCCCT GGGCAAGAGC 240 

CAGCTCTGTT TAGCACTGAT AATGATGACT TCACTGTGCG GAATGGCGAG ACAGTCCAGG 300 
AAAGAAGGTC ACTGAAGGAA AGGAATCCAT TGAAGATCTT CCCATCCAAA CGTATCTTAC 360 
GAAGACACAA GAGAGATTGG GTGGTTGCTC CAATATCTGT CCCTGAAAAT GGCAAGGGTC 420 
CCTTCCCCCA GAGACTGAAT CAGCTCAAGT CTAATAAAGA TAGAGACACC AAGATTTTCT 480 
ACAGCATCAC GGGGCCGGGG GCAGACAGCC CCCCTGAGGG TGTCTTCGCT GTAGAGAAGG 540 
AGACAGGCTG GTTGTTGTTG AATAAGCCAC TGGACCGGGA GGAGATTGCC AAGTATGAGC 600 
TCTTTGGCCA CGCTGTGTCA GAGAATGGTG CCTCAGTGGA GGACCCCATG AACATCTCCA 660 
TCATCGTGAC CGACCAGAAT GACCACAAGC CCAAGTTTAC CCAGGACACC TTCCGAGGGA 720 
GTGTCTTAGA GGGAGTCCTA CCAGGTACTT CTGTGATGCA GGTGACAGCC ACAGATGAGG 780 
ATGATGCCAT CTACACCTAC AATGGGGTGG TTGCTTACTC CATCCATAGC CAAGAACCAA 840 
AGGACCCACA CGACCTCATG TTCACAATTC ACCGGAGCAC AGGCACCATC AGCGTCATCT 900 
CCAGTGGCCT GGACCGGGAA AAAGTCCCTG AGTACACACT GACCATCCAG GCCACAGACA 960 
TGGATGGGGA CGGCTCCACC ACCACGGCAG TGGCAGTAGT GGAGATCCTT GATGCCAATG 1020 
ACAATGCTCC CATGTTTGAC CCCCAGAAGT ACGAGGCCCA TGTGCCTGAG AATGCAGTGG 1080 
GCCATGAGGT GCAGAGGCTG ACGGTCACTG ATCTGGACGC CCCCAACTCA CCAGCGTGGC 1140 
GTGCCACCTA CCTTATCATG GGCGGTGACG ACGGGGACCA TTTTACCATC ACCACCCACC 1200 
CTGAGAGCAA CCAGGGCATC CTGACAACCA GGAAGGGTTT GGATTTTGAG GCCAAAAACC 1260 
AGCACACCCT GTACGTTGAA GTGACCAACG AGGCCCCTTT TGTGCTGAAG CTCCCAACCT 1320 
CCACAGCCAC CATAGTGGTC CACGTGGAGG ATGTGAATGA GGCACCTGTG TTTGTCCCAC 1380 
CCTCCAAAGT CGTTGAGGTC CAGGAGGGCA TCCCCACTGG GGAGCCTGTG TGTGTCTACA 1440 
CTGCAGAAGA CCCTGACAAG GAGAATCAAA AGATCAGCTA CCGCATCCTG AGAGACCCAG 1500 
CAGGGTGGCT AGCCATGGAC CCAGACAGTG GGCAGGTCAC AGCTGTGGGC ACCCTCGACC 1560 
GTGAGGATGA GCAGTTTGTG AGGAACAACA TCTATGAAGT CATGGTCTTG GCCATGGACA 1620 
ATGGAAGCCC TCCCACCACT GGCACGGGAA CCCTTCTGCT AACACTGATT GATGTCAACG 1680 
ACCATGGCCC AGTCCCTGAG CCCCGTCAGA TCACCATCTG CAACCAAAGC CCTGTGCGCC 1740 
ACGTGCTGAA CATCACGGAC AAGGACCTGT CTCCCCACAC CTCCCCTTTC CAGGCCCAGC 1800 
TCACAGATGA CTCAGACATC TACTGGACGG CAGAGGTCAA CGAGGAAGGT GACACAGTGG 1860 
TCTTGTCCCT GAAGAAGTTC CTGAAGCAGG ATACATATGA CGTGCACCTT TCTCTGTCTG 1920 
ACCATGGCAA CAAAGAGCAG CTGACGGTGA TCAGGGCCAC TGTGTGCGAC TGCCATGGCC 1980 
ATGTCGAAAC CTGCCCTGGA CCCTGGAAAG GAGGTTTCAT CCTCCCTGTG CTGGGGGCTG 2040 
TCCTGGCTCT GCTGTTCCTC CTGCTGGTGC TGCTTTTGTT GGTGAGAAAG AAGCGGAAGA 2100 
TCAAGGAGCC CCTCCTACTC CCAGAAGATG ACACCCGTGA CAACGTCTTC TACTATGGCG 2160 
AAGAGGGGGG TGGCGAAGAG GACCAGGACT ATGACATCAC CCAGCTCCAC CGAGGTCTGG 2220 
AGGCCAGGCC GGAGGTGGTT CTCCGCAATG ACGTGGCACC AACCATCATC CCGACACCCA 2280 
TGTACCGTCC TAGGCCAGCC AACCCAGATG AAATCGGCAA CTTTATAATT GAGAACCTGA 2340 
AGGCGGCTAA CACAGACCCC ACAGCCCCGC CCTACGACAC CCTCTTGGTG TTCGACTATG 2400 
AGGGCAGCGG CTCCGACGCC GCGTCCCTGA GCTCCCTCAC CTCCTCCGCC TCCGACCAAG 2460 
ACCAAGATTA CGATTATCTG AACGAGTGGG GCAGCCGCTT CAAGAAGCTG GCAGACATGT 2520 
ACGGTGGCGG GGAGGACGAC TAGGCGGCCT GCCTGCAGGG CTGGGGACCA AACGTCAGGC 2580 
CACAGAGCAT CTCCAAGGGG TCTCAGTTCC CCCTTCAGCT GAGGACTTCG GAGCTTGTCA 2640 
GGAAGTGGCC GTAGCAACTT GGCGGAGACA GGCTATGAGT CTGACGTTAG AGTGGTTGCT 2700 
TCCTTAGCCT TTCAGGATGG AGGAATGTGG GCAGTTTGAC TTCAGCACTG AAAACCTCTC 2760 
CACCTGGGCC AGGGTTGCCT CAGAGGCCAA GTTTCCAGAA GCCTCTTACC TGCCGTAAAA 2820 
TGCTCAACCC TGTGTCCTGG GCCTGGGCCT GCTGTGACTG ACCTACAGTG GACTTTCTCT 2880 
CTGGAATGGA ACCTTCTTAG GCCTCCTGGT GCAACTTAAT TTTTTTTTTT AATGCTATCT 2940 
TCAAAACGTT AGAGAAAGTT CTTCAAAAGT GCAGCCCAGA GCTGCTGGGC CCACTGGCCG 3000 
TCCTGCATTT CTGGTTTCCA GACCCCAATG CCTCCCATTC GGATGGATCT CTGCGTTTTT 3060 
ATACTGAGTG TGCCTAGGTT GCCCCTTATT TTTTATTTTC CCTGTTGCOT TOCTATAGAT 3120 
GAAGGGTGAG GACAATCGTG TATATGTACT AGAACTTTTT TATTAAAGAA A 






Seq ID NO: 126 Protein sequence: 
Protein Accession #: NP_001784 



1 11 21 31 41 51 

MGLPRGPLAS LLLLQVCWLQ CAASEPCRAV FREAEVTLEA GGAEQEPGQA LGKVFMGCPG 60 
QEPALFSTDN DDFTVRNGET VQERRSLKER NPLKIFPSKR ILRRHKRDWV VAPISVPENG 120 
KGPFPQRLNQ LKSNKDRDTK IFYSITGPGA DSPPEGVFAV EKETGWLLLN KPLDREEIAK 180 
YELFGHAVSE NGASVEDPMN ISIIVTDQND HKPKFTQDTF RGSVLEGVLP GTSVMQVTAT 240 
DEDDAIYTYN GVVAYSIHSQ EPKDPHDLMF TIHRSTGTIS VISSGLDREK VPEYTLTIQA 300 

TDMDGDGSTT TAVAVVEILD ANDNAPMFDP QKYEAHVPEN AVGHEVQRLT VTDLDAPNSP 360 
AWRATYLIMG GDDGDHFTIT THPESNQGIL TTRKGLDFEA KNQHTLYVEV TNEAPFVLKL 420 
PTSTATIVVH VEDVNEAPVF VPPSKVVEVQ EGIPTGEPVC VYTAEDPDKE NQKISYRILR 480 
DPAGWLAMDP DSGQVTAVGT LDREDEQFVR NNIYEVMVLA MDNGSPPTTG TGTLLLTLID 540 
VNDHGPVPEP RQITICNQSP VRHVLNITDK DLSPHTSPFQ AQLTDDSDIY WTAEVNEEGD 600 
TVVLSLKKFL KQDTYDVHLS LSDHGNKEQL TVIRATVCDC HGHVETCPGP WKGGFILPVL 660 
GAVLALLFLL LVLLLLVRKK RKIKEPLLLP EDDTRDNVFY YGEEGGGEED QDYDITQLHR 720 
GLEARPEVVL RNDVAPTIIP TPMYRPRPAN PDEIGNFIIE NLKAANTDPT APPYDTLLVF 780 
DYEGSGSDAA SLSSLTSSAS DQDQDYDYLN EWGSRFKKLA DMYGGGEDD 



Seq ID NO: 127 DNA sequence 

Nucleic Acid Accession #: NM_003256.1 

Coding sequence: 60-734 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

CCTGCTGGGG CCGTCCAGTC CCCCAGACCT CACAGGCTCA GTCGCGGATC TGCAGTGTCA 60 
TGCCTGGGAG CCCTCGGCCC GCGCCAAGCT GGGTGCTGTT GCTGCGGCTG CTGGCGTTGC 120 
TGCGGCCCCC GGGGCTGGGT GAGGCATGCA GCTGCGCCCC GGCGCACCCT CAGCAGCACA 180 
TCTGCCACTC GGCACTTGTG ATTCGGGCCA AAATCTCCAG TGAGAAGGTA GTTCCGGCCA 240 
GTGCAGACCC TGCTGACACT GAAAAAATGC TCCGGTATGA AATCAAACAG ATAAAGATGT 300 
TCAAAGGGTT TGAGAAAGTC AAGGATGTTC AGTATATCTA TACGCCTTTT GACTCTTCCC 360 
TCTGTGGTGT GAAACTAGAA GCCAACAGCC AGAAGCAGTA TCTCTTGACT GGTCAGGTCC 420 
TCAGTGATGG AAAAGTCTTC ATCCATCTGT GCAACTACAT CGAGCCCTGG GAGGACCTGT 480 
CCTTGGTGCA GAGGGAAAGT CTGAATCATC ACTACCATCT GAACTGTGGC TGCCAAATCA 540 
CCACCTGCTA CACAGTACCC TGTACCATCT CGGCCCCTAA CGAGTGCCTC TGGACAGACT 600 

GGCTGTTGGA ACGAAAGCTC TATGGTTACC AGGCTCAGCA TTATGTCTGT ATGAAGCATG 660 
TTGACGGCAC CTGCAGCTGG TACCGGGGCC ACCTGCCTCT CAGGAAGGAG TTTGTTGACA 720 
TCGTTCAGCC CTAGTAGGGA CCAGTGACCA TCACATCCCT TCAAGAGTCC TGAAGATCAA 780 
GCCAGTTCTC CTTCCCTGCA GAGCTTTGGC CATTACCACC TGACCTCTTG CTGCCAGCTA 840 
ATAAGAAGTG CCAAGTGGAC AGTCTGGCCA CTGTCAAGGC AGGGAAGGGG CCATGACTTT 900 

TCTGCCCTGC CCTCAGCCTG TTGCCCTGCC TCCCAAACCC CATTAGTCTA GCCTTGTAGC 960 
TGTTACTGCA AGTGTTTCTT CTGGCTTAGT CTGTTTTCTA AAGCCAGGAC TATTCCCTTT 1020 
CCTCCCCAGG AATATGTGTT TTCCTTTGTC TTAATCGATC TGGTAGGGGA GAAATGGCGA 1080 
ATGTCATACA CATGAGATGG TATATCCTTG CGATGTACAG AATCAGAAGG TGGTTTGACA 1140 
GCATCATAAA CAGGCTGACT GGCAGGAATG AAAAAAAAAA AAAAAAAAA 


Seq ID NO: 128 Protein sequence: 
Protein Accession #: NP_003247.1 

1 11 21 31 41 51 

MPGSPRPAPS WVLLLRLLAL LRPPGLGEAC SCAPAHPQQH ICHSALVIRA KISSEKVVPA 60 
SADPADTEKM LRYEIKQIKM FKGFEKVKDV QYIYTPFDSS LCGVKLEANS QKQYLLTGQV 120 
LSDGKVFIHL CNYIEPWEDL SLVQRESLNH HYHLNCGCQI TTCYTVPCTI SAPNECLWTD 180 
WLLERKLYGY QAQHYVCMKH VDGTCSWYRG HLPLRKEFVD IVQP 


Seq ID NO: 129 DNA sequence 

Nucleic Acid Accession #: NM_007207.2 

Coding sequence: 143-1591 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

CCACGCGTCC GCAATGAAGC CGAGTGAATG GGGGCTGAAT GTGCGAGTCC ATAGCTGAAG 60 
AGGAGCGCCA GATGGTGGAG GAATACACTT ATTTATGAAA CTGTCTTGAG TTCTTCTTGA 120 
ATTGCCAGTT TTCAGCCTCC TCATGCCTCC GTCTCCTTTA GACGACAGGG TAGTAGTGGC 180 
ACTATCTAGG CCCGTCCGAC CTCAGGATCT CAACCTTTGT TTAGACTCTA GTTACCTTGG 240 
CTCTGCCAAC CCAGGCAGTA ACAGCCACCC TCCTGTCATC GCCACCACCG TTGTGTCCCT 300 
CAAGGCTGCG AATCTGACGT ATATGCCCTC ATCCAGCGGC TCTGCCCGCT CGCTGAATTG 360 
TGGATGCAGC AGTGCCAGCT GCTGCACTGT GGCAACCTAC GACAAGGACA ATCAGGCCCA 420 
AACCCAAGCC ATTGCCGCTG GCACCACCAC CACTGCCATC GGAACCTCTA CCACCTGCCC 480 

TGCTAACCAG ATGGTCAACA ATAATGAGAA TACAGGCTCT CTAAGTCCAT CAAGTGGGGT 540 
GGGCAGCCCT GTGTCAGGGA CCCCCAAGCA GCTAGCCAGC ATCAAAATAA TCTACCCCAA 600 
TGACTTGGCA AAGAAGATGA CCAAATGCAG CAAGAGTCAC CTGCCGAGTC AGGGCCCTGT 660 
CATCATTGAC TGCAGGCCCT TCATGGAGTA CAACAAGAGT CACATCCAAG GAGCTGTCCA 720 
CATTAACTGT GCCGATAAGA TCAGCCGGCG GAGACTGCAG CAGGGCAAGA TCACTGTCCT 780 

AGACTTGATT TCCTGTAGGG AAGGCAAGGA CTCTTTCAAG AGGATCTTTT CCAAAGAAAT 840 
TATAGTTTAT GATGAGAATA CCAATGAACC AAGCCGAGTG ATGCCCTCCC AGCCACTTCA 900 
CATAGTCCTC GAGTCCCTGA AGAGAGAAGG CAAAGAACCT CTGGTGTTGA AAGGTGGACT 960 
TAGTAGTTTT AAGCAGAACC ATGAAAACCT CTGTGACAAC TCCCTCCAGC TCCAAGAGTG 1020 
CCGGGAGGTG GGGGGCGGCG CATCCGCGGC CTCGAGCTTG CTACCTCAGC CCATCCCCAC 1080 

CACCCCTGAC ATCGAGAACG CTGAGCTCAC CCCCATCTTG CCCTTCCTGT TCCTTGGCAA 1140 
TGAGCAGGAT GCTCAGGACC TGGACACCAT GCAGCGGCTG AACATCGGCT ACGTCATCAA 1200 
CGTCACCACT CATCTTCCCC TCTACCACTA TGAGAAAGGC CTGTTCAACT ACAAGCGGCT 1260 
GCCAGCCACT GACAGCAACA AGCAGAACCT GCGGCAGTAC TTTGAAGAGG CTTTTGAGTT 1320 
CATTGAGGAA GCTCACCAGT GTGGGAAGGG GCTTCTCATC CACTGCCAGG CTGGGGTGTC 1380 

CCGCTCCGCC ACCATCGTCA TCGCTTACTT GATGAAGCAC ACTCGGATGA CCATGACTGA 1440 
TGCTTATAAA TTTGTCAAAG GCAAACGACC AATTATCTCC CCAAACCTTA ACTTCATGGG 1500 
GCAGTTGCTA GAGTTCGAGG AAGACCTAAA CAACGGTGTG ACACCGAGAA TCCTTACACC 1560 
AAAGCTGATG GGCGTGGAGA CGGTTGTGTG ACAATGGTCT GOATGGAAAG GATTGCTGCT 1620 
CTCCATTAGG AGACAATGAG GAAGGAGGAT GGATTCTGGT TTTTTTTCTT TCTTTTTTTT 1680 

TTGTAGTTGG GAGTAAAGTT TGTGAATGGA AACAAACTTG GTTAAACACT TTATTTTTAA 1740 
CAAGTGTAAG AAGACTATAC TTTTGATGCC ATTGAGATTC ACCTTCCACA AACTGGCCAA 1800 
ATTAAGGAGG TTAAAGAAGT A A TI 11 11 1 1 ' AAGCCCAACC ATTAAAAATT TAATACAACT 1860 
TGGTTTCTCC CCCTTTTTCC TTTAAAGCTA NTTTGTAAAA GTTTATGAG 

Seq ID NO: 130 Protein sequence: 

Protein Accession #: NP_009138.1 



MPPSPLDDRV VVALSRPVRP QDLNLCLDSS YLGSANPGSN SHPPVIATTV VSLKAANLTY 60 
MPSSSGSARS LNCGCSSASC CTVATYDKDN QAQTQAIAAG TTTTAIGTST TCPANQMVNN 120 
NENTGSLSPS SGVGSPVSGT PKQLASIKII YPNDLAKKMT KCSKSHLPSQ GPVIIDCRPF 180 



MEYNKSHIQG AVHINCADKI SRRRLQQGKI TVLDLISCRE GKDSFKRIFS KEIIVYDENT 240 
NEPSRVMPSQ PLHIVLESLK REGKEPLVLK GGLSSFKQNH ENLCDNSLQL QECREVGGGA 300 
SAASSLLPQP IPTTPDIENA ELTPILPFLF LGNEQDAQDL DTMQRLNIGY VINVTTHLPL 360 
YHYEKGLFNY KRLPATDSNK QNLRQYFEEA FEFIEEAHQC GKGLLIHCQA GVSRSATIVI 420 
AYLMKHTRMT MTDAYKFVKG KRPIISPNLN FMGQLLEFEE DLNNGVTPRI LTPKLMGVET 480 
VV 

Seq ID NO: 131 DNA sequence 
Nucleic Acid Accession #: NM_005409.3 

Coding sequence: 94-378 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

WcCTTTCAT GTTCAGCATT TCTACTCCTT CCAAGAAGAG CAGCAAAGCT GAAGTAGCAG 60 
CAACAGCACC AGCAGCAACA GCAAAAAACA AACATGAGTG TGAAGGGCAT GGCTATAGCC 120 
TTGGCTGTGA TATTGTGTGC TACAGTTGTT CAAGGCTTCC CCATGTTCAA AAGAGGACGC 180 
TGTCTTTGCA TAGGCCCTGG GGTAAAAGCA GTGAAAGTGG CAGATATTGA GAAAGCCTCC 240 
ATAATGTACC CAAGTAACAA CTGTGACAAA ATAGAAGTGA TTATTACCCT GAAAGAAAAT 300 
AAAGGACAAC GATGCCTAAA TCCCAAATCG AAGCAAGCAA GGCTTATAAT CAAAAAAGTT 360 

GAAAGAAAGA ATTTTTAAAA ATATCAAAAC ATATGAAGTC CTGGAAAAGG GCATCTGAAA 420 
AACCTAGAAC AAGTTTAACT GTGACTACTG AAATGACAAG AATTCTACAG TAGGAAACTG 480 
AGACTTTTCT ATGGTTTTGT GACTTTCAAC TTTTGTACAG TTATGTGAAG GATGAAAGGT 540 
GGGTGAAAGG ACCAAAAACA GAAATACAGT CTTCCTGAAT GAATGACAAT CAGAATTCCA 600 
CTGCCCAAAG GAGTCCAGCA ATTAAATGGA TTTCTAGGAA AAGCTACCTT AAGAAAGGCT 660 

GGTTACCATC GGAGTTTACA AAGTGCTTTC ACGTTCTTAC TTGTTGTATT ATACATTCAT 720 
GCATTTCTAG GCTAGAGAAC CTTCTAGATT TGATGCTTAC AACTATTCTG TTGTGACTAT 780 
GAGAACATTT CTGTCTCTAG AAGTTATCTG TCTGTATTGA TCTTTATGCT ATATTACTAT 840 
CTGTGGTTAC AGTGGAGACA TTGACATTAT TACTGGAGTC AAGCCCTTAT AAGTCAAAAG 900 
CATCTATGTG TCGTAAAGCA TTCCTCAAAC ATTTTTTCAT GCAAATACAC ACTTCTTTCC 960 

CCAAATATCA TGTAGCACAT CAATATGTAG GGAAACATTC TTATGCATCA TTTGGTTTGT 1020 
TTTATAACCA ATTCATTAAA TGTAATTCAT AAAATGTACT ATGAAAAAAA TTATACGCTA 1080 
TGGGATACTG GCAACAGTGC ACATATTTCA TAACCAAATT AGCAGCACCG GTCTTAATTT 1140 
GATGTTTTTC AACTTTTATT CATTGAGATG TTTTGAAGCA ATTAGGATAT GTGTGTTTAC 1200 
TGTACTTTTT GTTTTGATCC GTTTGTATAA ATGATAGCAA TATCTTGGAC ACATTTGAAA 1260 

TACAAAATGT TTTTGTCTAC CAAAGAAAAA TGTTGAAAAA TAAGCAAATG TATACCTAGC 1320 
AATCACTTTT ACTTTTTGTA ATTCTGTCTC TTAGAAAAAT ACATAATCTA ATCAATTTCT 1380 
TTGTTCATGC CTATATACTG TAAAATTTAG GTATACTCAA GACTAGTTTA AAGAATCAAA 1440 
GTCATTTTTT TCTCTAATAA ACTACCACAA CCTTTCTTTT TTAAAAAAAA AAA 

Seq ID NO: 132 Protein sequence: 

Protein Accession #: NP_005400.1 

1 11 21 31 41 51 

MSVKGMAIAL AVILCATVVQ GFPMFKRGRC LCIGPGVKAV KVADIEKASI MYPSNNCDKI 60 
EVIITLKENK GQRCLNPKSK QARLIIKKVE RKNF 

Seq ID NO: 133 DNA sequence 
Nucleic Acid Accession #: NM_012342 
Coding sequence: 373-1155 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

CTGGCGCGGG CGGGAGCTGC GGCGGATACC CTTGCGTGCT GTGGAGACCC TACTCTCTTC 60 

GCTGAGAACG GCCGCTAGCG GGGACTGAAG GCCGGGAGCC CACTCCOGAC CCGGGGCTAG 120 
CGTGCGTCCC TAGAGTCGAG CGGGGCAAGG GAGCCAGTGG CCGCCGACGG GGGACCGGGA 180 
AACTTTTCTG GGCTCCTGGA GAGCCCTGTA GCCGCGCTCC ATGCTCCGGC AGCGGCCCGA 240 
AACCCAGCCC CGCCGCTGAC GGAGCCCGCC GCTCCGGGCA GGGCCCATGC CCTGCGCGCT 300 
CCGGGGGTCG TAGCTGCCGC CGAGCCGGGG CTCCGGAAGC CGGCGGGGGC GCCGCGGCCG 360 

TGCGGGGCGT CAATGGATCG CCACTCCAGC TACATCTTCA TCTGGCTGCA GCTGGAGCTC 420 
TGCGCCATGG CCGTGCTGCT CACCAAAGGT GAAATTCGAT GCTACTGTGA TGCTGCCCAC 480 
TGTGTAGCCA CTGGTTATAT GTGTAAATCT GAGCTCAGCG CCTGCTTCTC TAGACTTCTT 540 
GATCCTCAGA ACTCAAATTC CCCACTCACC CATGGCTGCC TGGACTCTCT TGCAAGCACG 600 
ACAGACATCT GCCAAGCCAA ACAGGCCCGA AACCACTCTG GCACCACCAT ACCCACATTG 660 

GAATGCTGTC ATGAAGACAT GTGCAATTAC AGAGGGCTGC ACGATGTTCT CTCTCCTCCC 720 
AGGGGTGAGG CCTCAGGACA AGGAAACAGG TATCAGCATG ATGGTAGCAG AAACCTTATC 780 
ACCAAGGTGC AGGAGCTGAC TTCTTCCAAA GAGTTGTGGT TCCGGGCAGC GGTCATTGCC 840 
GTGCCCATTG CTGGAGGGCT GATTTTAGTG TTGCTTATTA TGTTGGCCCT GAGGATGCTT 900 
CGAAGTGAAA ATAAGAGGCT GCAGGATCAG CGGCAACAGA TGCTCTCCCG TTTGCACTAC 960 

AGCTTTCACG GACACCATTC CAAAAAGGGG CAGGTTGCAA AGTTAGACTT GGAATGCATG 1020 
GTGCCGGTCA GTGGGCACGA GAACTGCTGT CTGACCTGTG ATAAAATGAG ACAAGCAGAC 1080 
CTCAGCAACG ATAAGATCCT CTCGCTTGTT CACTGGGGCA TGTACAGTGG GCACGGGAAG 1140 
CTGGAATTCG TATGACGGAG TCTTATCTGA ACTACACTTA CTGAACAGCT TGAAGGCCTT 1200 
TTGAGTTCTG CTGGACAGGA GCACTTTATC TGAAGACAAA CTCATTTAAT CATCTTTGAG 1260 
AGACAAAATG ACCTCTGCAA ACAGAATCTT GGATATTTCT TCTGAAGGAT TATTTGCACA 1320 
GACTTAAATA CAGTTAAATG TGTTATTTGC TTTTAAAATT ATAAAAAGCA AAGAGAAGAC 1380 
TTTGTACACA CTGTCACCAG GGTTATTTGC ATCCAAGGGA GCTGGAATTG AGTACCTAAA 1440 
TAAACAAAAA TGTGCCCTAT GTAAGCTTCT ACATCTTGAT TTATTGTAAA GATTTAAAAG 1500 
AAATATATAT ATTTTGTCTG A 

Seq ID NO: 134 Protein sequence: 
Protein Accession #: NP_036474.1 

1 11 21 31 41 51 

MDRHSSYIFI WLQLELCAMA VLLTKGEIRC YCDAAHCVAT GYMCKSELSA CFSRLLDPQN 60 
SNSPLTHGCL DSLASTTDIC QAKQARNHSG TTIPTLECCH EDMCNYRGLH DVLSPPRGEA 120 
SGQGNRYQHD GSRNLITKVQ ELTSSKELWF RAAVIAVPIA GGLILVLLIM LALRMLRSEN 180 
KRLQDQRQQM LSRLHYSFHG HHSKKGQVAK LDLECMVPVS GHENCCLTCD KMRQADLSND 240 
KILSLVHWGM YSGHGKLEFV 

Seq ID NO: 135 DNA sequence 

Nucleic Acid Accession #: NM_001627.1 

Coding sequence: 64-1815 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

CGGGACGACG CCCCCTCCTG CGGCGTGGAC TCCGTCAGTG GCCCACCAAG AAGGAGGAGG 60 
AATATGGAAT CCAAGGGGGC CAGTTCCTGC CGTCTGCTCT TCTGCCTCTT GATCTCCGCC 120 
ACCGTCTTCA GGCCAGGCCT TGGATGGTAT ACTGTAAATT CAGCATATGG AGATACCATT 180 
ATCATACCTT GCCGACTTGA CGTACCTCAG AATCTCATGT TTGGCAAATG GAAATATGAA 240 
AAGCCCGATG GCTCCCCAGT ATTTATTGCC TTCAGATCCT CTACAAAGAA AAGTGTGCAG 300 
TACGACGATG TACCAGAATA CAAAGACAGA TTGAACCTCT CAGAAAACTA CACTTTGTCT 360 
ATCAGTAATG CAAGGATCAG TGATGAAAAG AGATTTGTGT GCATGCTAGT AACTGAGGAC 420 
AACGTGTTTG AGGCACCTAC AATAGTCAAG GTGTTCAAGC AACCATCTAA ACCTGAAATT 480 
GTAAGCAAAG CACTGTTTCT CGAAACAGAG CAGCTAAAAA AGTTGGGTGA CTGCATTTCA 540 
GAAGACAGTT ATCCAGATGG CAATATCACA TGGTACAGGA ATGGAAAAGT GCTACATCCC 600 
CTTGAAGGAG CGGTGGTCAT AATTTTTAAA AAGGAAATGG ACCCAGTGAC TCAGCTCTAT 660 
ACCATGACTT CCACCCTGGA GTACAAGACA ACCAAGGCTG ACATACAAAT GCCATTCACC 720 
TGCTCGGTGA CATATTATGG ACCATCTGGC CAGAAAACAA TTCATTCTGA ACAGGCAGTA 780 
TTTGATATTT ACTATCCTAC AGAGCAGGTG ACAATACAAG TGCTGCCACC AAAAAATGCC 840 
ATCAAAGAAG GGGATAACAT CACTCTTAAA TGCTTAGGGA ATGGCAACCC TCCCCCAGAG 900 
GAATTTTTGT TTTACTTACC AGGACAGCCC GAAGGAATAA GAAGCTCAAA TACTTACACA 960 
CTGATGGATG TGAGGCGCAA TGCAACAGGA GACTACAAGT GTTCCCTGAT AGACAAAAAA 1020 
AGCATGATTG CTTCAACAGC CATCACAGTT CACTATTTGG ATTTGTCCTT AAACCCAAGT 1080 
GGAGAAGTGA CTAGACAGAT TGGTGATGCC CTACCCGTGT CATGCACAAT ATCTGCTAGC 1140 
AGGAATGCAA CTGTGGTATG GATGAAAGAT AACATCAGGC TTCGATCTAG CCCGTCATTT 1200 
TCTAGTCTTC ATTATCAGGA TGCTGGAAAC TATGTCTGCG AAACTGCTCT GCAGGAGGTT 1260 
GAAGGACTAA AGAAAAGAGA GTCATTGACT CTCATTGTAG AAGGCAAACC TCAAATAAAA 1320 
ATGACAAAGA AAACTGATCC CAGTGGACTA TCTAAAACAA TAATCTGCCA TGTGGAAGGT 1380 
TTTCCAAAGC CAGCCATTCA GTGGACAATT ACTGGCAGTG GAAGCGTCAT AAACCAAACA 1440 
GAGGAATCTC CTTATATTAA TGGCAGGTAT TATAGTAAAA TTATCATTTC CCCTGAAGAG 1500 
AATGTTACAT TAACTTGCAC AGCAGAAAAC CAACTGGAGA GAACAGTAAA CTCCTTGAAT 1560 
GTCTCTGCTA TAAGTATTCC AGAACACGAT GAGGCAGACG AGATAAGTGA TGAAAACAGA 1620 
GAAAAGGTGA ATGACCAGGC AAAACTAATT GTGGGAATCG TTGTTGGTCT CCTCCTTGCT 1680 
GCCCTTGTTG CTGGTGTCGT CTACTGGCTG TACATGAAGA AGTCAAAGAC TGCATCAAAA 1740 
CATGTAAACA AGGACCTCGG TAATATGGAA GAAAACAAAA AGTTAGAAGA AAACAATCAC 1800 
AAAACTGAAG CCTAAGAGAG AAACTGTCCT AGTTGTCCAG AGATAAAAAT CATATAGACC 1860 
AATTGAAGCA TGAACGTGGA TTGTATTTAA GACATAAACA AAGACATTGA CAGCAATTCA 1920 
TGGTTCAAGT ATTAAGCAGT TCATTCTACC AAGCTGTCAC AGGTTTTCAG AGAATTATCT 1980 
CAAGTAAAAC AAATGAAATT TAATTACAAA CAATAAGAAC AAGTTTTGGC AGCCATGATA 2040 
ATAGGTCATA TGTTGTGTTT GGTTCAATTT TTTTTCCGTA AATGTCTGCA CTGAGGATTT 2100 
CTTTTTGGTT TGCCTTTTAT GTAAATTTTT TACGTAGCTA TTTTTATACA CTGTAAGCTT 2160 
TGTTCTGGGA GTTGCTGTTA ATCTGATGTA TAATGTAATG TTTTTATTTC AATTGTTTAT 2220 
ATGGATAATC TGAGCAGGTA CATTTCTGAT TCTGATTGCT ATCAGCAATG CCCCAAACTT 2280 
TCTCATAAGC ACCTAAAACC CAAAGGTGGC AGCTTGTGAA GATTGGGGAC ACTCATATTO 2340 
CCCTAATTAA AAACTGTGAT TTTTATCACA AGGGAGGGGA GGCCGAGAGT CAGACTGATA 2400 
GACACCATAG GAGCCGACTC TTTGATATGC CACCAGCGAA CTCTCAGAAA TAAATCACAG 2460 
ATGCATATAG ACACACATAC ATAATGGTAC TCCCAAACTG ACAATTTTAC CTATTCTGAA 2520 
AAAGACATAA AACAGAATT 

Seq ID NO: 136 Protein sequence: 

Protein Accession #: NP_001618.1 

1 11 21 31 41 51 

MESKGASSCR LLFCLLISAT VFRPGLGWYT VNSAYGDTII IPCRLDVPQN LMFGKWKYEK 60 
PDGSPVFIAF RSSTKKSVQY DDVPEYKDRL NLSENYTLSI SNARISDEKR FVCMLVTEDN 120 
VFEAPTIVKV FKQPSKPEIV SKALFLETEQ LKKLGDCISE DSYPDGNITW YRNGKVLHPL 180 
EGAVVIIFKK EMDPVTQLYT MTSTLEYKTT KADIQMPFTC SVTYYGPSGQ KTIHSEQAVF 240 
DIYYPTEQVT IQVLPPKNAI KEGDNITLKC LGNGNPPPEE FLFYLPGQPE GIRSSNTYTL 300 

MDVRRNATGD YKCSLIDKKS MIASTAITVH YLDLSLNPSG EVTRQIGDAL PVSCTISASR 360 
NATVVWMKDN IRLRSSPSFS SLHYQDAGNY VCETALQEVE GLKKRESLTL IVEGKPQIKM 420 
TKKTDPSGLS KTIICHVEGF PKPAIQWTIT GSGSVINQTE ESPYINGRYY SKIIISPEEN 480 
VTLTCTAENQ LERTVNSLNV SAISIPEHDE ADEISDENRE KVNDQAKLIV GIVVGLLLAA 540 
LVAGVVYWLY MKKSKTASKH VNKDLGNMEE NKKLEENNHK TEA 

Seq ID NO: 137 DNA sequence 

Nucleic Acid Accession #: XM_030559 

Coding sequence: 1-1119 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

ATCAACCGCA GCCACCGGCA CGGGGCGGGC AGCGGCTGCC TGGGCACTAT GGAGGTGAAG 60 
AGCAAGTTTG GAGCTGAATT TCGTCGGTTT TCGCTGGAAA GATCAAAACC TGGAAAATTT 120 
GAGGAGTTTT ATGGATTACT ACAACATGTT CATAAGATCC CCAATGTTGA <^TTrTGGTA 180 
GGCTATGCAG ACATCCATGG AGACTTACTA CCTATAAATA ATGATGATAA TTATCACAAA 240 
GCTGTTTCAA CGGCCAATCC ACTGCTTAGG ATATTTATAC AAAAGAAGGA AGAAGCAGAC 300 
TACAGTGCCT TTGGTACAGA CACGCTAATA AAGAAGAAGA ATGTTTTAAC CAACGTATTG 360 
CGTCCTGACA ACCATAGAAA AAAGCCACAT ATAGTCATTA GTATGCCCCA AGAOTTAGA 420 
CCTGTGTCTT CTATTATAGA CGTGGATATT CTCCCAGAAA CGCATCGTAG GGTACGTCTT 480 
TACAAATACG GCACGGAGAA ACCCCTAGGA TTCTACATCC GGGATGGCTC CACTGTCAGG 540 
GTAACACCAC ATGGCTTAGA AAAGGTTCCA GGGATCTTTA TATCCAGGCT TGTCCCAGGA 600 
GGTCTGGCTC AAAGTACAGG ACTATTAGCT GTTAATGATG AAGTTTTAGA AGTTAATGGC 660 
ATAGAAGTTT CAGGGAAGAG CCTTGATCAA GTAACAGACA TGATGATTGC AAATAGCCGT 720 
aI^StAACAGTGAG ACCGGCAAAC CAGAGGAATA ^™ G ™ G «^?™ G ™° 
ACTTCTGGCA GTTCCGGTCA GTCTACTGAT AACAGCCTTC TTGGCTACTC AJ^GCAGATT 840 
GAACCAAGCT TTGAGCCAGA GGATGAAGAC AGCGAAGAAG ATGACATTAT CATTGAAGAC 900 
AATGGAGTGC CACAGCAGAT TCCAAAAGCT GTTCCTAATA ™ 
ACACAGATAG AGCTAAGCTT TGAGTCTGGA CAGAATGGCT TTATTCCCTC TAATGAAGTG 1020 
AGC^AGCAG CCATAGCAAG GAGCTCAAAC ACGGAATTTO AAACACATGC TCCAGATCAA 1080 
AAACTCTTAG AAGAAGATGG AACAATCATA ACATTATGA 

Seq ID NO: 138 Protein sequence: 
Protein Accession #: XP_030559 

1 11 21 31 41 51 

INRSHRHGAG SGCLGTMEVK SKFGAEFRRF SLERSKPGKF EEFYGLLQHV ^n^ v , « 
GYADIHGDLL PINNDDNYHK AVSTANPLLR EFIQKKEEAD YSAFGTDTLI KKKNVLTNVL 120 
RPDNHRKKPH IVISMPQDFR PVSSIIDVDI LPETHRRVRL YKYGTEKPLG FYIRDGSSVR 180 
VTPHGLEKVP GIFISRLVPG GLAQSTGLLA VNDEVLEVNG IEVSGKSLDQ VTDMMIANSR 240 
NLIITVRPAN QRNNVVRNSR TSGSSGQSTD NSLLGYPQQI EPSFEPEDED SEEDDIIIED 300 
NGVPQQIPKA VPNTESLESL TQIELSFESG QNGFIPSNEV SLAAIASSSN TEFETHAPDQ 360 
KLLEEDGTII TL 
It is understood that the examples described above in no way serve to limit the 
true scope of this invention, but rather are presented for illustrative purposes. All 
publications, sequences of accession numbers, and patent applications cited in this 
specification are herein incorporated by reference as if each individual publication or patent 
application were specifically and individually indicated to be incorporated by reference. 
WHAT IS CLAIMED IS: 



1. A method of detecting a breast cancer-associated transcript in a cell 
from a patient, the method comprising contacting a biological sample from the patient with a 
polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence 
as shown in Tables 1-25. 

2. The method of claim 1, wherein the biological sample comprises 
isolated nucleic acids. 

3. The method of claim 2, wherein the nucleic acids are mRNA. 

4. The method of claim 2, further comprising the step of amplifying 
nucleic acids before the step of contacting the biological sample with the polynucleotide. 

5. The method of claim 1, wherein the polynucleotide comprises a 
sequence as shown in Tables 1-25. 

6. The method of claim 1, wherein the polynucleotide is immobilized on 
a solid surface. 

7. The method of claim 1, wherein the patient is undergoing a therapeutic 
regimen to treat breast cancer. 

8. The method of claim 1, wherein the patient is suspected of having 
breast cancer. 

9. An isolated nucleic acid molecule consisting of a polynucleotide 
sequence as shown in Tables 1-25. 

10. The nucleic acid molecule of claim 9, which is labeled. 

11. An expression vector comprising the nucleic acid of claim 9. 

12. A host cell comprising the expression vector of claim 11. 

13. An isolated polypeptide which is encoded by a nucleic acid molecule 
having polynucleotide sequence as shown in Tables 1-25. 

14. An antibody that specifically binds a polypeptide of claim 13. 

15. The antibody of claim 14, further conjugated to an effector component. 

16. The antibody of claim 15, wherein the effector component is a 
fluorescent label. 

17. The antibody of claim 15, wherein the effector component is a 
radioisotope or a cytotoxic chemical. 

18. The antibody of claim 15, which is an antibody fragment. 

19. The antibody of claim 15, which is a humanized antibody. 

20. A method of detecting a breast cancer cell in a biological sample from 
a patient, the method comprising contacting the biological sample with an antibody of claim 
14. 

21. The method of claim 20, wherein the antibody is further conjugated to 
an effector component. 

22. The method of claim 21, wherein the effector component is a 
fluorescent label. 

23. A method for identifying a compound that modulates a breast cancer-associated 
polypeptide, the method comprising the steps of: 
(i) contacting the compound with a breast cancer-associated polypeptide, the 
polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at least 
80% identical to a sequence as shown in Tables 1-25; and 
(ii) determining the functional effect of the compound upon the polypeptide. 

24. A drug screening assay comprising the steps of: 
(i) administering a test compound to a mammal having breast cancer or a cell 
isolated therefrom; 
(ii) comparing the level of gene expression of a polynucleotide that selectively 
hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1-25 in a 
treated cell or mammal with the level of gene expression of the polynucleotide in a control 
cell or mammal, wherein a test compound that modulates the level of expression of the 
polynucleotide is a candidate for the treatment of breast cancer. 